Binance is a leading global blockchain ecosystem behind the worlds largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 250 million people in 100 countries for our industry-leading security user fund transparency trading engine speed deep liquidity and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education research payments institutional services Web3 features and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.
We are seeking a highly skilled professional to join our team focusing on advancing customer service scheduling optimization through innovative AI solutions. This role involves researching and implementing cutting-edge algorithms to enhance scheduling systems leveraging business domain knowledge to elevate the impact of AI products. The successful candidate will develop and refine Large Language Models (LLMs) to extract actionable insights improve business decision-making and optimize prompt design for more accurate outputs. Additionally the role includes creating scalable and robust LLM/RAG frameworks tailored to customer service scheduling fostering innovation and maintaining a competitive market edge.
Responsibilities:
Own the full LLM pipeline from data preparation to production real case usage.
Design iterate and optimize prompts (zero-/few-shot chain-of-thought tool-calling etc.) to maximize model utility and safety across products and languages.
Build and maintain Retrieval-Augmented Generation (RAG) QA/search systems that connect to multi-source knowledge bases.
Familiar with vLLM/SGLang inference architectures and have proven experience deploying and operating LLM services on multiGPU or cluster environments.
Design implement and operate multiagent LLM architectures (e.g. LangGraph CrewAI AutoGen) including task decomposition agent orchestration memory sharing and toolcalling workflows.
Develop evaluation pipelines (automatic metrics & human feedback) to measure prompt and model quality bias and hallucination rates.
Collaborate with product and CS teams to integrate AI models into conversational Chatbotin differentscenarios.
Track cutting-edge research author tech blogs and keep improve current architecture.
Qualifications:
Masters degree or higher in Computer Science Data Science or related field..
2 years of deep-learning/NLP experience including 1 year practical LLM work (SFT DPO RAG quantization inference optimization etc.).
Practical experience building and deploying multiagent LLM workflows with understanding of agentorchestrator patterns shared memory longhorizon planning and guardrail design.
Clean coding practices good English communication skills and a passion for rapid learning.
Excellent self-driven and ownership with good deliverables.
Eager to learn be curious about AI new technologies
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.