3 years of experience as a technical lead, guiding teams through complex design decisions and setting high benchmarks for code quality, performance, and scalability
In-depth understanding of large language models (LLMs) and their application in AI-driven solutions, including inference, embeddings, and knowledge base integration (RAG) for improved data retrieval and contextualization
Hands-on experience designing and building GenAI platforms that allow users to create, configure, and deploy AI applications, supporting features like agent orchestration, prompt engineering, RAG integration, and model selection
Experience building AI agents capable of complex multi-step reasoning and tool usage, with a focus on reliability, traceability, and composability
Proven experience in fine-tuning and customizing foundation models to improve task-specific performance and domain alignment
Deep knowledge of LLM inference optimization techniques, including prompt tuning, caching, quantization, and latency reduction across different model families
Strong programming skills in Python, Java, or similar languages, with an emphasis on AI/ML systems development and platform engineering
Demonstrated ability to work cross-functionally and influence product development through a combination of technical leadership and user-centered thinking
Passion for operational excellence, automation, and delivering scalable, developer-friendly AI infrastructure
B.S., M.S., or PhD degree in Computer Science/Engineering, or equivalent work experience
Expertise in AWS Cloud
Hands-on experience using Kubernetes as an orchestration layer