Senior AI Lead ML Engineer — LLM & Duplicate Detection Lead

BMW TechWorks India

Job Location:

Bengaluru - India

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

What awaits you/ Job Profile	We are seeking a Senior AI/Lead ML Engineer LLM & Duplicate Detection Lead to join our BMW Techworks teams of highly skilled specialists. You will architect and implement LLMbased pipelines for semantic understanding duplicate detection and structured information extraction driving innovation and production readiness across critical AI services. Our systems are primarily deployed on AWS (Bedrock ecosystem) and BMW-standard cloud platforms. This is a hands-on technical leadership role where you will influence architecture guide engineering direction and ensure operational excellence. If you are enthusiastic about cuttingedge advancements in Large Language Models semantic search and hybrid ML architectures this role offers an opportunity to make a significant impact.
What should you bring along	3 years of strong backend/ML engineering experience with at least 2 years focused on LLM/GenAI systems. Experience building LLM-driven semantic pipelines and retrieval workflows. Strong background in designing and implementing scalable backend systems. Experience with evaluation frameworks guardrails and reliability scoring for AI workflows. Experience with cloud deployments (preferably AWS) and strong understanding of production ML requirements. Ability to work cross-functionally drive end-to-end ownership and mentor junior engineers.
Must have technical skill	Backend & Core Engineering Advanced proficiency in Python Experience building production APIs using FastAPI Async programming Pydantic type-hints and performance tuning Strong grounding in system design LLM GenAI and Search Systems AWS Bedrock (Claude 3.5 Sonnet Titan Embeddings Reranker) Hybrid semantic search using OpenSearch (BM25 vector search) Embeddings similarity search reranking Prompt engineering and template design Building evaluation frameworks ML Pipelines & Cloud Services DynamoDB Neptune S3 Athena Step Functions Lambda CI/CD observability monitoring for ML systems Integration with enterprise data and security standards Architecture & ML System Design Designing hybrid reasoning pipelines (LLMs deterministic logic) Cost/performance optimization for LLM workloads
Good to have technical skills	Experience with MLOps / LLMOps Experience with multi-agent systems Experience with RAG pipelines & vector DBs Familiarity with vLLM TGI or fine-tuning frameworks

Required Experience:

Manager

What awaits you/ Job ProfileWe are seeking a Senior AI/Lead ML Engineer LLM & Duplicate Detection Lead to join our BMW Techworks teams of highly skilled specialists. You will architect and implement LLMbased pipelines for semantic understanding duplicate detection and structured information extract...

What awaits you/ Job Profile	We are seeking a Senior AI/Lead ML Engineer LLM & Duplicate Detection Lead to join our BMW Techworks teams of highly skilled specialists. You will architect and implement LLMbased pipelines for semantic understanding duplicate detection and structured information extraction driving innovation and production readiness across critical AI services. Our systems are primarily deployed on AWS (Bedrock ecosystem) and BMW-standard cloud platforms. This is a hands-on technical leadership role where you will influence architecture guide engineering direction and ensure operational excellence. If you are enthusiastic about cuttingedge advancements in Large Language Models semantic search and hybrid ML architectures this role offers an opportunity to make a significant impact.
What should you bring along	3 years of strong backend/ML engineering experience with at least 2 years focused on LLM/GenAI systems. Experience building LLM-driven semantic pipelines and retrieval workflows. Strong background in designing and implementing scalable backend systems. Experience with evaluation frameworks guardrails and reliability scoring for AI workflows. Experience with cloud deployments (preferably AWS) and strong understanding of production ML requirements. Ability to work cross-functionally drive end-to-end ownership and mentor junior engineers.
Must have technical skill	Backend & Core Engineering Advanced proficiency in Python Experience building production APIs using FastAPI Async programming Pydantic type-hints and performance tuning Strong grounding in system design LLM GenAI and Search Systems AWS Bedrock (Claude 3.5 Sonnet Titan Embeddings Reranker) Hybrid semantic search using OpenSearch (BM25 vector search) Embeddings similarity search reranking Prompt engineering and template design Building evaluation frameworks ML Pipelines & Cloud Services DynamoDB Neptune S3 Athena Step Functions Lambda CI/CD observability monitoring for ML systems Integration with enterprise data and security standards Architecture & ML System Design Designing hybrid reasoning pipelines (LLMs deterministic logic) Cost/performance optimization for LLM workloads
Good to have technical skills	Experience with MLOps / LLMOps Experience with multi-agent systems Experience with RAG pipelines & vector DBs Familiarity with vLLM TGI or fine-tuning frameworks