What awaits you/ Job Profile
| We are seeking a Senior AI/Lead ML Engineer LLM & Duplicate Detection Lead to join our BMW Techworks teams of highly skilled specialists. You will architect and implement LLMbased pipelines for semantic understanding duplicate detection and structured information extraction driving innovation and production readiness across critical AI services. Our systems are primarily deployed on AWS (Bedrock ecosystem) and BMW-standard cloud platforms. This is a hands-on technical leadership role where you will influence architecture guide engineering direction and ensure operational excellence. If you are enthusiastic about cuttingedge advancements in Large Language Models semantic search and hybrid ML architectures this role offers an opportunity to make a significant impact. |
What should you bring along
| - 3 years of strong backend/ML engineering experience with at least 2 years focused on LLM/GenAI systems.
- Experience building LLM-driven semantic pipelines and retrieval workflows.
- Strong background in designing and implementing scalable backend systems.
- Experience with evaluation frameworks guardrails and reliability scoring for AI workflows.
- Experience with cloud deployments (preferably AWS) and strong understanding of production ML requirements.
- Ability to work cross-functionally drive end-to-end ownership and mentor junior engineers.
|
Must have technical skill | Backend & Core Engineering - Advanced proficiency in Python
- Experience building production APIs using FastAPI
- Async programming Pydantic type-hints and performance tuning
- Strong grounding in system design
LLM GenAI and Search Systems - AWS Bedrock (Claude 3.5 Sonnet Titan Embeddings Reranker)
- Hybrid semantic search using OpenSearch (BM25 vector search)
- Embeddings similarity search reranking
- Prompt engineering and template design
- Building evaluation frameworks
ML Pipelines & Cloud Services - DynamoDB Neptune S3 Athena
- Step Functions Lambda
- CI/CD observability monitoring for ML systems
- Integration with enterprise data and security standards
Architecture & ML System Design - Designing hybrid reasoning pipelines (LLMs deterministic logic)
- Cost/performance optimization for LLM workloads
|
Good to have technical skills | - Experience with MLOps / LLMOps
- Experience with multi-agent systems
- Experience with RAG pipelines & vector DBs
- Familiarity with vLLM TGI or fine-tuning frameworks
|
Required Experience:
Manager
What awaits you/ Job ProfileWe are seeking a Senior AI/Lead ML Engineer LLM & Duplicate Detection Lead to join our BMW Techworks teams of highly skilled specialists. You will architect and implement LLMbased pipelines for semantic understanding duplicate detection and structured information extract...
What awaits you/ Job Profile
| We are seeking a Senior AI/Lead ML Engineer LLM & Duplicate Detection Lead to join our BMW Techworks teams of highly skilled specialists. You will architect and implement LLMbased pipelines for semantic understanding duplicate detection and structured information extraction driving innovation and production readiness across critical AI services. Our systems are primarily deployed on AWS (Bedrock ecosystem) and BMW-standard cloud platforms. This is a hands-on technical leadership role where you will influence architecture guide engineering direction and ensure operational excellence. If you are enthusiastic about cuttingedge advancements in Large Language Models semantic search and hybrid ML architectures this role offers an opportunity to make a significant impact. |
What should you bring along
| - 3 years of strong backend/ML engineering experience with at least 2 years focused on LLM/GenAI systems.
- Experience building LLM-driven semantic pipelines and retrieval workflows.
- Strong background in designing and implementing scalable backend systems.
- Experience with evaluation frameworks guardrails and reliability scoring for AI workflows.
- Experience with cloud deployments (preferably AWS) and strong understanding of production ML requirements.
- Ability to work cross-functionally drive end-to-end ownership and mentor junior engineers.
|
Must have technical skill | Backend & Core Engineering - Advanced proficiency in Python
- Experience building production APIs using FastAPI
- Async programming Pydantic type-hints and performance tuning
- Strong grounding in system design
LLM GenAI and Search Systems - AWS Bedrock (Claude 3.5 Sonnet Titan Embeddings Reranker)
- Hybrid semantic search using OpenSearch (BM25 vector search)
- Embeddings similarity search reranking
- Prompt engineering and template design
- Building evaluation frameworks
ML Pipelines & Cloud Services - DynamoDB Neptune S3 Athena
- Step Functions Lambda
- CI/CD observability monitoring for ML systems
- Integration with enterprise data and security standards
Architecture & ML System Design - Designing hybrid reasoning pipelines (LLMs deterministic logic)
- Cost/performance optimization for LLM workloads
|
Good to have technical skills | - Experience with MLOps / LLMOps
- Experience with multi-agent systems
- Experience with RAG pipelines & vector DBs
- Familiarity with vLLM TGI or fine-tuning frameworks
|
Required Experience:
Manager
View more
View less