Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailAbout the Role:
We are seeking a highly skilled Senior AI Engineer with deep expertise in Agentic frameworks Large Language Models (LLMs) RetrievalAugmented Generation (RAG) systems MLOps/LLMOps and endtoend GenAI application development. In this role you will design develop finetune deploy and optimize stateoftheart AI solutions across diverse enterprise use cases including AI Copilots Summarization Enterprise Search and Intelligent Tool Orchestration.
Key Responsibilities:
Develop and FineTune LLMs (e.g. GPT4 Claude LLaMA Mistral Gemini) using instruction tuning prompt engineering chainofthought prompting and finetuning techniques.
Build RAG Pipelines: Implement RetrievalAugmented Generation solutions leveraging embeddings chunking strategies and vector databases like FAISS Pinecone Weaviate and Qdrant.
Implement and Orchestrate Agents: Utilize frameworks like MCP OpenAI Agent SDK LangChain LlamaIndex Haystack and DSPy to build dynamic multiagent systems and serverless GenAI applications.
Deploy Models at Scale: Manage model deployment using HuggingFace Azure Web Apps vLLM and Ollama including handling local models with GGUF LoRA/QLoRA PEFT and Quantization methods.
Integrate APIs: Seamlessly integrate with APIs from OpenAI Anthropic Cohere Azure and other GenAI providers.
Ensure Security and Compliance: Implement guardrails perform PII redaction ensure secure deployments and monitor model performance using advanced observability tools.
Optimize and Monitor: Lead LLMOps practices focusing on performance monitoring cost optimization and model evaluation.
Work with AWS Services: Handson usage of AWS Bedrock SageMaker S3 Lambda API Gateway IAM CloudWatch and serverless computing to deploy and manage scalable AI solutions.
Contribute to Use Cases: Develop AIdriven solutions like AI copilots enterprise search engines summarizers and intelligent functioncalling systems.
Crossfunctional Collaboration: Work closely with product data and DevOps teams to deliver scalable and secure AI products.
Qualifications :
Required Skills and Experience:
46 years of experience in AI/ML roles focusing on LLM agent development data science workflows and system deployment.
Demonstrated experience in designing domainspecific AI systems and integrating structured/unstructured data into AI models.
Proficiency in designing scalable solutions using LangChain and vector databases.
Deep knowledge of LLMs and foundational models (GPT4 Claude Mistral LLaMA Gemini).
Strong expertise in Prompt Engineering ChainofThought reasoning and FineTuning methods.
Proven experience building RAG pipelines and working with modern vector stores (FAISS Pinecone Weaviate Qdrant).
Handson proficiency in LangChain LlamaIndex Haystack and DSPy frameworks.
Model deployment skills using HuggingFace vLLM Ollama and handling LoRA/QLoRA PEFT GGUF models.
Practical experience with AWS serverless services: Lambda S3 API Gateway IAM CloudWatch.
Strong coding ability in Python or similar programming languages.
Experience with MLOps/LLMOps for monitoring evaluation and cost management.
Familiarity with security standards: guardrails PII protection secure API interactions.
Use Case Delivery Experience: Proven record of delivering AI Copilots Summarization engines or Enterprise GenAI applications.
Additional Information :
Preferred Skills:
Experience in BPO or IT Outsourcing environments.
Knowledge of workforce management tools and CRM integrations.
Handson experience with AI technologies and their applications in data analytics.
Familiarity with Agile/Scrum methodologies.
Soft Skills:
Strong analytical and problemsolving capabilities.
Excellent communication and stakeholder management skills.
Ability to thrive in a fastpaced dynamic environment.
Remote Work :
No
Employment Type :
Fulltime
Full-time