ROLE OVERVIEW
We are seeking an AI Engineer to design and build applications leveraging Large Language Models (LLMs) and advanced Retrieval-Augmented Generation (RAG) techniques. The role involves prototyping, fine-tuning, and deploying AI-powered solutions while ensuring performance, scalability, and alignment with business objectives.
KEY RESPONSIBILITIES
- Develop, integrate, and optimize AI applications using LLM APIs (e.g., OpenAI, Anthropic).
- Implement RAG pipelines with vector databases and retrieval mechanisms.
- Design, test, and refine prompts for effective model performance.
- Integrate AI models with backend services and APIs.
- Collaborate with developers and product teams to bring AI solutions into production.
- Apply basic MLOps practices to ensure scalable deployment and monitoring.
QUALIFICATIONS
- Bachelor's or Master's degree in Computer Science, AI, Data Science, or a related field.
- 4 years of software development experience, including at least 2 years in AI/ML.
- Proficiency in Python and familiarity with frameworks (LangChain, LangGraph).
- Experience with vector databases (Pinecone, Weaviate, FAISS).
- Knowledge of RAG architectures, embeddings, and LLM optimization.
- Strong problem-solving, experimentation, and debugging skills.
PREFERRED SKILLS
- Experience with cloud AI platforms (Azure, AWS, GCP).
- Exposure to MLOps tools (MLflow, Kubeflow, Docker).
- Understanding of ethical AI, bias mitigation, and security practices.
Vertical
Technology