Job Summary:
We are seeking a highly skilled and visionaryGenerative AI Architectto lead the design development and deployment of nextgeneration AI solutions leveraging large language models (LLMs) diffusion models and other generative technologies. This role requires deep technical expertise in machine learning software engineering and AI infrastructure as well as the ability to collaborate crossfunctionally to bring innovative solutions to life.
Key Responsibilities:
- Design endtoend architectures for generative AI solutions including model selection training pipelines data infrastructure APIs and deployment workflows.
- Evaluate finetune and integrate foundational models (e.g. OpenAI Anthropic LLaMA Mistral Claude etc.) for specific business use cases.
- Develop strategies for responsible AI usage including data governance model bias mitigation and ethical AI practices.
- Collaborate with product data and engineering teams to deliver scalable AI applications across various domains (e.g. content generation code generation image/video synthesis customer support automation).
- Lead the development of POCs prototypes and productiongrade solutions in cloud environments (AWS Azure GCP).
- Define best practices for model lifecycle management prompt engineering vector search (e.g. Pinecone FAISS) and retrievalaugmented generation (RAG).
- Stay current with advances in GenAI research and tools (e.g. LangChain Hugging Face Weights & Biases) and translate emerging technologies into business impact.
- Mentor junior AI engineers and contribute to building a highperforming AI team.
Required Qualifications:
- Bachelors or Masters degree in Computer Science Machine Learning or a related field. PhD is a plus.
- 7 years of experience in software engineering machine learning or AI architecture roles.
- Strong experience with LLMs NLP Transformers and generative models (e.g. GPT BERT T5 DALLE Stable Diffusion).
- Proficiency in Python and ML frameworks like TensorFlow PyTorch and Hugging Face Transformers.
- Experience designing ML workflows data pipelines and deploying AI models in production.
- Handson experience with vector databases prompt engineering and RAG frameworks.
- Deep understanding of cloud infrastructure containerization (Docker Kubernetes) and CI/CD for AI systems.
Preferred Qualifications:
- Experience with GenAI orchestration tools like LangChain LlamaIndex or Haystack.
- Exposure to enterprise AI use cases across industries like healthcare finance retail or telecom.
- Experience with model evaluation techniques monitoring and observability for LLMbased systems.
- Strong communication and leadership skills with the ability to influence stakeholders and drive AI strategy.
Required Experience:
Senior IC