Location: Remote
Employment Type: Full-time
Experience: 3 years
Job Summary
We are looking for an experienced AI/ML Engineer to research, develop, and optimize AI language models for text generation and semantic search. The ideal candidate will work on embedding models, inference optimization, and distributed AI training while ensuring performance, security, and scalability in an on-premises or cloud environment.
Key Responsibilities
Research, develop, and fine-tune AI language models for text generation.
Implement and optimize embedding models for semantic search and retrieval.
Work on distributed model training and inference optimization for efficient performance.
Collaborate with backend engineers to integrate AI models into the application.
Ensure AI model security, performance, and scalability in on-premises and cloud deployments.
Requirements
Hands-on experience with Generative AI and Large Language Models (LLMs).
Experience in setting up on-premises systems for LLM solutions.
Exposure to voice-enabled chatbots is mandatory.
Strong knowledge of Natural Language Processing (NLP), deep learning, and machine learning.
Experience with AI frameworks like TensorFlow, PyTorch, or similar.
Hands-on experience with vector databases (Elasticsearch, Weaviate, Milvus, FAISS).
Proficiency in deploying AI models using Docker, Kubernetes, and CI/CD pipelines.
Familiarity with RAG-based AI solutions, LangChain, and LlamaIndex is a plus.
Bachelor's/Master's degree in Computer Science, AI, Machine Learning, or a related field.
TENSORFLOW, MACHINE LEARNING, ELASTICSEARCH, PYTORCH, DEEP LEARNING, DOCKER, KUBERNETES