Job Description:
Design ML pipelines for experiment management, model management, feature management, and model retraining.
Design APIs for model inferencing at scale.
Proven expertise with MLflow, SageMaker, Vertex AI, and Azure AI.
LLM Serving and GPU Architecture:
Possess deep knowledge of GPU architectures.
Expertise in distributed training and serving of large language models.
Proficient in model-parallel and data-parallel training using frameworks like DeepSpeed, and in serving frameworks like vLLM.
Model Fine-Tuning and Optimization:
Demonstrate proven expertise in model fine-tuning and optimization techniques.
Improve the latency and accuracy of model results.
Reduce training and resource requirements for fine-tuning LLMs and LVMs.
DevOps and LLMOps Proficiency:
Proven expertise in DevOps and LLMOps practices.
Knowledgeable in Kubernetes, Docker, and container orchestration.
Deep understanding of LLM orchestration frameworks like Flowise, Langflow, and LangGraph.
Full-time