Get to Know the Team
The AI Platform team empowers Grab teams to leverage advanced AI seamlessly and effectively. Were building cutting-edge tools and infrastructure to democratize AI capabilities accelerate innovation and enhance Grabs products and services at scale.
Get to Know the Role
As a Principal Machine Learning Engineer focused on Foundation Model Post-Training youll report into the Head of Engineering Machine Learning and Experimentation Platforms and work onsite in Grab One North Singapore office.
Youll be the technical anchor for aligning our large-scale foundation models with human intent and domain requirements. Youll architect pipelines using Supervised Fine-Tuning (SFT) and RLHF to transform raw base models into safe high-performance products for Grab. Youll also bridge deep learning research systems engineering and data strategy requiring a leader to drive technical direction and execute large-scale experiments.
The Critical Tasks You Will Perform
- Strategic Technical Leadership: Define and drive the roadmap for post-training strategies including SFT RLHF (PPO/DPO/GPRO) and instruction tuning to improve model alignment safety and reasoning capabilities.
- Pipeline Architecture: Design and implement robust scalable and distributed training pipelines using frameworks like PyTorch DeepSpeed Ray or Megatron-LM to handle models with billions of parameters.
- Data Strategy & Curation: Oversee the data engine for post-training; collaborate with data teams to design high-quality instruction sets manage human annotation workflows and implement automated data filtering/deduplication techniques.
- Evaluation & Benchmarking: Develop comprehensive evaluation suites (both automated benchmarks and human-in-the-loop protocols) to rigorously measure model performance hallucination rates and alignment drift.
- Optimization & Efficiency: Optimize training jobs for GPU utilization and cost-efficiency including quantization distillation LoRA/Q-LoRA implementation and memory optimization techniques.
- Cross-Functional Collaboration: Partner with multi-functional teams to translate user requirements into specific reward functions and fine-tuning objectives.
- Bridge Research and Engineering: Translate the latest AI research into robust scalable production-grade systems that drive tangible business outcomes.
- Mentorship: Provide technical mentorship foster innovation and inspire excellence across engineering research and product teams.
Qualifications :
The Must-Haves
- Proven Experience: At least 8 years of professional experience in Machine Learning with at least 3 years directly focused on NLP LLMs or Generative AI and at least 2 years in technical leadership mentorship or people management.
- Post-Training Expertise: Experience training Large Language Models (LLMs) specifically in post-training stages. Experience with RLHF (Reinforcement Learning from Human Feedback) DPO (Direct Preference Optimization) GRPO (Group Relative Policy Optimization) and SFT (Supervised Fine-Tuning).
- Distributed Systems Mastery: Hands-on experience with distributed training of massive models across multi-node GPU clusters (e.g. A100/H100 pods) using Kubernetes or Ray.
- Framework Proficiency: Expert-level fluency in Python and deep learning frameworks (PyTorch JAX). Familiarity with the Hugging Face ecosystem and training libraries like DeepSpeed Megatron-LM or FSDP.
- Data Intuition: Experience in dataset engineering including cleaning balancing and synthesizing high-quality instruction data. You have experience in large-scale data processing frameworks like Spark Ray or Dask.
- Mathematical Depth: Solid grasp of the underlying mathematics of Transformers optimization algorithms (AdamW Lion) and probability theory as it applies to language modelling.
Additional Information :
Life at Grab
We care about your well-being at Grab here are some of the global benefits we offer:
- We have your back with Term Life Insurance and comprehensive Medical Insurance.
- With GrabFlex create a benefits package that suits your needs and aspirations.
- Celebrate moments that matter in life with loved ones through Parental and Birthday leave and give back to your communities through Love-all-Serve-all (LASA) volunteering leave
- We have a confidential Grabber Assistance Programme to guide and uplift you and your loved ones through lifes challenges.
- Balancing personal commitments and lifes demands are made easier with our FlexWork arrangements such as differentiated hours
What We Stand For at Grab
We are committed to building an inclusive and equitable workplace that enables diverse Grabbers to grow and perform at their best. As an equal opportunity employer we consider all candidates fairly and equally regardless of nationality ethnicity religion age gender identity sexual orientation family commitments physical and mental impairments or disabilities and other attributes that make them unique.
Remote Work :
No
Employment Type :
Full-time
Get to Know the TeamThe AI Platform team empowers Grab teams to leverage advanced AI seamlessly and effectively. Were building cutting-edge tools and infrastructure to democratize AI capabilities accelerate innovation and enhance Grabs products and services at scale.Get to Know the RoleAs a Principa...
Get to Know the Team
The AI Platform team empowers Grab teams to leverage advanced AI seamlessly and effectively. Were building cutting-edge tools and infrastructure to democratize AI capabilities accelerate innovation and enhance Grabs products and services at scale.
Get to Know the Role
As a Principal Machine Learning Engineer focused on Foundation Model Post-Training youll report into the Head of Engineering Machine Learning and Experimentation Platforms and work onsite in Grab One North Singapore office.
Youll be the technical anchor for aligning our large-scale foundation models with human intent and domain requirements. Youll architect pipelines using Supervised Fine-Tuning (SFT) and RLHF to transform raw base models into safe high-performance products for Grab. Youll also bridge deep learning research systems engineering and data strategy requiring a leader to drive technical direction and execute large-scale experiments.
The Critical Tasks You Will Perform
- Strategic Technical Leadership: Define and drive the roadmap for post-training strategies including SFT RLHF (PPO/DPO/GPRO) and instruction tuning to improve model alignment safety and reasoning capabilities.
- Pipeline Architecture: Design and implement robust scalable and distributed training pipelines using frameworks like PyTorch DeepSpeed Ray or Megatron-LM to handle models with billions of parameters.
- Data Strategy & Curation: Oversee the data engine for post-training; collaborate with data teams to design high-quality instruction sets manage human annotation workflows and implement automated data filtering/deduplication techniques.
- Evaluation & Benchmarking: Develop comprehensive evaluation suites (both automated benchmarks and human-in-the-loop protocols) to rigorously measure model performance hallucination rates and alignment drift.
- Optimization & Efficiency: Optimize training jobs for GPU utilization and cost-efficiency including quantization distillation LoRA/Q-LoRA implementation and memory optimization techniques.
- Cross-Functional Collaboration: Partner with multi-functional teams to translate user requirements into specific reward functions and fine-tuning objectives.
- Bridge Research and Engineering: Translate the latest AI research into robust scalable production-grade systems that drive tangible business outcomes.
- Mentorship: Provide technical mentorship foster innovation and inspire excellence across engineering research and product teams.
Qualifications :
The Must-Haves
- Proven Experience: At least 8 years of professional experience in Machine Learning with at least 3 years directly focused on NLP LLMs or Generative AI and at least 2 years in technical leadership mentorship or people management.
- Post-Training Expertise: Experience training Large Language Models (LLMs) specifically in post-training stages. Experience with RLHF (Reinforcement Learning from Human Feedback) DPO (Direct Preference Optimization) GRPO (Group Relative Policy Optimization) and SFT (Supervised Fine-Tuning).
- Distributed Systems Mastery: Hands-on experience with distributed training of massive models across multi-node GPU clusters (e.g. A100/H100 pods) using Kubernetes or Ray.
- Framework Proficiency: Expert-level fluency in Python and deep learning frameworks (PyTorch JAX). Familiarity with the Hugging Face ecosystem and training libraries like DeepSpeed Megatron-LM or FSDP.
- Data Intuition: Experience in dataset engineering including cleaning balancing and synthesizing high-quality instruction data. You have experience in large-scale data processing frameworks like Spark Ray or Dask.
- Mathematical Depth: Solid grasp of the underlying mathematics of Transformers optimization algorithms (AdamW Lion) and probability theory as it applies to language modelling.
Additional Information :
Life at Grab
We care about your well-being at Grab here are some of the global benefits we offer:
- We have your back with Term Life Insurance and comprehensive Medical Insurance.
- With GrabFlex create a benefits package that suits your needs and aspirations.
- Celebrate moments that matter in life with loved ones through Parental and Birthday leave and give back to your communities through Love-all-Serve-all (LASA) volunteering leave
- We have a confidential Grabber Assistance Programme to guide and uplift you and your loved ones through lifes challenges.
- Balancing personal commitments and lifes demands are made easier with our FlexWork arrangements such as differentiated hours
What We Stand For at Grab
We are committed to building an inclusive and equitable workplace that enables diverse Grabbers to grow and perform at their best. As an equal opportunity employer we consider all candidates fairly and equally regardless of nationality ethnicity religion age gender identity sexual orientation family commitments physical and mental impairments or disabilities and other attributes that make them unique.
Remote Work :
No
Employment Type :
Full-time
View more
View less