Vox AI is transforming the quick-service restaurant industry by pioneering voice-driven AI solutions tailored specifically for drive-thru automation and employee assistance. With a rapidly growing presence across multiple continents, we take pride in our pragmatic, innovative approach, leveraging AI to deliver seamless customer experiences and operational excellence. As we scale, we're seeking exceptional talent to join our Amsterdam-based team and help drive the next generation of voice technology.
We're looking for a passionate and experienced AI Engineer to specialize in Large Language Model (LLM) training and alignment. In this key role, you'll develop sophisticated training pipelines using reinforcement learning, supervised fine-tuning, and cutting-edge alignment techniques like DPO and ORPO. You'll create voice interaction systems that deliver natural, contextually aware customer conversations and build robust API integrations enabling seamless interactions between our AI and restaurant systems. Your work will directly impact model performance, safety, and user satisfaction, positioning Vox AI at the forefront of conversational AI for the hospitality sector.
Tasks
- Develop and optimize training pipelines incorporating reinforcement learning and supervised fine-tuning for LLM alignment
- Create and maintain voice interaction capabilities for conversational AI agents with natural language understanding
- Implement API integration frameworks allowing LLMs to interact with external systems and tools
- Build evaluation frameworks to measure model performance, alignment, and safety across different behaviors
- Develop inference optimization systems for low-latency model serving in production environments
- Create behavior-specific LoRA adapters for distinct use cases while maintaining a unified base model
- Implement monitoring systems for alignment drift detection in deployed agents
Requirements
- Master's degree in Computer Science, Machine Learning, Artificial Intelligence, or a related field
- Demonstrated experience building and optimizing LLM training pipelines for large-scale models
- Proven expertise in alignment techniques, including SFT, RLHF, DPO, and ORPO
- Strong experience with PEFT methods, particularly LoRA and QLoRA implementations
- Proficiency in developing and deploying multi-adapter architectures for different agent behaviors
- Experience with distributed training frameworks (DeepSpeed, FSDP, Megatron-LM)
- Knowledge of quantization techniques (FP8/INT8) for efficient model deployment
- Expertise in Python and deep learning frameworks such as PyTorch
- Experience with production ML systems and MLOps practices
- Knowledge of prompt engineering and instruction tuning methodologies
Preferred Qualifications
- PhD in Computer Science, Machine Learning, or a related field
- Experience developing multimodal models and systems combining text and audio modalities
- Knowledge of audio processing and voice-based AI systems
- Contributions to open-source LLM projects or research publications in NLP/ML
- Experience building commercial AI products with significant user adoption
Benefits
- Venture-funded and growing fast: this is your chance to join early and make an impact.
- Build cutting-edge conversational AI systems with real-world impact
- Work with a modern open-source technology stack
- Hybrid work: minimum 3 days/week in our Amsterdam office for high-impact collaboration.
- Equity included: we're building something big, and we want you to grow with us.
If you thrive in dynamic environments, enjoy tackling complex challenges, and want to shape the future of voice AI technology with a global impact, this role at Vox AI is your opportunity. Apply now.