This role requires experience with vision-language models and the ability to fine-tune, adapt, and distill multimodal LLMs. As a Machine Learning Research Engineer, you will help design and develop models and algorithms for multimodal perception and reasoning, leveraging Vision-Language Models (VLMs) and Multimodal Large Language Models (MLLMs). You will collaborate with expert researchers and engineers to explore new techniques, evaluate performance, and translate product needs into impactful ML solutions. Your work will contribute directly to user-facing features across billions of devices.
- Master's or Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, or a related field; or relevant proven experience
- Proficiency in Python and deep learning frameworks such as PyTorch
- Practical experience with training and evaluating neural networks
- Familiarity with multimodal learning, vision-language models, or large language models
- Strong problem-solving skills and the ability to work in a collaborative, product-focused environment
- Ability to communicate technical results clearly and concisely
- Proven track record of research contributions demonstrated through publications in top-tier conferences and journals
- Background in multimodal reasoning, VLM, and MLLM research, with impactful software projects
- Solid understanding of natural language processing (NLP) and computer vision fundamentals