Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailOur client, a visionary leader in the technology sector, is dedicated to pushing the boundaries of Large Language Models (LLM). The client's dynamic team is at the forefront of LLM technology, focusing on various facets including:
Foundational model training
Web-scale data collection
Efficient inference strategies
Model alignment techniques
Model evaluation methodologies
The client's mission is to craft cutting-edge language generation technology, both for internal innovation and customer-centric applications. By doing so, they're driving the evolution of the next wave of AI-driven products.
The Opportunity
Our client is on the lookout for skilled Machine Learning (ML) Engineers who share their enthusiasm for expansive language models. Your expertise will play a pivotal role in shaping the state-of-the-art ALM stack.
In this position, responsibilities will span a range of tasks, including:
Architecting and developing the distributed training and data processing infrastructure
Innovating with advanced deep learning techniques to elevate the model quality
Exploring methodologies to enrich training data quality and quantity
Optimizing training and inference infrastructure to maximize hardware performance
What They're Looking For
To excel in this role, candidates should possess:
A robust background in deep learning, whether in industry or academia
A solid grasp of machine learning theory
Familiarity with a modern deep learning framework (such as TensorFlow, PyTorch, or JAX)
Proficiency in contemporary software engineering practices like CI/CD, version control, and unit testing
A genuine passion for large language models and generative AI
An unwavering commitment to precision and thoroughness in their work
Preferred Qualifications
Additional advantages include:
Experience with language models or similar NLP technologies
A track record of building and delivering products (not exclusively ML-related) in a dynamic startup-like environment
Strong engineering skills, including the development of large distributed systems or high-load web services
Notable open-source projects that showcase their engineering prowess
Full Time