drjobs ML Engineer Post-training

ML Engineer Post-training

Employer Active

1 Vacancy
The job posting is outdated and position may be filled
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

San Francisco, CA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

About Sesame

Sesame believes in a future where computers are lifelike with the ability to see hear and collaborate with us in ways that feel natural and human. With this vision were designing a new kind of computer focused on making voice companions part of our daily lives. Our team brings together founders from Oculus and Ubiquity6 alongside proven leaders from Meta Google and Apple with deep expertise spanning hardware and software. Join us in shaping a future where computers truly come alive.

About the Role
We are seeking a Machine Learning Engineer specializing in posttraining optimization to refine and enhance large language models (LLMs) and AI companions. This role focuses on alignment tuning preference optimization reinforcement learning (RLHF) and continual learning to create AI systems that are more conversational emotionally aware and adaptive to user interactions. You will work on making AI more intelligent personalized and capable of remembering and reasoning over longterm interactions ensuring alignment with human values and realworld applications.

Responsibilities:

  • Implement finetuning strategies such as alignment tuning preference modeling and reinforcement learning to enhance AI fluency coherence and alignment with user preferences.

  • Improve AIs ability to engage in empathetic contextaware conversations remember interactions and dynamically personalize responses.

  • Develop methods for LLMbased agents to reason plan and take actions making interactions more natural and goaldriven.

  • Track model drift bias emergence and performance degradation ensuring realworld robustness through continual adaptation.

  • Design feedback loops and adaptive learning strategies that allow models to refine their behavior based on implicit and explicit user feedback.

  • Optimize LLMs on domainspecific datasets enhancing contextual understanding and reducing hallucinations.

  • Work with AI researchers engineers and product teams to bring cuttingedge posttraining techniques into production.

  • Develop scalable methodologies for AI evaluation integrating human feedback and realworld testing to ensure continuous improvement.

Required Qualifications:

  • Strong expertise in machine learning reinforcement learning (RLHF PPO etc. and deep learning techniques.

  • Proficiency in Python PyTorch TensorFlow and modern ML frameworks.

  • Experience with LLM finetuning preference learning and alignment techniques.

  • Deep understanding of natural language processing (NLP) conversational AI and agentic AI systems.

  • Ability to analyze and mitigate biases ethical concerns and safety risks in AI systems.

  • Experience with deploying models in cloud environments (AWS Azure GCP) and optimizing ML pipelines for scalability.

  • Strong communication skills with the ability to work across research engineering and product teams.

Benefits:

  • 401k matching

  • 100 employerpaid health vision and dental benefits

  • Unlimited PTO and sick time

  • Flexible spending account matching (medical FSA)

Sesame is committed to a workplace where everyone feels valued respected and empowered. We welcome all qualified applicants embracing diversity in race gender identity orientation ability and more. We provide reasonable accommodations for applicants with disabilitiescontact for assistance.

Employment Type

Full-Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.