AI Research Lead

1mind

Not Interested
Bookmark
Report This Job

profile Job Location:

San Francisco, CA - USA

profile Monthly Salary: Not Disclosed
Posted on: 5 days ago
Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

About Us

1mind is a platform that deploys multimodal Superhumans for revenue teams. These Superhumans combine a face a voice and a GTM brain equipped with deep technical and product knowledge. They can lead unlimited simultaneous conversations 24/7 meeting buyers when theyre most active and engaged. Superhumans qualify leads book meetings deliver pitches give interactive demos handle objections uncover pain points build value models provide support and onboard customers. They live across websites inside your product can join live calls as active participants and work alongside your team in deal rooms. 1mind Superhumans integrate seamlessly into existing workflows scale instantly and drive measurable impact growing revenue reducing headcount accelerating pipeline to closed-won and creating a more delightful buyer experience.

Job Description

Were looking for an AI Research Lead to define and drive 1minds research agenda. This is one of the most important hires well make and the person who fills it will have an outsized impact on the trajectory of the company.

Youll lead exploratory research into vertical post-training for sales and GTM domains developing models that understand how humans sell buy and build relationships. Youll work directly with the CTO and have the freedom to shape the research direction build your own team and publish your work. This is applied research with active signal: our agents are live in customer environments today generating high-fidelity data from thousands of real buyer interactions.

If youve been doing cutting edge research post-training work at a frontier lab and want to build something that ships into production on a unique dataset that no one else has this is your role.

Key Responsibilities

  • Own and drive the post-training research roadmap for 1minds vertical AI models from exploration through production deployment.

  • Design and execute experiments on sales LLM fine-tuning copilot behavior modeling and domain-specific reinforcement learning.

  • Leverage 1minds live RL environment and high-fidelity reward signals from real-world agent interactions to train and iterate on models.

  • Develop novel post-training techniques RLHF DPO reward modeling and beyond tailored to GTM and conversational commerce use cases.

  • Collaborate cross-functionally with engineering product and GTM teams to translate research into measurable product improvements.

  • Build and lead the research org over time hiring mentoring and setting the technical bar for a world-class applied research team.

  • Evaluate new model architectures training strategies and inference optimizations for 1minds multimodal agent stack.

  • Publish research findings and contribute to open-source and open-weight model initiatives where appropriate.

Qualifications

Required

  • 4 years of experience in machine learning research or applied AI with at least 12 years focused on post-training (RLHF DPO reward modeling alignment or related techniques).

  • Deep technical fluency in LLM training pipelines fine-tuning methodologies and evaluation frameworks.

  • Demonstrated ability to take research from exploration to production-grade systems.

  • Strong product intuition ability to identify where research creates real business value and prioritize accordingly.

  • Based in or willing to relocate to San Francisco.

Preferred

  • 6 years of total experience; Staff Researcher/level or equivalent.

  • Experience at a frontier research lab

  • Experience leading research teams.

  • Familiarity with reinforcement learning from real-world feedback loops (not just simulated environments).

Why Join Us

  • Build post-training models no one else can. 1mind is the only company with the vertical GTM data and live agent interactions needed to train domain-specific models. You wont be fine-tuning on synthetic benchmarks youll be training on real sales conversations with real reward signals.

  • Live RL environment from day one. Our Superhumans are already operating in the wild generating detailed reward data from thousands of real buyer interactions. Youll have a production feedback loop most researchers only dream about.

  • Freedom to build. Define the research agenda choose the problems hire your team and shape the direction of a category-defining company.

  • Publishing and open source encouraged. We support publishing your work and contributing open-weight models. IP is evaluated case by case but the default is openness.

  • Competitive compensation. We offer aggressive market-leading compensation for this role including base salary equity and full benefits.

  • High-impact early-stage opportunity. Work directly with a world-class team at a Series A company backed by top investors with 50 enterprise customers like LinkedIn HubSpot Nutanix Samsara and Boston Dynamics.

Location

San Francisco CA. Visa sponsorship is available for exceptional candidates.

Employment Type

Full-time

1minds total compensation package is designed to be competitive and includes base salary equity and a full range of benefits and perks. Final compensation will depend on factors such as your skills experience qualifications and location and will be determined during the interview process. The hiring manager will share more details about the full compensation package and benefits as you move through the process.

Please note that all legitimate communication from 1mind will come only from email addresses ending in @. We will never ask for payment financial information or personal details outside of our official application process. If you receive a suspicious message please disregard it and alert us at

About Us1mind is a platform that deploys multimodal Superhumans for revenue teams. These Superhumans combine a face a voice and a GTM brain equipped with deep technical and product knowledge. They can lead unlimited simultaneous conversations 24/7 meeting buyers when theyre most active and engaged....
View more view more