Founding Engineer (Full Stack, ML DevTools & Systems)

HUD

Job Location:

San Francisco, CA - USA

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

About HUD

HUD (YC W25) does RL and evaluations for frontier AI agents. Our HUD agentic RL platform is used by frontier labs, Fortune 500 companies, and startups. We have grown revenue and raised funding from YC, A16Z, and other leading VCs to scale fast.

About the role

If you would like to be at the frontier of AI agent training, apply here.

You will work with leading foundational labs to develop production agent training infrastructure. You will get a lot of tokens to do this.

Responsibilities

Design and build core platform systems: post-training workflows, dataset pipelines, run orchestration, and execution infrastructure.
Own the Python SDK and the overall developer experience: clean APIs, sensible defaults, clear errors, and strong documentation.
Build evaluation pipelines that connect naturally to training loops: measure, create data, train, and re-evaluate.
Work with Docker, Linux, and cloud infrastructure to ensure reliable and reproducible environments across local development, CI, and production.
Talk directly with customers, understand their workflows, and turn messy real-world feedback into product improvements.

Skills and Experience

Strong experience owning products end-to-end. Comfortable working across the stack, including APIs, data systems, and frontend work when needed.
Real understanding of Docker and Linux environments.
Ability to design APIs and interfaces that age well. You care about ergonomics, correctness, and developer experience.
Comfort working with AI coding tools and agentic workflows and tradeoffs between speed and technical

Strong candidates may also have:

Experience with reinforcement learning, post-training, or model training workflows.
Experience building or using LLM or agent evaluation frameworks such as Inspect, EleutherAI tooling, or custom harnesses.
Experience designing SDKs, CLIs, or developer platforms.
Kubernetes experience, including deployment, scaling, or job orchestration.
Startup experience at early-stage companies, with the ability to work independently.

We prioritize technical ability and learning speed over years of experience. If you have built impressive things (open source contributions, side projects, research code, or production systems), we want to see them.

Team and Company Details

Team Size: Approximately 15 people currently, mostly full-time in person with some remote.
Our team: Includes four international Olympiad medallists across IOI, ILO, and IPhO; serial AI startup founders; and researchers with publications at ICLR, NeurIPS, and similar venues.
Company stage: We have raised tens of millions in venture funding and have strong revenue growth. We are scaling quickly and profitably to meet demand.

Logistics

Employment: Full-time.
Location: On-site only for now. You can join the team in the San Francisco Bay Area or Singapore offices.
Visa Sponsorship: We support relocation and visas for strong candidates to the United States or Singapore.
Timeline: Applications are rolling. The process includes two technical interviews and a one-week work trial.

Unlimited access to tokens

You will have unlimited* access to API credits for providers such as OpenAI, Anthropic, Gemini, Cursor, and others. *No one on our token usage leaderboard has ever hit the limit, so we do not know what the limit actually is.

Due to high volume, we may not actively respond to every application, but feel free to contact us if we missed your application.
