About HUD
HUD (YC W25) does RL and evaluations for frontier AI agents. Our agentic RL platform is used by frontier labs, Fortune 500 companies, and startups. We have grown revenue and raised funding from YC, A16Z, and other leading VCs to scale fast.
About the role
If you would like to be at the frontier of AI agent training, apply here.
You will work with leading foundational labs to train real production agents. You will get a lot of tokens to do this.
Responsibilities
Design and build core platform systems: post-training workflows, dataset pipelines, run orchestration, and execution infrastructure.
Own the Python SDK and the overall developer experience: clean APIs, sensible defaults, clear errors, and strong documentation.
Build evaluation pipelines that connect naturally to training loops: measure, create data, train, and re-evaluate.
Work with Docker, Linux, and cloud infrastructure to ensure reliable and reproducible environments across local development, CI, and production.
Talk directly with customers, understand their workflows, and turn messy real-world feedback into product improvements.
Skills and Experience
Strong production experience in Python. Comfortable working across the stack, including APIs, data systems, and frontend work when needed.
Real understanding of Docker and Linux environments.
Strong product instincts and a bias toward shipping. You build products, not isolated features.
Ability to design APIs and interfaces that age well. You care about ergonomics, correctness, and developer experience.
Cloud competence. Familiarity with Kubernetes and AWS fundamentals such as compute, networking, and storage.
Comfort working with AI coding tools and agentic workflows. You move quickly without sacrificing rigor.
Strong candidates may also have:
Experience with reinforcement learning, post-training, or model training workflows.
Experience building or using LLM or agent evaluation frameworks such as Inspect, EleutherAI tooling, or custom harnesses.
Experience designing SDKs, CLIs, or developer platforms.
Kubernetes experience, including deployment, scaling, or job orchestration.
Active participation in the ML or open-source community.
Startup experience at early-stage companies, with the ability to work independently.
We prioritize technical ability and learning speed over years of experience. If you have built impressive things (open-source contributions, side projects, research code, or production systems), we want to see them.
Team and Company Details
Team Size: Approximately 15 people currently, mostly full-time in person, with some remote.
Our team: Includes four international Olympiad medallists across IOI, ILO, and IPhO; serial AI startup founders; and researchers with publications at ICLR, NeurIPS, and similar venues.
Company stage: We have raised tens of millions in venture funding and have strong revenue growth. We are scaling quickly and profitably to meet demand.
Logistics
Employment: Full-time.
Location: On-site only for now. You can join the team in the San Francisco Bay Area or Singapore offices.
Visa Sponsorship: We support relocation and visas for strong candidates to the United States or Singapore.
Timeline: Applications are rolling. The process includes two technical interviews and a one-week work trial.
Perks: Unlimited access to tokens.
Due to high volume, we may not respond to every application, but feel free to contact us if we missed yours.
Required Experience:
IC