About HUD: HUD (YC W25) is developing agentic evals for Computer Use Agents (CUAs) that browse the web. Our CUA Evals framework is the first comprehensive evaluation tool for CUAs. Our Mission: People don't actually know if AI agents are working. To make AI agents work in the real world w

Full Time

About HUD: HUD (YC W25) does RL and evaluations for frontier AI agents. Our HUD agentic RL platform is used by frontier labs, Fortune 500 companies, and startups. About the role: HUD trains frontier AI agents, and we want your vision. When you design how humans evaluate and train AI agents on


About HUD: HUD (YC W25) does RL and evaluations for frontier AI agents. Our HUD agentic RL platform is used by frontier labs, Fortune 500 companies, and startups. We have funding from YC, A16Z, and other leading VCs to scale fast. About the role: We're looking for an experienced senior infrastructure e
