drjobs Research Engineer, Agentic AI Evals

Research Engineer, Agentic AI Evals

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

San Francisco, CA - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

About HUD

HUD (YC W25) is developing agentic evals for Computer Use Agents (CUAs) that browse the web. Our CUA Evals framework is the first comprehensive evaluation tool for CUAs.

Our Mission: People dont actually know if AI agents are working. To make AI agents work in the real world we need detailed evals for a huge range of tasks.

Were backed by Y Combinator and work closely with frontier AI labs to provide agent evaluation infrastructure at scale.

About the role

Were looking for a research engineer to help build out task configs and environments for evaluation datasets on HUDs CUA evaluation framework.

Responsibilities

Experience

Technical Skills

Strong candidates may have:

We prioritize technical aptitude and learning potential over years of experience. Motivated candidates are encouraged to apply even if they dont meet all criteria.

Representative projects:

We prioritise contributions that show quality and quantity such as building out large high-quality datasets. Imagine making about 10 small puzzles in mock web environments a day.

Team & Company Details

Logistics

Due to high volume we may not actively respond to every application but feel free to contact us at or elsewhere if we missed your application!


Required Experience:

Unclear Seniority

Employment Type

Full-Time

Department / Functional Area

Engineering

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.