Why Harvey
At Harvey were transforming how legal and professional services operate not incrementally but end-to-end. By combining frontier agentic AI an enterprise-grade platform and deep domain expertise were reshaping how critical knowledge work gets done for decades to come.
This is a rare chance to help build a generational company at a true inflection point. With 700 customers in 58 countries strong product-market fit and world-class investor support were scaling fast and defining a new category in real time. The work is ambitious the bar is high and the opportunity for growth personal professional and financial is unmatched.
Our team is sharp motivated and deeply committed to the mission. We move fast operate with intensity and take real ownership of the problems we tackle from early thinking to long-term outcomes. We stay close to our customers from leadership to engineers and work together to solve real problems with urgency and care. If you thrive in ambiguity push for excellence and want to help shape the future of work alongside others who raise the bar we invite you to build with us.
At Harvey the future of professional services is being written today and were just getting started.
Role Overview
Were looking for a technical systems-minded operator to build and scale the evaluation engine behind Harveys platform. As we expand globally ensuring our models behave reliably accurately and jurisdictionally correctly is mission-criticaland evaluation complexity is increasing 10x.
As a member of our Product Operations team youll work closely with Applied Legal Researchers Product Engineering AI Research and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. Youll create the workflows systems and tooling that make evaluation a first-class product capability at Harvey.
This is a high-ownership role for someone who thrives in ambiguity loves building structure from ambiguity and wants to help scale the evaluation infrastructure of a global AI company.
What Youll Do
Build and scale the systems that power model and product evaluations across Harvey
Embed evaluation workflows and readiness checkpoints into the product development lifecycle
Create the single source of truth for evaluation status results history and launch readiness
Turn Expert-designed evaluation methodologies into scalable repeatable operational processes
Manage relationships with human data vendors and ensure evaluation quality meets legal standards
Work with Engineering and Research to improve evaluation tooling automation and dashboards
Drive evaluation readiness for major product and model launches across geographies and jurisdictions
Document and operationalize evaluation governance as complexity increases
Help define how Harvey ensures model accuracy reliability and trust at global scale
What You Have
47 years in technical program management product operations research operations or evaluation/benchmarking roles
Experience working with ML/AI evaluations benchmarking frameworks or scientific workflows
Comfort with statistical methodologies and SQL or Python or similar tools to interpret evaluation data
Ability to work deeply with legal experts and operationalize complex evaluation methodologies
Strong cross-functional coordination skills across Product Engineering Research and data providers/vendors
High attention to detail and a bias toward clarity rigor and reproducibility
Ability to navigate extreme ambiguity and bring order to complex systems
Strong communication skills and comfort translating technical nuance for diverse stakeholders
Desire to do whatever it takes to make evaluation systems successfulfrom writing documentation to diagnosing pipeline issues
Bonus Points
Experience in legal tech or working with domain experts in regulated industries
Experience managing human data providers or human-in-the-loop evaluation pipelines
Background in ML research data quality management or evaluation science
Early employee at a hyper-growth startup
Experience at world-class product or platform operations orgs (ex: Stripe Ramp)
Compensation
$178500 - $210000 USD
Please find our CA applicant privacy notice here.
#LI-CL1
Harvey is an equal opportunity employer and does not discriminate on the basis of race gender sexual orientation gender identity/expression national origin disability age genetic information veteran status marital status pregnancy or related condition or any other basis protected by law.
We are committed to providing reasonable accommodations to applicants with disabilities and requests can be made by emailing
Required Experience:
Manager
Why HarveyAt Harvey were transforming how legal and professional services operate not incrementally but end-to-end. By combining frontier agentic AI an enterprise-grade platform and deep domain expertise were reshaping how critical knowledge work gets done for decades to come.This is a rare chance ...
Why Harvey
At Harvey were transforming how legal and professional services operate not incrementally but end-to-end. By combining frontier agentic AI an enterprise-grade platform and deep domain expertise were reshaping how critical knowledge work gets done for decades to come.
This is a rare chance to help build a generational company at a true inflection point. With 700 customers in 58 countries strong product-market fit and world-class investor support were scaling fast and defining a new category in real time. The work is ambitious the bar is high and the opportunity for growth personal professional and financial is unmatched.
Our team is sharp motivated and deeply committed to the mission. We move fast operate with intensity and take real ownership of the problems we tackle from early thinking to long-term outcomes. We stay close to our customers from leadership to engineers and work together to solve real problems with urgency and care. If you thrive in ambiguity push for excellence and want to help shape the future of work alongside others who raise the bar we invite you to build with us.
At Harvey the future of professional services is being written today and were just getting started.
Role Overview
Were looking for a technical systems-minded operator to build and scale the evaluation engine behind Harveys platform. As we expand globally ensuring our models behave reliably accurately and jurisdictionally correctly is mission-criticaland evaluation complexity is increasing 10x.
As a member of our Product Operations team youll work closely with Applied Legal Researchers Product Engineering AI Research and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. Youll create the workflows systems and tooling that make evaluation a first-class product capability at Harvey.
This is a high-ownership role for someone who thrives in ambiguity loves building structure from ambiguity and wants to help scale the evaluation infrastructure of a global AI company.
What Youll Do
Build and scale the systems that power model and product evaluations across Harvey
Embed evaluation workflows and readiness checkpoints into the product development lifecycle
Create the single source of truth for evaluation status results history and launch readiness
Turn Expert-designed evaluation methodologies into scalable repeatable operational processes
Manage relationships with human data vendors and ensure evaluation quality meets legal standards
Work with Engineering and Research to improve evaluation tooling automation and dashboards
Drive evaluation readiness for major product and model launches across geographies and jurisdictions
Document and operationalize evaluation governance as complexity increases
Help define how Harvey ensures model accuracy reliability and trust at global scale
What You Have
47 years in technical program management product operations research operations or evaluation/benchmarking roles
Experience working with ML/AI evaluations benchmarking frameworks or scientific workflows
Comfort with statistical methodologies and SQL or Python or similar tools to interpret evaluation data
Ability to work deeply with legal experts and operationalize complex evaluation methodologies
Strong cross-functional coordination skills across Product Engineering Research and data providers/vendors
High attention to detail and a bias toward clarity rigor and reproducibility
Ability to navigate extreme ambiguity and bring order to complex systems
Strong communication skills and comfort translating technical nuance for diverse stakeholders
Desire to do whatever it takes to make evaluation systems successfulfrom writing documentation to diagnosing pipeline issues
Bonus Points
Experience in legal tech or working with domain experts in regulated industries
Experience managing human data providers or human-in-the-loop evaluation pipelines
Background in ML research data quality management or evaluation science
Early employee at a hyper-growth startup
Experience at world-class product or platform operations orgs (ex: Stripe Ramp)
Compensation
$178500 - $210000 USD
Please find our CA applicant privacy notice here.
#LI-CL1
Harvey is an equal opportunity employer and does not discriminate on the basis of race gender sexual orientation gender identity/expression national origin disability age genetic information veteran status marital status pregnancy or related condition or any other basis protected by law.
We are committed to providing reasonable accommodations to applicants with disabilities and requests can be made by emailing
Required Experience:
Manager
View more
View less