USD 175,000 - 250,000
1 Vacancy
At Harvey, we're transforming how legal and professional services operate, not incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we're reshaping how critical knowledge work gets done for decades to come.
This is a rare chance to help build a generational company at a true inflection point. With 500 customers in 50 countries, strong product-market fit, and world-class investor support, we're scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth (personal, professional, and financial) is unmatched.
Our team is sharp, motivated, and deeply committed to the mission. We move fast, operate with intensity, and take real ownership of the problems we tackle, from early thinking to long-term outcomes. We stay close to our customers, from leadership to engineers, and work together to solve real problems with urgency and care. If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.
At Harvey, the future of professional services is being written today, and we're just getting started.
As a Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. You'll join a high-leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50 regions to automating mission-critical operations, your work will ensure that Harvey remains resilient as we grow. If you're passionate about building robust systems and reducing complexity through automation, we'd love to work with you.
This role is based in San Francisco, CA. We use an in-person work model and offer relocation assistance to new employees.
Design, implement, and manage monitoring, alerting, and infrastructure resources (compute, storage, networking) across 50 global regions
Lead incident management processes, including postmortems, root cause analyses, and driving actionable improvements
Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention
Collaborate across teams to drive reliability, security, and compliance throughout the software lifecycle
Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions while maintaining system performance, reliability, and functionality
5 years of experience in Site Reliability Engineering or similar roles supporting production environments
Expertise in infrastructure as code (IaC) tools (Pulumi, Terraform, CloudFormation, etc.)
Deep familiarity with observability tools (Datadog, Sentry, etc.) and incident response practices (PagerDuty, IncidentIO, etc.)
Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.)
Strong programming skills (Python, Bash, Go, or similar languages)
Proven track record of diagnosing complex system problems and implementing durable solutions
Solid understanding of CI/CD, Kubernetes, containerization, networking, databases, and cloud security principles
Excellent problem-solving skills, meticulous attention to detail, and a commitment to operational excellence
$200,000 - $260,000 USD
Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing .
Required Experience:
Senior IC
Full-Time