Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailUSD 175000 - 250000
1 Vacancy
Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customized and developed by our expert team of lawyers engineers and research scientists. Weve found product market fit and are scaling our team very quickly. Some reasons to join Harvey are:
Exceptional product market fit: We have partnered with the largest law firms and professional service providers in the world including Paul Weiss A&O Shearman Ashurst OMelveny & Myers PwC KKR and many others.
Strategic investors: Raised over $500 million from strategic investors including Sequoia Google Ventures Kleiner Perkins and OpenAI.
World-class team: Harvey is hiring the best talent from DeepMind Google Brain Stripe FAIR Tesla Autopilot Glean Superhuman Figma and more.
Partnerships: Our engineers and researchers work directly with OpenAI to build the future of generative AI and redefine professional services.
Performance: 4x ARR in 2024.
Competitive compensation.
As a Software Engineer on the Site Reliability team at Harvey you will ensure the reliability scalability and performance of our legal AI platform. Youll join a high-leverage team that sits at the intersection of infrastructure and product owning the systems that keep our platform fast secure and always on. From scaling across 50 regions to automating mission-critical operations your work will ensure that Harvey remains resilient as we grow. If youre passionate about building robust systems and reducing complexity through automation wed love to work with you.
This role is based in San Francisco CA. We use an in-person work model and offer relocation assistance to new employees.
Design implement and manage monitoring alerting and infrastructure resources (compute storage networking) across 50 global regions
Lead incident management processes including postmortems root cause analyses and driving actionable improvements
Automate operational tasks and workflows building tools and processes for capacity planning graceful rollouts and safe data access to maintain high reliability and reduce manual intervention
Collaborate across teams to drive reliability security and compliance throughout the software lifecycle
Optimize infrastructure costs through strategic capacity planning and build-versus-buy decisions while maintaining system performance reliability and functionality.
3 years of experience in Site Reliability Engineering or similar roles supporting production environments
Expertise in infrastructure as code(IaC) tools (Pulumi Terraform CloudFormation etc.).
Deep familiarity with observability tools (Datadog Sentry etc.) and incident response practices (PagerDuty IncidentIO etc.)
Proficiency with cloud infrastructure platforms (Azure GCP AWS etc.)
Strong programming skills (Python Bash Go or similar languages)
Proven track record of diagnosing complex system problems and implementing durable solutions
Solid understanding of CI/CD Kubernetes containerization networking databases and cloud security principles
Excellent problem-solving skills meticulous attention to detail and a commitment to operational excellence
$175000 - $250000 USD
Harvey is an equal opportunity employer and does not discriminate on the basis of race gender sexual orientation gender identity/expression national origin disability age genetic information veteran status marital status pregnancy or related condition or any other basis protected by law.
We are committed to providing reasonable accommodations to applicants with disabilities and requests can be made by emailing .
Full-Time