Site Reliability Engineer - FullStack
We're looking for a builder-minded Site Reliability Engineer - FullStack to solve our customers' most complex problems in RAG/AI applications, recommendation/personalization systems, and AI-powered search. You'll work directly with enterprise prospects in the Bay Area, translating their complex technical requirements into compelling Vespa solutions that drive adoption and growth.
About Us:
We are a team of passionate builders. We maintain and develop the Apache 2.0 licensed open-source project Vespa. Vespa lets our users run big data AI online, at any scale, with unbeatable performance.
Vespa is a fully featured search engine and vector database. It supports vector search (ANN), lexical search, and search in structured data, all in the same query. Integrated machine-learned model inference allows you to apply AI to make sense of data in real time. Together with Vespa's proven scaling and high availability, this empowers users to create production-ready search applications at any scale and with any combination of features. Our users and customers are #1 in e-commerce, content, and financial services globally, and Vespa is used by companies like Spotify, Yahoo, Wix, and many more.
In addition to our open-source platform, we run Vespa Cloud, a robust SaaS offering that allows businesses to harness the power of our technology with ease.
We are extremely focused on automating whatever we do to be able to grow fast with high quality. In all roles, we scale using technology, not simply larger teams. We take pride in being small, nimble, and the most productive.
Position Overview:
As a Site Reliability Engineer, you will play a crucial role in ensuring the availability, reliability, and performance of our SaaS platform for customers around the world. You will collaborate with cross-functional teams to design, implement, and maintain robust infrastructure solutions that meet our high availability requirements. The ideal candidate will have a strong background in system architecture and automation, a passion for optimizing and scaling systems, and proficiency in frontend development to bring full-stack capabilities to the team.
The Vespa Services Team develops the infrastructure for Vespa Cloud. This includes AWS/GCP/Azure automation using Terraform, as well as auth and security integration with services like Auth0 and Teleport. Other examples are billing integration with credit card providers, compliance automation, and custom maintainer modules with code written in Java.
This role covers both frontend and backend development. You'll work with ReactJS and TypeScript on the frontend, using Mantine UI for components and Playwright for testing. An interest in user-centered design is a plus, helping ensure the tools and services we build are intuitive and effective.
An ideal candidate dislikes doing things twice and automates using Java or scripts, with proper monitoring such as alerts, badges, and dashboards. Experience with monitoring and alerting tools such as Grafana and OpsGenie is a big plus.
Responsibilities:
System Architecture and Design:
Automation and Infrastructure as Code:
Monitoring and Incident Response:
Capacity Planning and Performance Optimization:
Security and Compliance:
User Experience and Design:
Collaboration and Documentation:
Qualifications:
Why Join Us:
If you are excited about the intersection of open source, search and recommendation systems, and AI integration, and have a genuine passion for quality and automation, we would love to hear from you! Apply now to join the Vespa Team and play a key role in shaping the future of our industry.
Note: We are an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We believe in fostering a collaborative environment where every team member has the opportunity to make a significant impact.
Full-Time