The Role
We are seeking an experienced Staff Site Reliability Engineer to own and evolve our cloud infrastructure with a focus on scalable design operational excellence and system reliability.
The ideal candidate brings a strong production-engineering mindset and a deep commitment to observability resilience and well-instrumented distributed systems while holding a high bar for production readiness and believes no service should ship without meaningful telemetry and safeguards in place.
This role is critical to scaling the infrastructure that underpins our core data pipelines and directly enables our Machine Learning and Robotics engineering teams. If you enjoy tackling complex production challenges and building robust highly scalable systems this role offers significant scope and impact.
What you will do
- System Design & Operations: Design build and operate highly scalable reliable systems used by all Bedrock engineering teams.
- Cloud Infrastructure Ownership: Take full ownership of Bedrocks cloud infrastructure (AWS GCP Azure) ensuring best-in-class security performance and cost efficiency.
- Observability Stack: Design implement and maintain Bedrocks end-to-end observability stack (including monitoring logging and tracing).
- Production Excellence: Pave the road for production engineering by developing and implementing best practices for system reliability security on-call rotation and effective incident response.
- Performance & Cost Optimization: Continuously identify and implement improvements to enhance system performance and optimize cloud resource consumption.
What we are looking
- Reliability Passion: A deep passion for building and maintaining reliable fault-tolerant distributed systems.
- Cloud & IaC Expertise: Strong proficiency in major cloud platforms (such as AWS GCP or Azure) and Infrastructure as Code (IaC) tools like Terraform.
- Containerization & Orchestration: Proven experience with container technologies and orchestration platforms particularly Kubernetes.
- Observability: Hands-on experience with observability tools (e.g. Datadog Prometheus Splunk) and techniques.
- Technical Foundations: Strong understanding of distributed systems networking concepts database technologies and compute infrastructure.
- Security Best Practices: Strong understanding and experience implementing security best practices in cloud environments.
- Fast-Paced Environment: Ability to work in a fast-paced high-growth environment deal effectively with ambiguity and take decisive ownership of challenging problems.
Our roles are often flexible. If you dont fit all the criteria or are in another location (especially one where we have an office like SF of NY) please apply anyway! Wed love to consider you.
Join the team bringing advanced autonomy to the built world
At Bedrock weve assembled one of the most experienced autonomous technology teams in the industry with deep expertise scaling breakthroughs across transportation infrastructure and enterprise software. Our leaders helped put the first self-driving cars on public roads at Waymo scaled systems for Segments $3.2B acquisition and grew Uber Freight to $5B in revenue.
While others debate the future of AI were deploying it in the real world. Our systems are already installed on heavy machines across the country learning on real construction sites and working to reshape the earth with survey-grade precision and exceptional safety. This isnt a simulationits autonomous intelligence working on billion-dollar infrastructure projects.
In just over a year weve raised $80M put our equipment into the field and established partnerships with forward-thinking contractors who are integrating our technology into their operations. Were working quickly to close the gap between Americas surging demand for housing data centers manufacturing hubs and the construction industrys growing labor shortage.
Here algorithms meet steel-toed boots. Youll collaborate with both construction veterans and experienced engineers tackling problems where your work directly impacts how the physical world get built. If youre interested in applying cutting-edge technology to solve meaningful problems alongside a talented teamwed love to have you join us.
Required Experience:
Staff IC
The RoleWe are seeking an experienced Staff Site Reliability Engineer to own and evolve our cloud infrastructure with a focus on scalable design operational excellence and system reliability.The ideal candidate brings a strong production-engineering mindset and a deep commitment to observability res...
The Role
We are seeking an experienced Staff Site Reliability Engineer to own and evolve our cloud infrastructure with a focus on scalable design operational excellence and system reliability.
The ideal candidate brings a strong production-engineering mindset and a deep commitment to observability resilience and well-instrumented distributed systems while holding a high bar for production readiness and believes no service should ship without meaningful telemetry and safeguards in place.
This role is critical to scaling the infrastructure that underpins our core data pipelines and directly enables our Machine Learning and Robotics engineering teams. If you enjoy tackling complex production challenges and building robust highly scalable systems this role offers significant scope and impact.
What you will do
- System Design & Operations: Design build and operate highly scalable reliable systems used by all Bedrock engineering teams.
- Cloud Infrastructure Ownership: Take full ownership of Bedrocks cloud infrastructure (AWS GCP Azure) ensuring best-in-class security performance and cost efficiency.
- Observability Stack: Design implement and maintain Bedrocks end-to-end observability stack (including monitoring logging and tracing).
- Production Excellence: Pave the road for production engineering by developing and implementing best practices for system reliability security on-call rotation and effective incident response.
- Performance & Cost Optimization: Continuously identify and implement improvements to enhance system performance and optimize cloud resource consumption.
What we are looking
- Reliability Passion: A deep passion for building and maintaining reliable fault-tolerant distributed systems.
- Cloud & IaC Expertise: Strong proficiency in major cloud platforms (such as AWS GCP or Azure) and Infrastructure as Code (IaC) tools like Terraform.
- Containerization & Orchestration: Proven experience with container technologies and orchestration platforms particularly Kubernetes.
- Observability: Hands-on experience with observability tools (e.g. Datadog Prometheus Splunk) and techniques.
- Technical Foundations: Strong understanding of distributed systems networking concepts database technologies and compute infrastructure.
- Security Best Practices: Strong understanding and experience implementing security best practices in cloud environments.
- Fast-Paced Environment: Ability to work in a fast-paced high-growth environment deal effectively with ambiguity and take decisive ownership of challenging problems.
Our roles are often flexible. If you dont fit all the criteria or are in another location (especially one where we have an office like SF of NY) please apply anyway! Wed love to consider you.
Join the team bringing advanced autonomy to the built world
At Bedrock weve assembled one of the most experienced autonomous technology teams in the industry with deep expertise scaling breakthroughs across transportation infrastructure and enterprise software. Our leaders helped put the first self-driving cars on public roads at Waymo scaled systems for Segments $3.2B acquisition and grew Uber Freight to $5B in revenue.
While others debate the future of AI were deploying it in the real world. Our systems are already installed on heavy machines across the country learning on real construction sites and working to reshape the earth with survey-grade precision and exceptional safety. This isnt a simulationits autonomous intelligence working on billion-dollar infrastructure projects.
In just over a year weve raised $80M put our equipment into the field and established partnerships with forward-thinking contractors who are integrating our technology into their operations. Were working quickly to close the gap between Americas surging demand for housing data centers manufacturing hubs and the construction industrys growing labor shortage.
Here algorithms meet steel-toed boots. Youll collaborate with both construction veterans and experienced engineers tackling problems where your work directly impacts how the physical world get built. If youre interested in applying cutting-edge technology to solve meaningful problems alongside a talented teamwed love to have you join us.
Required Experience:
Staff IC
View more
View less