Reddit is a community of communities. Its built on shared interests passion and trust and is home to the most open and authentic conversations on the internet. Every day Reddit users submit vote and comment on the topics they care most about. With 100000 active communities and approximately 101M daily active unique visitors Reddit is one of the internets largest sources of information. For more information visit
.Reddit SRE is rapidly innovating and our teams are working to meet the needs of infrastructure and development teams as they evolve our product faster than ever before. This is a unique opportunity to leave your mark on one of the most influential and trafficked corners of the internet.
As a Senior Site Reliability Engineer on Reddits Infrastructure SRE team youll use your knowledge of distributed systems and architecture to improve the reliability and performance of Reddits engineering platforms and services. We are looking for someone who thrives at the intersection of infrastructure and software development. This team will work very closely with the Compute Traffic and Observability infrastructure teams. They will own a suite of tools for allowing engineers to understand their creations based primarily on open-source solutions at scale. Were active users of and contributors to Prometheus Thanos Grafana Vector and more.
In this role you will also take ownership of risk management ensuring the reliability and performance of our systems. You will collaborate with cross-functional teams to identify assess and mitigate risks implementing best practices to enhance system resilience. Your expertise will drive proactive measures to maintain uptime and optimize service delivery making a significant impact on our operational excellence.
Join us and help build the future of Reddit!
Responsibilities:
- Advise:
- Work closely with engineering teams in designing and developing systems that are resilient and highly performant at a tremendous scale and maintaining the foundational platform for running Reddits infrastructure.
- Amplify:
- Identify and build capabilities into our foundational Infrastructure and Platform services which are used by Reddit engineering teams to build deploy and operate Reddit.
- Deliver software to improve the availability scalability latency and efficiency of observability components.
- Identify and engineer away risk across Reddits systems.
- Automate:
- Take repetitive manual or risky tasks and automate them out of existence. Build tools and integrate systems to support Reddits evolution.
- Automate critical aspects of the event driven development process
- Diagnose:
- Draw on your knowledge of distributed systems to identify and fix network system and service-level issues. Practice sustainable incident response and drive structural improvement with blameless postmortem.
- Share on-call responsibilities.
- Optimize:
- Observe and improve performance reduce cost and improve the experience for millions of users
- Contribute upstream changes to the open source projects we use
Qualifications
- 5 years of experience in Software Engineering Site Reliability Engineering or a development-focused DevOps role.
- Proficiency in one or more programming languages. Were predominantly writing code in Go and Python.
- Experience with Kubernetes and Cloud systems.
- Familiarity with distributed systems development bonus if familiar with any of the specific tools (Prometheus Thanos Grafana Vector Clickhouse Otel Loki)
- Experience with the development and operation of high-traffic backend systems.
- A demonstrated ability to debug fix and optimize code.
- Troubleshooting skills that span applications networking (TCP/IP) and systems.
- Strong working knowledge of Linux and containers.
- Excellent communication and collaborative skills.
Benefits:
- Private Medical Dental and Vision Benefits
- Retirement Savings plan with matching contributions
- Workspace benefits for your home office
- Personal & Professional development funds
- Family Planning Support
- Commuter Benefits
- Flexible Vacation & Reddit Global Days Off
In select roles and locations the interviews will be recorded transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording transcription and summarization prior to any scheduled interviews.
During the interview we will collect the following categories of personal information: Identifiers Professional and Employment-Related Information Sensory Information (audio/video recording) and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role as applicable. We will not sell your personal information or disclose it to any third party for their marketing purposes. We will delete any recording of your interview promptly after making a hiring decision. For more information about how we will handle your personal information including our retention of it please refer to our Candidate Privacy Policy for Potential Employees and Contractors.
Reddit is proud to be an equal opportunity employer and is committed to building a workforce representative of the diverse communities we serve. Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If due to a disability you need an accommodation during the interview process please let your recruiter know.
Required Experience:
Senior IC
Reddit is a community of communities. Its built on shared interests passion and trust and is home to the most open and authentic conversations on the internet. Every day Reddit users submit vote and comment on the topics they care most about. With 100000 active communities and approximately 101M dai...
Reddit is a community of communities. Its built on shared interests passion and trust and is home to the most open and authentic conversations on the internet. Every day Reddit users submit vote and comment on the topics they care most about. With 100000 active communities and approximately 101M daily active unique visitors Reddit is one of the internets largest sources of information. For more information visit
.Reddit SRE is rapidly innovating and our teams are working to meet the needs of infrastructure and development teams as they evolve our product faster than ever before. This is a unique opportunity to leave your mark on one of the most influential and trafficked corners of the internet.
As a Senior Site Reliability Engineer on Reddits Infrastructure SRE team youll use your knowledge of distributed systems and architecture to improve the reliability and performance of Reddits engineering platforms and services. We are looking for someone who thrives at the intersection of infrastructure and software development. This team will work very closely with the Compute Traffic and Observability infrastructure teams. They will own a suite of tools for allowing engineers to understand their creations based primarily on open-source solutions at scale. Were active users of and contributors to Prometheus Thanos Grafana Vector and more.
In this role you will also take ownership of risk management ensuring the reliability and performance of our systems. You will collaborate with cross-functional teams to identify assess and mitigate risks implementing best practices to enhance system resilience. Your expertise will drive proactive measures to maintain uptime and optimize service delivery making a significant impact on our operational excellence.
Join us and help build the future of Reddit!
Responsibilities:
- Advise:
- Work closely with engineering teams in designing and developing systems that are resilient and highly performant at a tremendous scale and maintaining the foundational platform for running Reddits infrastructure.
- Amplify:
- Identify and build capabilities into our foundational Infrastructure and Platform services which are used by Reddit engineering teams to build deploy and operate Reddit.
- Deliver software to improve the availability scalability latency and efficiency of observability components.
- Identify and engineer away risk across Reddits systems.
- Automate:
- Take repetitive manual or risky tasks and automate them out of existence. Build tools and integrate systems to support Reddits evolution.
- Automate critical aspects of the event driven development process
- Diagnose:
- Draw on your knowledge of distributed systems to identify and fix network system and service-level issues. Practice sustainable incident response and drive structural improvement with blameless postmortem.
- Share on-call responsibilities.
- Optimize:
- Observe and improve performance reduce cost and improve the experience for millions of users
- Contribute upstream changes to the open source projects we use
Qualifications
- 5 years of experience in Software Engineering Site Reliability Engineering or a development-focused DevOps role.
- Proficiency in one or more programming languages. Were predominantly writing code in Go and Python.
- Experience with Kubernetes and Cloud systems.
- Familiarity with distributed systems development bonus if familiar with any of the specific tools (Prometheus Thanos Grafana Vector Clickhouse Otel Loki)
- Experience with the development and operation of high-traffic backend systems.
- A demonstrated ability to debug fix and optimize code.
- Troubleshooting skills that span applications networking (TCP/IP) and systems.
- Strong working knowledge of Linux and containers.
- Excellent communication and collaborative skills.
Benefits:
- Private Medical Dental and Vision Benefits
- Retirement Savings plan with matching contributions
- Workspace benefits for your home office
- Personal & Professional development funds
- Family Planning Support
- Commuter Benefits
- Flexible Vacation & Reddit Global Days Off
In select roles and locations the interviews will be recorded transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording transcription and summarization prior to any scheduled interviews.
During the interview we will collect the following categories of personal information: Identifiers Professional and Employment-Related Information Sensory Information (audio/video recording) and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role as applicable. We will not sell your personal information or disclose it to any third party for their marketing purposes. We will delete any recording of your interview promptly after making a hiring decision. For more information about how we will handle your personal information including our retention of it please refer to our Candidate Privacy Policy for Potential Employees and Contractors.
Reddit is proud to be an equal opportunity employer and is committed to building a workforce representative of the diverse communities we serve. Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If due to a disability you need an accommodation during the interview process please let your recruiter know.
Required Experience:
Senior IC
View more
View less