Manager DevOps Engineering (Observability & Developer Experience)
Job Summary
About the Role
Were looking for a Lead DevOps Engineer (Observability & Developer Experience) to take ownership of two key areas within our engineering operations. Youll work closely with the engineering teams responsible for our observability platform and developer tools helping to ensure our systems are reliable and our developers are empowered with efficient this role youll shape the technical direction for monitoring logging CI/CD and automation initiatives across the organization while staying hands-on with engineering work. Youll also collaborate with teams across the company to drive continuous improvement and innovation.
What Youll Do
- Lead and contribute to a team of DevOps/SRE engineers focused on Observability (monitoring logging alerting) and Developer Experience (CI/CD pipelines internal developer tools).
- Help define the roadmap and vision for observability and developer productivity aligning with business and engineering needs.
- Design implement and operate monitoring and logging infrastructure ensuring robust visibility into system health and performance.
- Build and maintain CI/CD pipelines and automation frameworks that enable fast safe deployments and improve developer workflows.
- Collaborate with other engineering and product teams to drive cross-functional initiatives and ensure reliability and efficiency are embedded into our services.
- Establish and share best practices for infrastructure management incident response and post-mortems.
- Contribute to defining and implementing AI tooling and standards across the company ensuring developers have access to scalable and secure AI platforms.
What Youll Bring
- 5 years of experience in DevOps SRE or Infrastructure Engineering including some experience leading or mentoring other engineers.
- Hands-on expertise in observability practices and tools (e.g. Prometheus Grafana ELK stack) and familiarity with SRE principles.
- Experience with CI/CD pipelines and automation tools (e.g. Jenkins GitLab CI) and Infrastructure as Code practices.
- Knowledge of cloud platforms (AWS and/or GCP) and container orchestration (Kubernetes) and experience building scalable systems.
- Strong collaboration and communication skills with experience working across teams (development QA product).
- Passion for automation reliability and developer efficiency with a mindset of continuous improvement.
Preferred (Not Essential)
- Experience leading DevOps/SRE transformations or implementing reliability engineering practices at scale.
- Previous software engineering experience.
Why Join Us
This role is perfect for someone who wants to stay hands-on technically while also mentoring and influencing a team. Youll have a direct impact on the reliability and efficiency of our engineering systems and help shape the way our developers work every day.
Required Experience:
Manager