Senior Site Reliability Engineer

Zello

Not Interested
Bookmark
Report This Job

profile Job Location:

Austin, TX - USA

profile Monthly Salary: Not Disclosed
Posted on: 10 days ago
Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

IMPORTANT: Please be aware scammers may try to impersonate Zello by reaching out regarding job opportunities. We will never ask you for bank account information checks or other sensitive information as part of our hiring process. All correspondence will come from the email domain. If youre unsure please email with questions.

About Zello

Zello is a voice-first communication platform powered by our industry-leading push-to-talk technology to improve collaboration and productivity for desk-less workers. With over 175 million users were the #1 rated push-to-talk app in the world delivering 9 billion (yes with a B) messages a month.

At Zello our company values are at the heart of what we do everyday. Were proud to serve the frontline were privileged to connect people in times of crisis across the globe and were honored to support first responders.

And this is where you come in.

Were looking for a Site Reliability Engineer to help us make our systems more observable performant and resilient. Youll work closely with our platform and application teams to build the tooling practices and insights that keep Zello reliable as we scale.

After a successful first year you will have

  • Implemented end-to-end observability tooling for application and infrastructure metrics traces and logs.

  • Delivered profiling and tracing systems that surface performance bottlenecks before they impact users.

  • Defined and tuned alerting to ensure only high-signal actionable incidents reach engineers.

  • Helped evolve Zellos incident response and postmortem processes ensuring consistent learning and improvement.

  • Provided developers with clear visibility into application performance and release impact driving data-informed engineering.

What youll do

  • Build and maintain monitoring tracing and profiling systems that empower teams to measure and improve performance.

  • Partner with cross-organization teams to define SLIs SLOs and SLAs that reflect real user experience.

  • Lead efforts to optimize observability from instrumentation standards to dashboard design.

  • Participate in and help coordinate our on-call rotation incident response and post-incident reviews.

  • Continuously evaluate and recommend tools or process improvements to strengthen reliability and reduce alert fatigue.

  • Collaborate on platform improvements that enhance system resilience and developer velocity.

Who you are

  • BSc in Computer Science or equivalent experience.

  • 6 years of experience in site reliability DevOps or software engineering roles.

  • Deep understanding of monitoring alerting and observability platforms (e.g. Prometheus Grafana Loki OpenTelemetry).

  • Experience implementing tracing logging and profiling for distributed systems.

  • Strong background in incident management postmortem practices and reliability metrics.

  • Familiarity with Linux Kubernetes Terraform and GCP (preferred) or other major clouds.

  • Proficiency in a scripting or backend language (e.g. Python Go Bash).

Excellent problem-solving communication and collaboration skills.
Passionate about eliminating toil and driving continuous improvement in system health.

We hire for potential passion for our mission and a knack for solving difficult problems over checking every qualification box. We have competitive pay equity with significant upside and intentionally design our benefits to encourage healthy and well-balanced employees flexible schedules and time off. We even offer a sabbatical after every five years of service so youre able to pursue and enjoy what matters most to you. And of course we wouldnt be a technology company in Austin without a ping-pong table and free snacks in our break room. Join us!

Zello provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race color religion age sex national origin disability status genetics protected veteran status sexual orientation gender identity or expression or any other characteristic protected by federal state or local laws.

All Zello personnel are required to comply with defined security privacy and compliance requirements applicable to their role along with requirements that are applicable to all Zello personnel.


Required Experience:

Senior IC

IMPORTANT: Please be aware scammers may try to impersonate Zello by reaching out regarding job opportunities. We will never ask you for bank account information checks or other sensitive information as part of our hiring process. All correspondence will come from the email domain. If youre unsure p...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

Stay connected with your team using Zello - the leading push-to-talk app for frontline communication. Download Zello for work and get PTT walkie-talkie and radio features. Visit us now.

View Profile View Profile