Senior Site Reliability Engineer (SRE), Release & Observability Focus

Cloudious LLC

Job Location: Scottsdale, AZ - USA

Monthly Salary: Not Disclosed
Posted on: 4 hours ago
Vacancies: 1

Job Summary

Key Responsibilities

  • Solid hands-on experience in SRE or Release Engineering roles
  • Strong experience deploying and operating containerized applications on Kubernetes across on-prem and AWS cloud environments
  • Strong understanding of Linux and networking fundamentals
  • Own release automation, deployment strategies, rollback mechanisms, and release validation
  • Proven experience supporting REST API services in production environments
  • Drive continuous improvements in release safety, reliability, monitoring, alerting, and operational readiness
  • Experience with monitoring and observability tools such as Splunk and Prometheus/Grafana
  • Lead troubleshooting of complex production incidents and service degradations
  • Participate in on-call rotations and lead incident response and post-incident reviews

Nice To Have

  • Python scripting for automation and platform tooling
  • Knowledge or experience with Honeycomb for observability

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root Cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting