Site Reliability Engineer (SRE)

Not Interested
Bookmark
Report This Job

profile Job Location:

Atlanta, GA - USA

profile Monthly Salary: Not Disclosed
Posted on: 5 hours ago
Vacancies: 1 Vacancy

Job Summary

Job Summary:
We are looking for a highly experienced Site Reliability Engineer (SRE) with 12 years of experience to support and enhance the reliability scalability and performance of enterprise applications and cloud infrastructure. The ideal candidate will have strong hands-on experience with CI/CD pipelines Google Cloud Platform (GCP) Linux systems databases and API testing along with a strong production support mindset.
Key Responsibilities:
Design build and maintain CI/CD pipelines using Jenkins
Perform API testing and validation using Postman or Bruno
Write analyze and troubleshoot SQL Server queries and stored procedures
Provide advanced Linux support including shell scripting and AWK usage
Monitor and support applications hosted on Google Cloud Platform (GCP)
Work with BigQuery for data analysis and issue resolution
Manage Google Cloud Storage including bucket-to-bucket data transfers
Parse and manipulate JSON data for APIs and system integrations
Provide production support incident management and root cause analysis
Collaborate with development and DevOps teams to improve system reliability and automation
Required Skills & Experience:
12 years of IT experience including SRE DevOps or Production Support roles
Strong hands-on experience with Jenkins
Experience with Postman or Bruno for API testing
Strong expertise in SQL Server (complex queries stored procedures tuning)
Advanced Linux skills with shell scripting and AWK
Hands-on experience with Google Cloud Platform (GCP)
Experience with BigQuery
Knowledge of Google Cloud Storage and data transfers
Working knowledge of JavaScript (scripting level)
Strong understanding of JSON
Experience supporting production systems and on-call environments
Excellent troubleshooting and communication skills
Preferred / Nice to Have:
Experience with monitoring tools (Grafana Prometheus Stackdriver)
Exposure to Infrastructure as Code (Terraform)
Experience in Agile / DevOps environments
Job Summary: We are looking for a highly experienced Site Reliability Engineer (SRE) with 12 years of experience to support and enhance the reliability scalability and performance of enterprise applications and cloud infrastructure. The ideal candidate will have strong hands-on experience with CI/CD...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting