Cloud Infrastructure - Site Reliability Engineer

Employer Active

1 Vacancy

Job Location

Alpharetta, GA - USA

Monthly Salary

Not Disclosed

Job Description

Title: Cloud Infrastructure - Site Reliability Engineer
Location: Alpharetta, GA or Berkeley Heights, NJ (5 days onsite)
Certifications:
Certified Engineer DevOps SRE CSREF
Job Description:
As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise across multiple public cloud platforms, you will be responsible for managing and operating cloud infrastructure in alignment with the principles of Google's SRE model. Your role will focus on ensuring the reliability, availability, and performance of our cloud services while driving automation and continuous improvement across production environments. You will collaborate closely with cross-functional teams to strengthen our cloud reliability posture and streamline operations through innovative automation solutions.
Key Responsibilities:
  • Design, build, and maintain highly available, scalable, and secure cloud infrastructure on platforms such as AWS, GCP, or Azure.
  • Develop and implement automation for provisioning, monitoring, scaling, and incident response using Infrastructure-as-Code tools (e.g., Terraform, CloudFormation, Ansible).
  • Monitor system reliability, capacity, and performance; proactively detect and address issues before they impact users.
  • Respond to production incidents, participate in on-call rotations, and lead post-incident reviews to drive root cause analysis and reliability improvements.
  • Collaborate with software engineering and security teams to ensure new services and features are production-ready and meet reliability standards.
  • Build and maintain tools for deployment, monitoring, and operations; automate manual processes to reduce toil.
  • Document operational processes and system architectures to ensure knowledge sharing and repeatability.
  • Continuously evaluate and implement new technologies to improve system reliability, security, and efficiency.
Qualifications:
  • Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
  • 3 years of experience in software development with proficiency in at least one programming language (e.g., Python, Go, Java, C).
  • Experience administering cloud platforms (AWS, GCP, Azure), including networking, security, containerization, storage, data management, and serverless technologies.
  • Solid understanding of Linux systems, networking fundamentals, virtualized and distributed systems, file systems, system processes, and configurations.
  • Deep understanding of observability (monitoring, alerting, and logging) tools in cloud environments. Ability to set up and maintain monitoring dashboards, alerts, and logs.
  • Familiarity with Continuous Integration/Continuous Deployment (CI/CD) tools for automated testing, deployments, provisioning, and observability.
  • Ability to manage and respond to incidents, perform root cause analysis, and conduct post-mortem reviews.
  • Understanding of setting, monitoring, and maintaining Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) for system reliability.
  • Additional qualifications (a plus): Experience working with enterprise-scale financial services or other regulated industries.

Employment Type

Full-time
