SRE Lead

Employer Active

1 Vacancy
Job Location

Atlanta, GA - USA

Monthly Salary

Not Disclosed

Job Description

Job Title: SRE Lead

Location: Atlanta, GA (hybrid from day 1)

Hybrid: work from office on alternate weeks (Thursday to Wednesday)

Onsite SRE Lead (10 yrs)

Core Skillset

Client Consulting:

  • Work with the team to define the SRE maturity model, observability strategy, and AWS reliability roadmap, and to identify gaps.
  • Translate business SLAs into SLIs, SLOs, and error budgets.
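
As a rough illustration of the SLA-to-error-budget translation mentioned above, the arithmetic can be sketched in Python. This is a minimal sketch, not a prescribed method: the function names, the 30-day window, and the 99.9% target are assumptions chosen for the example.

```python
def error_budget_minutes(slo_target: float, window_days: int = 30) -> float:
    """Allowed downtime in minutes for an availability SLO over a window.

    slo_target is a fraction, e.g. 0.999 for "99.9% available" (assumed example).
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent for a request-based SLI.

    The SLI here is good_events / total_events; a negative result means
    the budget is overspent.
    """
    if total_events <= 0:
        raise ValueError("total_events must be positive")
    allowed_bad = (1.0 - slo_target) * total_events
    actual_bad = total_events - good_events
    return 1.0 - actual_bad / allowed_bad


if __name__ == "__main__":
    # A 99.9% SLO over 30 days leaves roughly 43 minutes of downtime.
    print(round(error_budget_minutes(0.999), 1))
    # 500 failed requests out of 1,000,000 spends about half the budget.
    print(round(budget_remaining(0.999, 999_500, 1_000_000), 2))
```

A time-based budget suits availability SLAs, while the request-based form suits SLIs measured as a ratio of good to total events.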

Architecture & Design:

  • Lead and implement AWS serverless reliability architecture (multi-region failover, self-healing).
  • Define observability blueprints (logs, metrics, traces, UX telemetry).
  • Define cost-optimized data observability and resiliency solutions.

Reliability & Resilience

  • Design and implement fault-tolerant, highly available AWS architectures.
  • Experience with DynamoDB global tables, RDS failovers, and capacity planning.
  • Apply SRE principles: SLIs, SLOs, SLAs, error budgets, and toil reduction.
  • Drive chaos engineering, disaster recovery, and capacity planning exercises.

Observability & Monitoring

  • Experience in implementing end-to-end observability (logs, metrics, traces, events).
  • Build cost-optimized unified dashboards and custom metrics using Dynatrace and CloudWatch.
  • Experience in implementing data observability and resiliency solutions.
  • Automate alerts, anomaly detection, and incident response workflows.

Automation & Infrastructure

  • Develop automation and custom tooling using Python and .
  • Build infrastructure as code using AWS CDK and CloudFormation.
  • Implement self-healing and auto-remediation solutions with AWS serverless services.

Operations & Incident Management

  • Implement AI/ML-driven automation.
  • Collaborate with developers for shift-left observability and performance optimization.
  • Guide and lead adoption of automation, proactive observability, and self-healing systems.

Employment Type

Full-time
