Site Reliability Engineer

Florida City, FL - USA

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

Job Title: Site Reliability Engineer
Location: Florida City (FL)
ZIP Code: 32004
Experience Required: 12 Years
Employment Type: Contract

About the Role

We are seeking an experienced Site Reliability Engineer (SRE) with more than 12 years of hands-on expertise in building maintaining and improving large-scale highly available systems. The ideal candidate will have strong skills in automation cloud infrastructure performance optimization monitoring and incident response.

Key Responsibilities

Design implement and maintain highly reliable scalable and secure systems.
Develop automation to reduce manual operational tasks and eliminate repeated issues.
Build and maintain CI/CD pipelines to support continuous delivery and deployment.
Manage cloud infrastructure (AWS Azure or GCP) including networking security and scaling.
Create and maintain monitoring logging and alerting systems using modern tooling.
Lead incident response root-cause analysis and post-incident reviews.
Improve system performance and reliability through capacity planning and performance tuning.
Work closely with software engineering teams to ensure smooth production operations.
Implement infrastructure-as-code using Terraform Ansible or similar tools.
Ensure compliance with security and operational standards.

Required Skills and Experience

12 years of experience in Site Reliability DevOps or Production Engineering roles.
Strong hands-on experience with cloud platforms (AWS Azure or GCP).
Expertise in CI/CD pipelines and automation tools such as Jenkins GitLab or GitHub Actions.
Proficiency with containerization and orchestration (Docker Kubernetes).
Experience with monitoring tools such as Prometheus Grafana ELK Splunk or Datadog.
Strong scripting/programming skills (Python Bash Go or similar).
Familiarity with networking concepts load balancing and distributed systems.
Solid understanding of security best practices and infrastructure governance.
Experience managing high-availability systems in production environments.

Preferred Qualifications

Experience working in Agile environments.
Background with database operations (SQL/NoSQL).
Knowledge of infrastructure cost optimization.

Job Title: Site Reliability Engineer Location: Florida City (FL) ZIP Code: 32004 Experience Required: 12 Years Employment Type: Contract About the Role We are seeking an experienced Site Reliability Engineer (SRE) with more than 12 years of hands-on expertise in building maintaining and improving la...

Job Title: Site Reliability Engineer
Location: Florida City (FL)
ZIP Code: 32004
Experience Required: 12 Years
Employment Type: Contract

About the Role

Key Responsibilities

Design implement and maintain highly reliable scalable and secure systems.
Develop automation to reduce manual operational tasks and eliminate repeated issues.
Build and maintain CI/CD pipelines to support continuous delivery and deployment.
Manage cloud infrastructure (AWS Azure or GCP) including networking security and scaling.
Create and maintain monitoring logging and alerting systems using modern tooling.
Lead incident response root-cause analysis and post-incident reviews.
Improve system performance and reliability through capacity planning and performance tuning.
Work closely with software engineering teams to ensure smooth production operations.
Implement infrastructure-as-code using Terraform Ansible or similar tools.
Ensure compliance with security and operational standards.

Required Skills and Experience

12 years of experience in Site Reliability DevOps or Production Engineering roles.
Strong hands-on experience with cloud platforms (AWS Azure or GCP).
Expertise in CI/CD pipelines and automation tools such as Jenkins GitLab or GitHub Actions.
Proficiency with containerization and orchestration (Docker Kubernetes).
Experience with monitoring tools such as Prometheus Grafana ELK Splunk or Datadog.
Strong scripting/programming skills (Python Bash Go or similar).
Familiarity with networking concepts load balancing and distributed systems.
Solid understanding of security best practices and infrastructure governance.
Experience managing high-availability systems in production environments.