Site Reliability Engineer

Encora

Job Location:

Mexico City - Mexico

Monthly Salary: Not Disclosed
Posted on: 10 hours ago
Vacancies: 1 Vacancy

Job Summary

Important Information:

  • Years of Experience: 5 years

  • Job Mode: Full-time

  • Work Mode: Remote within Mexico

Job Summary:
We are seeking a Site Reliability Engineer (19324) to ensure the reliability, scalability, and performance of custom platforms running on AWS infrastructure and Kubernetes. This role focuses on Tier 3 issue resolution, operational readiness for new releases, and proactive improvements to platform stability and customer experience through SRE best practices.

Responsibilities and Duties:
  • Troubleshoot and resolve Tier 3 platform issues for AWS-based custom applications.
  • Collaborate closely with engineering teams to prepare Operations for new releases and feature enhancements.
  • Identify recurring issues and implement automation, tooling, or process improvements to prevent recurrence.
  • Design and implement strategies to improve platform reliability, scalability, and performance.
  • Monitor system health and proactively identify risks or degradation.
  • Participate in incident response, root cause analysis, and post-mortem reviews.
  • Contribute to operational documentation, runbooks, and readiness plans.
  • Partner with internal stakeholders to continuously enhance customer experience and platform robustness.

Qualifications and Skills:
  • Hands-on experience supporting and operating AWS cloud environments.
  • Strong knowledge of Kubernetes and container orchestration concepts.
  • Proficiency in Python or Go for automation and scripting.
  • Experience with platform support, troubleshooting, and performance optimization.
  • Familiarity with CI/CD pipelines, monitoring, and observability tools.
  • Strong problem-solving abilities with an engineering-focused mindset.

Role-specific Requirements:
  • Ability to handle complex production incidents and drive them to resolution.
  • Experience working closely with development teams on operational readiness.
  • Proven ability to identify systemic issues and implement long-term solutions.
  • Understanding of SRE principles, incident management, and reliability metrics.

Technologies:
  • AWS
  • Kubernetes
  • Docker
  • Python
  • Go
  • CI/CD pipelines
  • Monitoring and Observability tools
  • Terraform or CloudFormation (preferred)

Skillset Competencies:
  • Cloud Infrastructure Management
  • Container Orchestration
  • Automation and Scripting
  • Incident Response
  • Root Cause Analysis
  • Reliability Engineering
  • Cross-team Collaboration
  • Documentation and Operational Excellence

About Encora:
Encora is the preferred digital engineering and modernization partner of some of the world's leading enterprises and digital native companies. With over 9,000 experts in 47 offices and innovation labs worldwide, Encora's technology practices include Product Engineering & Development, Cloud Services, Quality Engineering, DevSecOps, Data & Analytics, Digital Experience, Cybersecurity, and AI & LLM Engineering.
At Encora, we hire professionals based solely on their skills and qualifications, and do not discriminate based on age, disability, religion, gender, sexual orientation, socioeconomic status, or nationality.


Required Experience:

IC

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root Cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

As Encora Inc. expands its footprint in Latin America, its acquisition of Nearsoft gives our clients a unique opportunity to nearshore on a global scale.
