Sr Site Reliability Engineer

Visa

Job Location:

São Paulo - Brazil

Monthly Salary: Not Disclosed
Posted on: 5 hours ago
Vacancies: 1 Vacancy

Job Summary

Join Pismo's Platform squad within the SRE Tribe, dedicated to owning and evolving the containerized platform that underpins critical workloads. You'll work cross-functionally to ensure our platform is reliable, scalable, secure, and easy to operate, focusing on Kubernetes at scale and cloud architecture.

What You'll Do

Own the end-to-end lifecycle (design, provisioning, upgrades, maintenance, and decommissioning) of core platform components, including:

  • Cloud infrastructure primitives
  • Kubernetes clusters and cluster services
  • Networking, ingress, and service discovery
  • Service Mesh and supporting data-plane components

Design platform components to be resilient by default, applying SRE principles such as:

  • Fault isolation and graceful degradation
  • Capacity planning and saturation control
  • Reduced operational toil and clear failure modes

Lead the design and implementation of infrastructure bootstrap orchestration, including:

  • Automated cluster and environment provisioning
  • Deterministic, repeatable platform bring-up and teardown
  • Dependency-aware orchestration across cloud, network, and Kubernetes layers

Drive Infrastructure-as-Code and GitOps-first practices to ensure:

  • Platform components are reproducible and auditable
  • Changes are automated, testable, and reversible
  • Manual intervention is minimized or eliminated
  • Identify automation gaps and lead initiatives that reduce human effort, onboarding time, and operational risk.

Apply and promote SRE operational excellence practices, including:

  • Clear ownership and runbooks for platform components
  • Participation in on-call rotation as a platform reliability escalation point
  • Incident response, post-incident reviews, and problem management
  • Improve day-2 operations by standardizing upgrade/rollback strategies and reducing MTTD/MTTR.
  • Ensure platform operations align with security, compliance, and internal control requirements.
  • Collaborate with engineering teams across the organization to influence platform adoption, reliability standards, and cloud-native best practices.

This is a remote position. A remote position does not require job duties be performed within proximity of a Visa office location. Remote positions may be required to be present at a Visa office with scheduled notice.


Qualifications:

For this role, you must be based in Brazil.

Language Skills
Proficiency in English at B2 level or above (Upper-Intermediate)

Technical Skills

  • Strong hands-on experience with public cloud platforms (AWS preferred; Azure also considered).
  • Proven experience operating and administering Kubernetes at scale in production environments.
  • Strong experience with container orchestration platforms and cloud architecture fundamentals (networking, IAM/security concepts, and reliability patterns).
  • Experience with Infrastructure as Code (Terraform preferred) and automation-first workflows.
  • Familiarity with GitOps practices and CI/CD pipelines.
  • Strong troubleshooting skills for distributed systems, including root-cause analysis and reliability improvements.
  • Experience with observability concepts and practices (monitoring, logging, alerting, tracing).

Preferred Qualifications

  • Experience with Service Mesh technologies (Istio preferred; App Mesh or Linkerd).
  • Experience working with critical or mission-critical systems.
  • Strong background applying SRE principles (operational readiness, incident management, runbooks, toil reduction).
  • AWS certifications.

Additional Information:

Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.


Remote Work:

Yes


Employment Type:

Full-time

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Visa (NYSE: V) is a world leader in digital payments, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories. Our purpose is to uplift everyone, everywhere by being the best way to pay and b ...