Stefanini is looking for a Cloud Site Reliability Engineer - Remote
For quick apply please contact ; Ph:
W2 Candidates only!
About the Opportunity:
As a Senior Cloud Engineer on the Cloud SRE team, you will be responsible for designing and developing cloud solutions and engineering reliability tools for the Cloud Foundation Services (CFS) platform in the Infrastructure Platforms & Operations organization. You will apply software engineering practices to build scalable, reusable solutions and utilities that enhance platform reliability.
Responsibilities:
What Will Be Expected of You:
Design, develop, and maintain reliability solutions and SRE utilities to reduce toil, improve cloud platform reliability, and industrialize SRE practices across the system
Build and optimize Infrastructure as Code (IaC) using Terraform to manage AWS resources related to SRE solutions, incorporating cost-efficient design principles
Develop CI/CD pipelines and automated testing to ensure code quality, reliability, and rapid delivery of the solutions
Define SRE standards, best practices, and guidelines for adoption across teams; establish SRE metrics such as SLIs and SLOs
Apply software engineering best practices, including version control, code reviews, test-driven development, and documentation, to all development
Participate in incident management and the on-call rotation, providing technical support for SRE tools, troubleshooting production issues, and collaborating with teams to reduce incident recurrence through proactive detection and pattern analysis
Stay current with emerging AWS services, SRE methodologies, and cloud-native development technologies, and drive adoption of innovative solutions
Collaborate within Agile and Scaled Agile frameworks with cross-functional teams to deliver integrated cloud automation solutions
Produce clear, blameless postmortems with actionable items and documented failure scenarios
Qualifications:
Bachelor's degree in Computer Science, Information Systems, or an equivalent background, or equivalent experience
7 years of extensive experience in software development with a focus on reliability and platform engineering
5 years of advanced Python development skills, with proven experience building enterprise-grade, highly available tools, APIs, and utilities
3 years of hands-on experience developing solutions in AWS environments, with a deep understanding of core services (EC2, VPC, S3, Lambda, IAM, CloudFormation, EventBridge, Step Functions, etc.) and resource cost optimization
3 years of experience applying SRE principles, including observability, toil automation, SLIs/SLOs, and reliability engineering
Expert-level proficiency with Infrastructure as Code (IaC) using Terraform, including module development and state management
Strong experience with CI/CD pipelines, automated testing frameworks, and DevOps practices
Experience with observability tools and practices, including Grafana, AWS CloudWatch, and AWS Canary
Experience defining, implementing, and managing SLOs/SLIs and error budgets; familiarity with conducting RCAs and producing postmortem documentation
Working experience in Agile and Scaled Agile environments, and familiarity with ITSM processes (incident, change, and problem management), resilience testing, and chaos engineering practices
Experience with GoLang or additional programming languages is a plus
Stefanini takes pride in hiring top talent and developing relationships with our future employees. Our talent acquisition teams will never make an offer of employment without having a phone conversation with you. Those face-to-face conversations will involve a description of the job for which you have applied. We will also speak with you about the process, including interviews and job offers.
About Stefanini Group:
The Stefanini Group is a global provider of offshore, onshore, and nearshore outsourcing, IT digital consulting, systems integration, application, and strategic staffing services to Fortune 1000 enterprises around the world. We have a presence across the Americas, Europe, Africa, and Asia, and serve more than four hundred clients across a broad spectrum of markets, including financial services, manufacturing, telecommunications, chemical, services, technology, public sector, and utilities. Stefanini is a CMM Level 5 IT consulting company with a global presence.
Required Experience:
IC
Created in 1987, Stefanini is a $1B global IT provider of business solutions with locations in 40 countries across the Americas, Europe, Australia and Asia. With more than 25,000 employees, Stefanini provides onshore, offshore and nearshore IT services, including application developme