Location: IN - Bangalore 24/7 Job-ID: 216135 Contract type: Standard Business Unit: IT Consulting
Life on the team
Computacenter seeking confident and experienced Site Reliability Engineer (SRE) is responsible for ensuring the reliability scalability and performance of systems and must have the ability to be flexible and proactive to facilitate a quick response to changing project requirements and meeting global goals successfully.
What youll do
- Strong experience in Agile product ownership (Scrum SAFe Kanban).
- Write clear user stories and prioritize features based on business value.
- Build CI/CD Pipelines
- Proficiency in IaC tools like Terraforrm Ansible and Puppet
- Strong knowledge of cloud architecture
- Identify any security issues and remediate
- Keep abreast of industry best practises and emerging technologies and their potential to enhance the performance of the services
- Stakeholder & Business Engagement
- Excellent communication and stakeholder management skills able to work with other teams to identify opportunities and build
- Ability to engage with business leaders and technical teams.
- Strong requirement-gathering and analysis skills
What youll need
- Experience: 8 to 15 years
- Reliability: Focus on maintaining system uptime and minimizing downtime. Implement monitoring and alerting systems to detect and respond to issues promptly. Responsible for ensuring that DR capabilities are designed and tested regularly
- Scalability: Design and manage systems that can handle increasing loads efficiently. This involves capacity planning and optimizing resource usage.
- Performance: SREs continuously monitor and improve system performance addressing bottlenecks and ensuring smooth operation. Work with the development teams to ensue that production systems are fully optimised.
- Automation: They automate repetitive tasks to reduce manual intervention and improve consistency. This includes deploying infrastructure as code and automating deployment processes.
- Incident Management: SREs are responsible for responding to incidents performing root cause analysis and implementing solutions to prevent recurrence.
- Collaboration: They work closely with development and operations teams to ensure that new features and updates are reliable and scalable.Good knowledge and experience with AppExchange products and understanding of how to install/de-install
Certifications & Methodologies
ITIL
Safe
Devops
Experience with AI & Automation
Exposure to AI tools and promote an automation first culture
Understanding of AI-enhanced monitoring tools for proactive issue detection.
DevOps & Cloud Technologies
Hands-on experience with Azure AWS or Google Cloud
Knowledge of Infrastructure as Code (Terraform Bicep) for platform deployments
Multi-Region & Large-Scale Enterprise Experience
Experience working in global organizations with multi regional teams
About us With over 20000 employees across the globe we work at the heart of digitisation advising organisations on IT strategy implementing the most appropriate technology and helping our customers to source transform and manage their technology infrastructure in over 70 countries. We deliver digital technology to some of the worlds greatest organisations driving digital transformation and enabling people and their businesses.