Senior Site Reliability Engineer

GE HealthCare

Not Interested
Bookmark
Report This Job

profile Job Location:

Bengaluru - India

profile Monthly Salary: Not Disclosed
Posted on: Yesterday
Vacancies: 1 Vacancy

Job Summary

Job Description Summary

The Senior Site Reliability Engineer will be responsible for performance and availability of Compute and Network infrastructure consumed by all business segments. The Site Reliability team is composed of highly talented individuals obsessively focused with availability through operational excellence. The ideal individual is relentlessly technical passionate for automating everything and committed to delivering amazing customer experiences

GE HealthCare is a leading global medical technology and digital solutions innovator. Our purpose is to create a world where healthcare has no limits. Unlock your ambition turn ideas into world-changing realities and join an organization where every voice makes a difference and every difference builds a healthier world..

Job Description

In this role you will:

  • Establish performance baseline capacity thresholds correlate events and define monitoring/alerting criteria.

  • Develop automated solutions to address potential problems before they result in a service interruption.

  • Provide impact assessment and mitigation plan for changes going into the production environment.

  • Investigate root cause of severe and systemic outages identify corrective actions and apply across the enterprise.

  • Develop availability measures that align with consumer experience to accurately assess the usability of crucial services.

  • Build capacity models to baseline transactional load compared to resource performance and leverage data to predict overall system capacity while automating load placement to avoid outages.

  • Identify thresholds for all critical links in the data path to quickly isolate where imbalances may result in potential outages.

  • Analyse failure points in services to model risk level and resolution steps if failure occurs.

  • Assist in driving architecture enhancements into system to mitigate potential failure points.

  • Programmatically monitor for and remediate configuration drift of critical devices.

  • Develop response plans to potential failure points and evaluate effectiveness during planned tests.

  • Perform comprehensive operational health checks of the entire services to identify areas of concern and track activities to drive improvements at all levels of the architecture.

  • Provide technical coaching and direction to more junior teammates.

Qualifications/Essential Requirements:

  • Bachelors Degree in Computer Science or STEM Majors (Science Technology Engineering and Math) with at least 10 years of overall experience

  • Hands-on experience in site reliability engineering with a focus on AWS.

  • Strong understanding of AWS Services architecture and best practices.

  • Expertise on management & administration of Kubernetes clusters.

  • Strong background in scripting automation configuration management and infrastructure-as-code practices (Terraform AWS CloudFormation Crossplane Pulumi etc.)

  • Good understanding of DevOps practices CI/CD pipelines version control systems (Git). Experience in GitOps is a plus.

  • Strong knowledge on Unix based operating systems & workload management and networking systems.

  • Experience with configuring customizing and extending monitoring /APM tools (Datadog Kloudfuse Grafana Splunk etc.)

  • Operational experience in complex distributed systems including defining measuring and monitoring SLO/SLAs for availability and reliability goals.

  • Experience with incident management and post-incident reviews.

  • AWS Certified Solutions Architect Associate AWS Certified DevOps Engineer is a plus.

Inclusion and Diversity

GE Healthcare is an Equal Opportunity Employer where inclusion matters. Employment decisions are made without regard to race color religion national or ethnic origin sex sexual orientation gender identity or expression age disability protected veteran status or other characteristics protected by law.

We expect all employees to live and breathe our behaviors: to act with humility and build trust; lead with transparency; deliver with focus and drive ownership always with unyielding integrity.

Ourtotal rewardsare designed to unlock your ambition by giving you the boost and flexibility you need to turn your ideas into world-changing realities. Our salary and benefits are everything youd expect from an organization with global strength and scale and youll be surrounded by career opportunities in a culture that fosters care collaboration and support.

#LI-RS1

Additional Information

Relocation Assistance Provided: Yes


Required Experience:

Senior IC

Job Description SummaryThe Senior Site Reliability Engineer will be responsible for performance and availability of Compute and Network infrastructure consumed by all business segments. The Site Reliability team is composed of highly talented individuals obsessively focused with availability through...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

GE HealthCare provides digital infrastructure, data analytics & decision support tools helps in diagnosis, treatment and monitoring of patients

View Profile View Profile