Lead Site Reliability Engineer Public Cloud LMTS

Job Location

Bengaluru - India

Monthly Salary

Not Disclosed

Vacancy

1 Vacancy

Job Description

Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

We are looking for a Lead Site Reliability Engineer to join our Cloud Infrastructure Engineering division. Cloud Infrastructure Engineering ensures the continuous availability of the technologies and systems that are the foundation of athenahealth's services. We are directly responsible for thousands of servers, petabytes of storage, and handling thousands of web requests per second, all while sustaining growth at a meteoric rate. We enable an operating system for the medical office that abstracts away administrative complexity, leaving doctors free to practice medicine.

But enough about us; let's talk about you!

You're a seasoned engineer with a passion for identifying and resolving reliability and scalability challenges. You are a curious team player, someone who loves to explore, learn, and make things better. You are excited to uncover inefficiencies in business processes, creative in finding ways to automate solutions, and relentless in your pursuit of greatness. You're a nimble learner capable of quickly absorbing complex solutions and an excellent communicator who can help evangelize engineering excellence.

The Team:

We are a team of Site Reliability Engineers who are passionate about reliability, automation, and scalability. We use an agile-based framework to execute our work, ensuring we are always focused on the most important and impactful needs of the business. We support systems in both private and public cloud and make data-driven decisions about which one best suits the needs of the business. We are relentless in automating away manual, repetitive work so we can focus on projects that help move the business forward.

Job Responsibilities

Cloud Infrastructure Leadership:

  • Lead the design, implementation, and maintenance of scalable and highly available cloud infrastructure using public cloud platforms (AWS).
  • Ensure the cloud infrastructure is resilient, fault-tolerant, and capable of supporting large-scale applications and services.
  • Provide technical guidance and leadership for cloud infrastructure projects, helping to drive the infrastructure strategy forward.
  • Strong understanding of hybrid cloud setup, operations, and scaling up.

Reliability and Availability:

  • Define, measure, and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for cloud services and infrastructure components.
  • Lead efforts to continuously improve system availability, fault tolerance, and disaster recovery capabilities.
  • Ensure proactive incident detection, efficient root cause analysis, and timely resolution of production incidents.
  • On-call participation in a 24x7 setup.

Automation and Infrastructure as Code (IaC):

  • Drive automation efforts to reduce manual intervention and streamline cloud infrastructure management.
  • Implement Infrastructure as Code (IaC) using tools like Terraform, AWS CloudFormation, and Ansible to provision, manage, and scale cloud resources.
  • Automate deployment, scaling, and monitoring processes to improve efficiency and reduce operational complexity.

Monitoring, Observability, and Performance Tuning:

  • Design and implement monitoring, logging, and alerting solutions to track cloud infrastructure health, performance, and security.
  • Use observability tools (e.g., Prometheus, Grafana, CloudWatch) to ensure continuous visibility into cloud infrastructure performance and capacity.
  • Identify bottlenecks and performance issues, proposing and implementing improvements to ensure optimal resource usage.

Security and Compliance:

  • Ensure that cloud infrastructure is built with security best practices in mind and meets all relevant compliance and regulatory requirements.
  • Collaborate with security teams to implement security controls and risk mitigation strategies across cloud environments.
  • Regularly audit and review cloud infrastructure for security vulnerabilities and compliance gaps.

Cost Optimization:

  • Optimize cloud resource usage and reduce costs without compromising performance or reliability.
  • Monitor cloud service usage and recommend strategies for optimizing cloud infrastructure spending.
  • Implement cost-tracking tools and reporting mechanisms to ensure the business remains within budget for cloud infrastructure.

Collaboration and Cross-Functional Leadership:

  • Work closely with development, DevOps, and operations teams to ensure cloud infrastructure aligns with application and business requirements.
  • Lead and mentor a team of Site Reliability Engineers, promoting best practices and fostering a culture of operational excellence.
  • Act as a key technical point of contact for cloud-related infrastructure and operations issues.

Incident Management and Post-Mortem:

  • Lead the incident response efforts for cloud infrastructure-related issues, ensuring that all incidents are managed effectively.
  • Conduct post-incident reviews (PIRs) to identify root causes and implement preventive measures.
  • Continuously refine incident management processes to reduce downtime and improve recovery times.

Qualifications

  • 8-10 years of hands-on experience with cloud automation and configuration management tools (e.g., Terraform, AWS CloudFormation, Ansible) on a hybrid cloud setup.
  • 7 years of experience in a Site Reliability Engineering (SRE), Infrastructure Engineering, or DevOps role, with at least 3 years in a technical leadership capacity.
  • Deep knowledge of cloud services and technologies (e.g., EC2, S3, Lambda, Kubernetes).
  • Proficiency in scripting or programming languages (Python, Go, Bash, etc.).
  • Experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, ELK stack).
  • Familiarity with Continuous Integration/Continuous Deployment (CI/CD) pipelines and cloud-native development practices.
  • Strong expertise in managing cloud infrastructure (AWS, Google Cloud, Azure) in production environments.
  • Experience with cloud-native architectures, microservices, and containerized environments (Kubernetes, Docker).
  • Proven experience in building and managing highly available, scalable, and fault-tolerant systems in the cloud.
  • Strong understanding of cloud networking, storage, compute services, on-prem infrastructure, and security best practices.

Behaviors & Abilities Required:

  • Strong technical leadership and mentoring abilities, with a track record of developing high-performance engineering teams.
  • Excellent problem-solving, troubleshooting, and diagnostic skills.
  • Ability to work in a cross-functional, collaborative environment.
  • Effective communication skills, with the ability to translate technical concepts for non-technical stakeholders.

About athenahealth

Here's our vision: To create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

What's unique about our locations?
From an historic 19th-century arsenal to a converted landmark power plant, all of athenahealth's offices were carefully chosen to represent our innovative spirit and promote the most positive and productive work environment for our teams. Our 10 offices across the United States and India, plus numerous remote employees, all work to modernize the healthcare experience together.

Our company culture might be our best feature.
We don't take ourselves too seriously. But our work? That's another story. athenahealth develops and implements products and services that support US healthcare: It's our chance to create healthier futures for ourselves, for our family and friends, for everyone.

Our vibrant and talented employees, or athenistas, as we call ourselves, spark the innovation and passion needed to accomplish our goal. We continue to expand our workforce with amazing people who bring diverse backgrounds, experiences, and perspectives at every level, and foster an environment where every athenista feels comfortable bringing their best selves to work.

Our size makes a difference too: We are small enough that your individual contributions will stand out but large enough to grow your career with our resources and established business stability.

Giving back is integral to our culture. Our athenaGives platform strives to support food security, expand access to high-quality healthcare for all, and support STEM education to develop providers and technologists who will provide access to high-quality healthcare for all in the future. As part of the evolution of athenahealth's Corporate Social Responsibility (CSR) program, we've selected nonprofit partners that align with our purpose and let us foster long-term partnerships for charitable giving, employee volunteerism, insight sharing, collaboration, and cross-team engagement.

What can we do for you?
Along with health and financial benefits, athenistas enjoy perks specific to each location, including commuter support, employee assistance programs, tuition assistance, employee resource groups, and collaborative workspaces; some offices even welcome dogs.

In addition to our traditional benefits and perks, we sponsor events throughout the year, including book clubs, external speakers, and hackathons. And we provide athenistas with a company culture based on learning, the support of an engaged team, and an inclusive environment where all employees are valued.

We also encourage a better work-life balance for athenistas with our flexibility. While we know in-office collaboration is critical to our vision, we recognize that not all work needs to be done within an office environment full-time. With consistent communication and digital collaboration tools, athenahealth enables employees to find a balance that feels fulfilling and productive for each individual situation.

Employment Type

Full-Time
