drjobs Senior Cloud Site Reliability Engineer

Senior Cloud Site Reliability Engineer

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

Bengaluru - India

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

Join us as we work to create a thriving ecosystem that delivers accessible high-quality and sustainable healthcare for all.

athenahealth is a progressive & innovative U.S. health-tech leader delivering cloud-based solutions that improve clinical and financial performance across the care continuum. Our modern open ecosystem connects care teams and delivers actionable insights that drive better outcomes. Acquired by Bain Capital in a $17B deal. We foster a values-driven culture focused on flexibility collaboration and work-life balance.

Headquartered in Boston
we have offices in Atlanta Austin Belfast Burlington and in India: Bangalore Chennai and Pune.

Position Summary:
We are looking for aSenior Site Reliability Engineerto join ourCloud Infrastructure Engineeringdivision in Bangalore. This team ensures the continuous availability of the technologies and systems that form the foundation of athenahealths services.

We are directly responsible for thousands of servers petabytes of storage and handling thousands of web requests per secondall while supporting rapid growth. Our mission is to enable an operating system for the medical office that abstracts away administrative complexity allowing doctors to focus on practicing medicine.

About You: Youre a seasoned engineer with a passion for solving reliability and scalability challenges. Youre curious collaborative and driven to improve systems. You enjoy uncovering inefficiencies automating solutions and striving for operational excellence. Youre a fast learner an excellent communicator and a champion of engineering best practices.

The ideal candidate will havestrong expertise in AWS and Kubernetes along with hands-on experience inTerraform CI/CD pipelines and scripting(e.g. Python Bash Go).Experience with AI toolssuch asWindsurf GitHub Copilot or similar will be considered aplus.

The Team: We are a team of Site Reliability Engineers passionate about reliability automation and scalability. We follow an agile framework to prioritize high-impact work. Supporting both private and public cloud environments we make data-driven decisions to choose the best fit for the business. We relentlessly automate manual tasks to focus on strategic initiatives.

Key Responsibilities

Reliability & Availability

  • Define measure and maintain SLOs and SLIs for cloud services and infrastructure.
  • Lead efforts to improve system availability fault tolerance and disaster recovery.
  • Ensure proactive incident detection root cause analysis and timely resolution.
  • Participate in a 24x7 on-call rotation.

Automation & Infrastructure as Code (IaC)

  • Drive automation to reduce manual intervention in cloud infrastructure management.
  • Implement IaC using tools likeTerraform AWS CloudFormation and Ansible.
  • Automate deployment scaling and monitoring processes.

Monitoring Observability & Performance

  • Design and implement monitoring logging and alerting solutions.
  • Use observability tools (e.g. Prometheus Grafana CloudWatch) for performance insights.
  • Identify and resolve performance bottlenecks.

Security & Compliance

  • Build cloud infrastructure with security best practices and compliance in mind.
  • Collaborate with security teams to implement controls and mitigate risks.
  • Conduct regular audits for vulnerabilities and compliance gaps.

Collaboration & Leadership

  • Partner with development DevOps and operations teams to align infrastructure with business needs.
  • Mentor junior engineers and promote a culture of operational excellence.
  • Serve as a technical point of contact for infrastructure-related issues.

Incident Management & Post-Mortems

  • Lead incident response for cloud infrastructure issues.
  • Conduct post-incident reviews and implement preventive measures.
  • Continuously improve incident management processes.


Qualifications

  • 59 years of hands-on experience with cloud automation and configuration tools (e.g.Terraform CloudFormation Ansible) in a hybrid cloud setup.
  • 4 years in SRE Infrastructure Engineering or DevOps roles
  • Deep expertise inAWSservices (e.g. EC2 S3 Lambda) andKubernetes.
  • Proficiency in scripting/programming (e.g.Python Go Bash).
  • Experience with observability tools (e.g. Prometheus Grafana Datadog ELK).
  • Familiarity withCI/CD pipelinesand cloud-native development practices.
  • Strong experience managing production environments in AWS GCP or Azure.
  • Knowledge of cloud-native architectures microservices and containerization (Kubernetes Docker).
  • Proven ability to build scalable fault-tolerant systems.
  • Solid understanding of cloud networking storage compute and security best practices.
  • Bonus:Experience with AI tools such asWindsurfGitHub Copilot or similar.

About athenahealth

Our vision: In an industry that becomes more complex by the day we stand for simplicity. We offer IT solutions and expert services that eliminate the daily hurdles preventing healthcare providers from focusing entirely on their patients powered by our vision to create a thriving ecosystem that delivers accessible high-quality and sustainable healthcare for all.

Our company culture: Our talentedemployees or athenistas as we call ourselves spark the innovation and passion needed to accomplish our vision. We are a diverse group of dreamers and do-ers with unique knowledge expertise backgrounds and perspectives. We unite as mission-driven problem-solvers with a deep desire to achieve our vision and make our time here count. Our award-winning culture is built around shared values of inclusiveness accountability and support.

Our DEI commitment: Our vision of accessible high-quality and sustainable healthcare for all requires addressing the inequities that stand in the way. Thats one reason we prioritize diversity equity and inclusion in every aspect of our business from attracting and sustaining a diverse workforce to maintaining an inclusive environment for athenistas our partners customers and the communities where we work and serve.

What we can do for you:

Along with health and financial benefits athenistas enjoy perks specific to each location including commuter support employee assistance programs tuition assistance employee resource groups and collaborativeworkspaces some offices even welcome dogs.

We also encourage a better work-life balance for athenistas with our flexibility. While we know in-office collaboration is critical to our vision we recognize that not all work needs to be done within an office environmentfull-time. With consistent communication and digital collaboration tools athenahealthenablesemployees to find a balance that feels fulfilling and productive for each individual situation.

In addition to our traditional benefits and perks we sponsor events throughout the year including book clubs external speakers and hackathons. We provide athenistas with a company culture based on learning the support of an engaged team and an inclusive environment where all employees are valued.

Learn more about our culture and benefits here:

Experience:

Senior IC

Employment Type

Full-Time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.