Principal Systems Engineer

Harris


Job Location:

London - UK

Monthly Salary: Not Disclosed
Posted on: 8 hours ago
Vacancies: 1 Vacancy

Job Summary

Site Reliability Engineer (SRE) - Remote

Overview
As a Site Reliability Engineer (SRE) at Altera you will be responsible for ensuring the reliability scalability and performance of our hosted healthcare platforms. This role blends software and systems engineering to enhance service availability automate operations and improve the customer experience. You will act as a technical leader in monitoring troubleshooting incident response and continuous improvement across our cloud and hybrid environments.

Key Responsibilities

  • Maintain and improve the reliability availability and performance of our production environments.
  • Lead the investigation and resolution of complex application database and infrastructure issues.
  • Participate in incident management conduct root cause analysis (RCA) and contribute to post-incident reviews to prevent future occurrences.
  • Define and measure Service Level Indicators (SLIs) and Objectives (SLOs) to meet our service commitments.
  • Develop proactive monitoring and alerting strategies to identify and resolve issues before they impact customers.
  • Automate operational tasks using scripting and Infrastructure-as-Code (IaC) to improve efficiency.
  • Partner with engineering and cloud teams to refine deployment monitoring and support processes.
  • Provide technical leadership during major incidents and act as a key escalation point for critical issues.

Qualifications

Experience:

  • 7 years of experience supporting enterprise applications infrastructure or cloud environments.
  • Monitoring & Observability: Strong experience with APM tools such as LogicMonitor AppDynamics Azure Monitor SentryOne Dynatrace Datadog or New Relic.
  • Microsoft Stack: Deep knowledge of Windows Server administration applications Windows Clustering MSMQ Event Logs and PerfMon.
  • Database Skills: Strong SQL Server experience including performance tuning query optimization blocking analysis and Always On Availability Groups.
  • Cloud & Networking: Experience with Azure cloud environments and a solid understanding of networking fundamentals (DNS TCP/IP load balancing firewalls).
  • ITSM & ITIL: Familiarity with ServiceNow (or other ITSM platforms) and ITIL principles.

Preferred Skills:

  • Scripting with PowerShell Python or similar languages.
  • Infrastructure as Code (Terraform ARM Templates Bicep).
  • CI/CD pipelines and deployment automation (Azure DevOps GitHub Actions).
  • Experience with Kubernetes and containerized workloads.
  • Experience implementing SLOs SLIs and Error Budgets.
  • Experience in a healthcare technology or patient care environment.

Education:

  • Bachelors Degree in Computer Science Information Technology or Engineering is preferred; equivalent professional experience will be considered.

Working Arrangements

  • This is a remote position open to candidates within the United States.
  • You will participate in an on-call rotation to support our 24x7 healthcare environment.
  • Occasional after-hours work is required for activations upgrades and major incidents.

Travel

  • Travel is not a requirement for this role.

Why Altera
At Altera Digital Health you will have the opportunity to profoundly impact the lives of patients by empowering healthcare providers to deliver superior care. You will join a passionate and gifted team committed to innovation and excellence. We offer a competitive compensation and benefits package and the opportunity to work in a fast-paced and dynamic environment.


Required Experience:

Staff IC

Site Reliability Engineer (SRE) - RemoteOverviewAs a Site Reliability Engineer (SRE) at Altera you will be responsible for ensuring the reliability scalability and performance of our hosted healthcare platforms. This role blends software and systems engineering to enhance service availability automa...

About Company

Company Logo

Harris is an acquirer of software businesses. Our focus is to acquire businesses with growth potential, manage them well and build them for the future.

View Profile View Profile