Site Reliability Engineering Lead

Lula

Job Location:

Cape Town - South Africa

Monthly Salary: Not Disclosed
Posted on: 12 hours ago
Vacancies: 1 Vacancy

Department:

Engineering

Job Summary

OVERALL PURPOSE

We are seeking an experienced Site Reliability Engineering Lead to lead, mentor, and grow our SRE team. The ideal candidate will have a deep understanding of Microsoft Azure, cloud computing, and distributed systems.

As the SRE Lead, you will be responsible for the overall strategy and execution of our SRE function. You will guide your team to monitor, maintain, and improve our Azure-based infrastructure and applications, ensuring their reliability, scalability, and security.

KEY RESPONSIBILITIES:

  • Lead, mentor, and develop a high-performing SRE team, fostering a culture of ownership, collaboration, and continuous improvement.
  • Manage the team's performance, including setting clear goals, conducting regular 1:1s, and supporting career development.
  • Collaborate with the software engineering manager on the recruitment process to grow the SRE team, ensuring a high bar for technical skill and cultural fit.
  • Own and manage the 24/7 on-call rotation and incident response process, acting as a key escalation point and driving effective root cause analysis (RCA) and remediation plans.
  • Define and drive the SRE technical roadmap, partnering with Engineers, DevOps, and SecOps to build and manage highly available, scalable, and resilient architectures on Azure.
  • Oversee the platform's monitoring and alerting strategy, guiding the team to build a holistic view of infrastructure and application performance using tools like Azure Monitor.
  • Champion automation by directing the team's development of scripts and tools to streamline deployment and management of Azure services.
  • Drive platform optimisation by analysing performance metrics and evaluating new Azure features and services to improve workflows.
  • Ensure the security of the Azure infrastructure by enforcing security policies and best practices in partnership with the SecOps team.
  • Foster a culture of delivery, continuous improvement, and innovation within the SRE team, encouraging experimentation.

THE EXPERIENCE WE'RE LOOKING FOR

  • Matric certificate or equivalent.
  • 5 years of experience in a senior SRE, DevOps, or Cloud Infrastructure role, with deep knowledge of maintaining Azure infrastructure.
  • Minimum 2 years of formal people management and leadership experience.
  • Demonstrable experience leading incident response and root cause analysis.
  • Strong understanding of Azure services such as Web Applications, Functions, and Application Gateways.
  • Strong experience with automation tools such as PowerShell, Azure CLI, and ARM templates.
  • Deep experience with monitoring and logging tools such as Azure Monitor, Grafana (or similar), Log Analytics, Application Insights, and Logic Apps.
  • Excellent troubleshooting, problem-solving, and strategic planning skills.
  • Azure certification(s) preferred, such as Azure Administrator Associate.
  • Strong familiarity with DevOps practices and tools such as Jira and OpsGenie.

Our Tech Stack

  • Cloud Platform: Microsoft Azure
  • Automation & IaC: Azure CLI, Python, Azure DevOps, GitHub, and Terraform.
  • Monitoring & Observability: Azure Monitor, Log Analytics, and Grafana.
  • Operations & Incident Management: Jira, Sentinel, and OpsGenie.
  • Dev Stack: .NET, React, MS SQL

Required Experience:

IC

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root Cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting
