Salary Not Disclosed
1 Vacancy
As a Site Reliability Engineer, you will join our client's 25-member engineering team, which is focused on driving reliability and operational excellence across our platform.
You will play a critical role in shaping our observability strategy, implementing automation at scale, and developing tools to boost engineering productivity.
Your work will directly impact the reliability and scalability of systems used by thousands of users worldwide.
Excellent teams with global collaboration
Strong work-life balance with flexible hours
Agile working environment
POSITION: Contract until December 2027
EXPERIENCE: 8 Years related working experience
COMMENCEMENT: 01 August 2025
Qualifications / Experience Required:
IT Degree and/or relevant qualifications
10 years of experience in a Site Reliability Engineer, DevOps Engineer, or similar role, preferably in a technology-driven environment
Essential Skills Requirements:
Strong understanding of networking fundamentals
Skilled with AWS
Proficiency in at least one programming language: Python, Go, or JavaScript/TypeScript
Understanding of containerization (Docker) and orchestration principles
Experience with monitoring and alerting systems
Understanding of CI/CD principles
Version control with Git
Any additional responsibilities assigned in the Agile Working Model (AWM) Charter
Advantageous Skills Requirements:
Advanced Kubernetes knowledge and certification (CKA/CKAD)
Experience with the complete Grafana stack (Grafana, Loki, Tempo)
Proficiency with GitOps tools (Flux, ArgoCD)
Advanced programming skills in Go or TypeScript
Knowledge of Terraform
Database experience with PostgreSQL, MongoDB
Multi-environment infrastructure management
Infrastructure testing methodologies
Experience leading major incident response
Implementation of SRE practices at scale
Performance optimization experience
Capacity planning and load testing
Experience with cost optimization
Security hardening and compliance implementation
Role and Responsibilities:
Core responsibilities include:
Designing and implementing robust, scalable infrastructure solutions for system reliability
Architecting and maintaining comprehensive monitoring and alerting solutions
Creating automated workflows to minimize toil and human error
Leading incident response efforts and driving continuous improvements
Providing technical leadership and mentoring team members
Building internal tools that enhance operational efficiency
Collaborating closely with development teams to improve service reliability
Establishing and enforcing SRE best practices across the organization
Daily technologies and focus areas:
Containerization: Kubernetes, Docker
Observability: Grafana Stack, Prometheus
GitOps: Flux, ArgoCD
CI/CD: Modern pipeline tools
Cloud-native infrastructure
Programming languages: Go, Python, TypeScript/JavaScript
Microservices architecture
Multi-region deployments with high availability requirements
Managing complex dependencies
PLEASE NOTE:
By applying for this role, you consent to be added to the iSanqa database and to receive updates until you unsubscribe.
Also note that if you have not received a response from us within 2 weeks, your application was unsuccessful.
Candidates MUST be based in Gauteng or WILLING TO RELOCATE!
iSanqa is your trusted Level 2 BEE recruitment partner, dedicated to continuous improvement in delivering exceptional service. Specializing in seamless placements for permanent staff and temporary resources, as well as efficient contract management and billing facilitation, iSanqa Resourcing is powered by a team of professionals with an outstanding track record. With over 100 years of combined experience, we are committed to evolving our practices to ensure ongoing excellence.
Full Time