Our client is looking for a Senior Reliability Engineer to join the team responsible for vehicle stock transparency and administration.
This product group provides the critical workflow engine that orchestrates process flows ensuring total visibility of vehicles until they are sold.
If you are passionate about system stability automated orchestration and driving operational excellence within a global IT landscape this role is for you.
6-8 Years related experience
Strong Java Angular and PostgreSQL Stack
Focus on High Availability & System Reliability
Position Details:
- Contract Start Date:
- Contract End Date:
- Location: Midrand/Menlyn/Rosslyn/Home Office Rotation
The Mission: You will be responsible for ensuring the reliability and availability of global production systems.
Your role involves building sophisticated monitoring dashboards managing incident resolutions within strict SLAs and driving technical lifecycle security measures.
You will collaborate closely with feature teams to automate process flows and maintain high-quality IT Service Continuity.
Qualifications & Experience:
- Education: Degree in IT or equivalent experience.
- Experience: 8 to 10 years of proven experience in IT Service Management DevOps Cloud infrastructure or similar roles.
Essential Skills (Verified):
- Deep ITSM & IT Operations knowledge
- Mastery of PIC (Problem Incident Change) processes
- Monitoring alerting and operational tooling expertise
- DevOps experience
- Operational KPI analysis and improvement measures
- Strong interpersonal and communication skills
- Cloud platforms (AWS preferred)
- Database SQL (MongoDB preferred)
- Code Versioning & Deployments (GitHub)
- Building and maintaining CI/CD pipelines and automation for build test and deployment processes
- Agile development methodologies
- Confluence / Jira
Advantageous Skills:
- Containerization technologies (Docker Kubernetes)
- Familiarity with RESTful APIs and certificate management as part of platform operations
- Apache Kafka
- Kibana
- Grafana
- Terraform
- Backup and continuity solutions
Key Responsibilities:
- Operate and monitor IT products
- Execute Incident Problem and Change Management processes
- Lead root cause analysis and postmortems
- Drive continuous improvement of operational quality and implement measures to prevent recurring incidents
- Perform system monitoring and remediation measures
- Manage release and deployment activities
- Develop maintain and optimise automated build test and deployment pipelines (CI/CD)
- Manage and optimise Cloud Infrastructure
- Maintain ISO-aligned system documentation
- Identify and implement technical lifecycle and security measures
- Manage Technical and IT Service Continuity Management activities
- Build monitoring dashboards and scripts to analyse systems reliability and availability; development relevant metrics
- Drive KPI compliance across IT Operations and IT Security
- Collaborate with stakeholders and feature teams
Important Application Details Location & Relocation Applicants based outside of Gauteng must be willing to relocate. Please note that relocation to the province will be at the candidates own cost.
Eligibility & Legal
- Citizenship: South African citizens and residents are preferred.
- Work Permits: Candidates with valid work permits will be considered.
- Privacy: By applying you consent to being added to our database and receiving updates until you unsubscribe.
iSanqa is your trusted Level 2 BEE recruitment partner dedicated to continuous improvement in delivering exceptional service. Specializing in seamless placements for permanent staff temporary resources and efficient contract management and billing facilitation iSanqa Resourcing is powered by a team of professionals with an outstanding track record. With over 100 years of combined experience we are committed to evolving our practices to ensure ongoing excellence.
Our client is looking for a Senior Reliability Engineer to join the team responsible for vehicle stock transparency and administration. This product group provides the critical workflow engine that orchestrates process flows ensuring total visibility of vehicles until they are sold. If you are pas...
Our client is looking for a Senior Reliability Engineer to join the team responsible for vehicle stock transparency and administration.
This product group provides the critical workflow engine that orchestrates process flows ensuring total visibility of vehicles until they are sold.
If you are passionate about system stability automated orchestration and driving operational excellence within a global IT landscape this role is for you.
6-8 Years related experience
Strong Java Angular and PostgreSQL Stack
Focus on High Availability & System Reliability
Position Details:
- Contract Start Date:
- Contract End Date:
- Location: Midrand/Menlyn/Rosslyn/Home Office Rotation
The Mission: You will be responsible for ensuring the reliability and availability of global production systems.
Your role involves building sophisticated monitoring dashboards managing incident resolutions within strict SLAs and driving technical lifecycle security measures.
You will collaborate closely with feature teams to automate process flows and maintain high-quality IT Service Continuity.
Qualifications & Experience:
- Education: Degree in IT or equivalent experience.
- Experience: 8 to 10 years of proven experience in IT Service Management DevOps Cloud infrastructure or similar roles.
Essential Skills (Verified):
- Deep ITSM & IT Operations knowledge
- Mastery of PIC (Problem Incident Change) processes
- Monitoring alerting and operational tooling expertise
- DevOps experience
- Operational KPI analysis and improvement measures
- Strong interpersonal and communication skills
- Cloud platforms (AWS preferred)
- Database SQL (MongoDB preferred)
- Code Versioning & Deployments (GitHub)
- Building and maintaining CI/CD pipelines and automation for build test and deployment processes
- Agile development methodologies
- Confluence / Jira
Advantageous Skills:
- Containerization technologies (Docker Kubernetes)
- Familiarity with RESTful APIs and certificate management as part of platform operations
- Apache Kafka
- Kibana
- Grafana
- Terraform
- Backup and continuity solutions
Key Responsibilities:
- Operate and monitor IT products
- Execute Incident Problem and Change Management processes
- Lead root cause analysis and postmortems
- Drive continuous improvement of operational quality and implement measures to prevent recurring incidents
- Perform system monitoring and remediation measures
- Manage release and deployment activities
- Develop maintain and optimise automated build test and deployment pipelines (CI/CD)
- Manage and optimise Cloud Infrastructure
- Maintain ISO-aligned system documentation
- Identify and implement technical lifecycle and security measures
- Manage Technical and IT Service Continuity Management activities
- Build monitoring dashboards and scripts to analyse systems reliability and availability; development relevant metrics
- Drive KPI compliance across IT Operations and IT Security
- Collaborate with stakeholders and feature teams
Important Application Details Location & Relocation Applicants based outside of Gauteng must be willing to relocate. Please note that relocation to the province will be at the candidates own cost.
Eligibility & Legal
- Citizenship: South African citizens and residents are preferred.
- Work Permits: Candidates with valid work permits will be considered.
- Privacy: By applying you consent to being added to our database and receiving updates until you unsubscribe.
iSanqa is your trusted Level 2 BEE recruitment partner dedicated to continuous improvement in delivering exceptional service. Specializing in seamless placements for permanent staff temporary resources and efficient contract management and billing facilitation iSanqa Resourcing is powered by a team of professionals with an outstanding track record. With over 100 years of combined experience we are committed to evolving our practices to ensure ongoing excellence.
View more
View less