Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via emailDesign and Architect SRE element into all the existing and new apps and services along with defining several controls/processes that ensures SLAs/KPIs are met.
Define SLAs/SLIs/SLOs metrics at a technical level and ensure 100% adherence.
Proactively maintain services once they are live by measuring and monitoring availability latency and overall system health.
Respond quickly to issues and mobilise responsible individuals quickly to achieve the fasted possible resolution.
Support services before they go live through activities such as system design consulting developing software platforms and frameworks capacity planning and launch reviews
Scale system and service sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and speed of service resolution.
Continually analyse service to end customers with a view to enhancing customer experience eradicating issues fixing root causes and driving quality into everything we do.
Educating support operations and customer help desks to adapt to new ways of working by increasing skills and knowledge.
Perform RCAs publish reports and take it to the next level by inventing short/long term fixes and further Runbooks.
Be part of the Agile Mode of delivering Work Products by performing Backlog planning Sprint Planning Design Reviews Peer Reviews and Retrospectives
Experience in one or more of the following: C C Java Python Go Ruby or shell scripting
Experience with Windows and Unix/Linux operating systems internals and administration (e.g. filesystems system calls) or networking (e.g. TCP/IP routing network topologies and hardware)
Experience with containers and containers orchestration (e.g. Kubernetes Docker) Extensive knowledge of AWS
Hands-on experience with IAC tools such as Cloudformation and Terraform
Experience with Configuration Management tools such as Ansible Chef.
Experience with cloud hosted application-monitoring tools such as Kibana ELK stack etc
Experience with Observability tools such as Dynatrace or Datadog
Excellent communication skills with the ability to present complex technical information in a clear and concise manner to a variety of audiences both technical and non-technical
Comfortable working in a fast-paced multi-tasking dynamic environment
Experience with deployment automation working with platforms for configuration management provisioning and artifact repositories.
Preferred to have expertise with Make Maven Groovy Gitlab Gitlab pipelines ArgoCD AWS Codebuild/Codepipeline/CodeDeploy
Experience in improving internal processes and good understanding of security engineering
Capable of grasping modifying and maintaining systems and code developed by others.
Ability to debug and optimise code and automate routine tasks
Systematic problem-solving approach coupled with a strong sense of ownership drive and determination.
Ability to think outside the box and find innovative solutions to complex problems.
Full Time