Location: Kuala Lumpur Malaysia On-Site Start Date: June/July 2026 Shift Requirement: 24/7 On-Call Rotations Salary: Up to MYR 9000 (Preferred BPO & SRE Experience)
What is a Site Reliability Engineer (SRE)
A Site Reliability Engineer (SRE) combines software engineering and systems engineering to build maintain and optimize large-scale highly available and fault-tolerant systems. This role focuses on automation infrastructure reliability system scalability and operational excellence in a fast-paced BPO environment supporting infrastructure services and AML systems.
Key Responsibilities
Design build and maintain scalable highly available and fault-tolerant systems Collaborate with software engineering teams to improve reliability and system performance Develop automation procedures to reduce manual intervention and improve operational efficiency Monitor infrastructure performance and proactively identify system bottlenecks Implement and maintain monitoring tools automated alerts SLIs SLOs and SLAs Participate in 24/7 on-call rotations including scheduled shifts and holidays Conduct root-cause analysis and lead blameless post-mortems to prevent recurring issues Ensure systems comply with security standards and regulatory requirements
Requirements
Bachelors or Masters degree in Computer Science Information Technology Computer Engineering or a related field Minimum 3 years of experience as a Site Reliability Engineer Systems Engineer or Software Engineer Strong proficiency in at least one programming language such as Python Go Java or C Experience with shell scripting Linux operating systems and network architecture Strong understanding of relational databases and database modeling Experience with Docker and Kubernetes is highly preferred Familiarity with monitoring tools such as Prometheus and Grafana Exposure to machine learning frameworks such as TensorFlow PyTorch MXNet or PaddlePaddle is an advantage Excellent communication skills and ability to collaborate in cross-functional teams Able to work in a fast-paced environment with rotational on-call responsibilities
Preferred Candidate Profile
Experience working in BPO environments is highly preferred Strategic thinker with strong troubleshooting and analytical skills Passionate about automation reliability and infrastructure optimization
Whats in it for You
Competitive salary package up to MYR 9000 Opportunity to work on large-scale distributed systems and infrastructure Exposure to modern cloud-native technologies and automation tools Career growth opportunities within a global organization Collaborative and innovation-driven work environment Continuous learning and professional development opportunities
Location: Kuala Lumpur Malaysia On-SiteStart Date: June/July 2026Shift Requirement: 24/7 On-Call RotationsSalary: Up to MYR 9000 (Preferred BPO & SRE Experience) What is a Site Reliability Engineer (SRE) A Site Reliability Engineer (SRE) combines software engineering and systems engineering to buil...
Location: Kuala Lumpur Malaysia On-Site Start Date: June/July 2026 Shift Requirement: 24/7 On-Call Rotations Salary: Up to MYR 9000 (Preferred BPO & SRE Experience)
What is a Site Reliability Engineer (SRE)
A Site Reliability Engineer (SRE) combines software engineering and systems engineering to build maintain and optimize large-scale highly available and fault-tolerant systems. This role focuses on automation infrastructure reliability system scalability and operational excellence in a fast-paced BPO environment supporting infrastructure services and AML systems.
Key Responsibilities
Design build and maintain scalable highly available and fault-tolerant systems Collaborate with software engineering teams to improve reliability and system performance Develop automation procedures to reduce manual intervention and improve operational efficiency Monitor infrastructure performance and proactively identify system bottlenecks Implement and maintain monitoring tools automated alerts SLIs SLOs and SLAs Participate in 24/7 on-call rotations including scheduled shifts and holidays Conduct root-cause analysis and lead blameless post-mortems to prevent recurring issues Ensure systems comply with security standards and regulatory requirements
Requirements
Bachelors or Masters degree in Computer Science Information Technology Computer Engineering or a related field Minimum 3 years of experience as a Site Reliability Engineer Systems Engineer or Software Engineer Strong proficiency in at least one programming language such as Python Go Java or C Experience with shell scripting Linux operating systems and network architecture Strong understanding of relational databases and database modeling Experience with Docker and Kubernetes is highly preferred Familiarity with monitoring tools such as Prometheus and Grafana Exposure to machine learning frameworks such as TensorFlow PyTorch MXNet or PaddlePaddle is an advantage Excellent communication skills and ability to collaborate in cross-functional teams Able to work in a fast-paced environment with rotational on-call responsibilities
Preferred Candidate Profile
Experience working in BPO environments is highly preferred Strategic thinker with strong troubleshooting and analytical skills Passionate about automation reliability and infrastructure optimization
Whats in it for You
Competitive salary package up to MYR 9000 Opportunity to work on large-scale distributed systems and infrastructure Exposure to modern cloud-native technologies and automation tools Career growth opportunities within a global organization Collaborative and innovation-driven work environment Continuous learning and professional development opportunities