Reliability Operations Engineer
Job Summary
Working at Infobip means being part of something truly global. With 75 offices across six continents were not just building technology were shaping how more than 80% of the world connects and communicates.
As employees we take pride in contributing to the worlds largest and only full-stack cloud communication platform. But its not just what we do its how we do it: with curiosity passion and a whole lot of collaboration.
We operate with an AI-first mindset embedding intelligent tools into our daily workflows to work smarter and more efficiently. Every role here benefits from and contributes to this approach.
If youre looking for meaningful work and challenges that grow you in a culture where people show up with purpose this is your opportunity.
Lets build whats next together.
What this role is all about
As a Platform / Site Reliability Engineer you will ensure the stability reliability and continuous improvement of our platform. You will play a key role in incident management monitoring automation and technical enhancements within your teams scope. This role combines operational excellence problem-solving and engineering ownership with opportunities to mentor others and contribute to long-term platform reliability.
What youll do
- Create respond to and continuously improve platform alerts and runbooks
- Actively monitor the platform identify issues and triage incidents
- Perform impact assessments and communicate incident summaries clearly
- Escalate incidents to the correct owner teams and act as Incident Leader when required
- Execute mitigation actions to minimize impact and restore service
- Write test secure and maintain well-documented scripts and automation
- Work autonomously on complex technical tasks and initiatives
- Solve challenging technical problems in collaboration with senior engineers
- Ensure stable and reliable service delivery within the teams scope
- Drive technical improvements and reliability enhancements
- Mentor and support other engineers within the team
- Provide and receive constructive feedback to continuously improve performance
What makes you a strong fit
- Experience in platform operations SRE DevOps or similar engineering roles
- Strong understanding of monitoring alerting and incident management processes
- Hands-on scripting and automation experience
- Solid troubleshooting and root cause analysis skills
- Ability to work independently on complex technical topics
- Clear and structured communication skills during incidents
- Proactive mindset focused on reliability and continuous improvement
- Comfortable collaborating with cross-functional teams
- Fluent English spoken and written
Diversity drives connection
Infobip is built on diverse backgrounds perspectives and talents. Were proud to be an equal-opportunity employer and are committed to fostering an inclusive workplace.
No matter your race gender age background or identity if you have the passion and skills to thrive theres a place for you here.
All qualified applicants will receive consideration for employment without regard to race color ancestry religion age sex sexual orientation gender gender identity national origin citizenship disability veteran status or any other part of ones identity.
Read more about our hiring process.
#LI-DM1Required Experience:
IC
About Company
Infobip is a global leader in omnichannel communication, helping brands to create meaningful relationships with their customers, at scale. Our communications platform is powering a broad range of solutions, messaging channels, and tools for advanced customer engagement, security, auth ... View more