Infrastructure Engineer Monitoring
We are looking for a highly skilled infrastructure engineer who is proficient with open-source monitoring solutions such as Zabbix or Nagios. The ideal candidate will have experience designing, implementing, and managing enterprise monitoring solutions, taking ownership of monitoring strategies, enhancing system reliability, and delivering value by implementing best practices.
In this role, you will collaborate with cross-functional teams to ensure seamless monitoring of infrastructure, applications, and services, leveraging Zabbix as a core tool alongside other monitoring solutions.
This role requires working during EMEA hours.
Key Responsibilities
- Configure, deploy, and maintain Zabbix servers, proxies, agents, and databases.
- Develop and optimize templates, triggers, alerts, and custom monitoring solutions.
- Perform upgrades, migrations, and scaling of Zabbix infrastructure.
- Design robust monitoring and best-practice frameworks to meet operational and business requirements.
- Integrate Zabbix with other monitoring tools and platforms for comprehensive monitoring (e.g. ELK or Grafana).
- Develop business service monitoring, data visualization, and availability reporting.
- Develop custom scripts, plugins, or modules to extend Zabbix capabilities.
- Automate monitoring processes using APIs, scripting languages (e.g. Python, Bash), and infrastructure-as-code tools.
- Define and implement proactive alerting mechanisms to ensure early detection of issues.
- Collaborate with various teams within Group Technology to diagnose and resolve incidents quickly.
- Continuously analyse and improve the performance of Zabbix systems and monitored infrastructure.
- Conduct capacity planning and ensure scalability of monitoring solutions.
- Create thorough documentation for Zabbix configurations, monitoring procedures, and troubleshooting guidelines.
- Train team members on using Zabbix effectively and interpreting monitoring data.
Key Competencies:
- Proficiency with Zabbix API and integrations with third-party tools.
- Experience creating and optimizing templates, triggers, and custom monitoring items.
- Strong understanding of open-source systems and network protocols.
- Experience with cloud platforms and containerization (Docker, Kubernetes).
- Strong analytical and problem-solving skills.
- Ability to work independently and as part of a team.
Qualifications:
Desirable Qualifications:
- Familiarity with other monitoring tools (e.g. Grafana / ELK stack).
- Experience with open-source database clustering, performance monitoring, and optimization.
- Certifications related to Zabbix, cloud platforms, or monitoring tools.
- Experience with large-scale distributed monitoring environments.
- ITIL Foundation certification.
Remote Work: No
Employment Type: Full-time