The Engineering chapter team is seeking a Site Reliability Engineering (SRE) service to guarantee the reliability, scalability, monitoring, and performance of on-premises services within a product-oriented environment. The role focuses on designing and implementing best practices, improving infrastructure, and collaborating with cross-functional teams to ensure stability, observability, and high availability.
Responsibilities
- Design and maintain monitoring infrastructure
- Create dashboards, alerts, and visualization solutions
- Implement distributed tracing and log aggregation systems
- Establish monitoring standards, SLI/SLO frameworks, and best practices
- Ensure security compliance for on-prem monitoring tools
- Automate deployment and configuration processes
- Collaborate with development teams on instrumentation
- Participate in on-duty rotations (24/7 incident support)
Qualifications :
Core Technologies
- Advanced skills with Grafana
- Prometheus and PromQL
- OpenTelemetry
- Elasticsearch
Infrastructure
- Linux administration
- Networking
- On-premises security
Programming (for automation)
Experience
- 3 years in monitoring/observability
- 2 years using Grafana & Prometheus in production
- Strong background in Linux system administration
- Demonstrated experience with on-prem infrastructure
Security
- Knowledge of enterprise security practices
- Understanding of compliance requirements
Other Skills
- Ability to balance technical and business priorities
- Willingness to participate in 24/7 on-duty rotations
Key Deliverables
- Reduced MTTD and MTTR through robust monitoring
- Full observability across all systems
- Automated monitoring deployment and management pipelines
- Security-compliant monitoring framework
Additional Information :
Possess a work permit allowing the individual to work in Belgium.
Hold a valid residence permit confirming the right of residence in Belgium.
Remote Work :
No
Employment Type :
Full-time