Salary Not Disclosed
1 Vacancy
We are seeking a Site Reliability Engineer (SRE) to help scale and secure mission-critical platforms for a leading financial institution in Amsterdam. As part of a cross-functional engineering team, you'll focus on observability, reliability, incident response, and operational excellence across distributed systems.
This role demands both engineering skill and operational discipline. Previous experience in a banking or regulated enterprise environment is mandatory.
Responsibilities:
Build and improve monitoring, logging, and alerting for high-availability systems
Support production reliability across Kubernetes, cloud, and on-prem environments
Define SLOs, SLIs, and error budgets in collaboration with development teams
Lead root cause analysis and incident response processes
Automate operational tasks and drive reliability through infrastructure-as-code
Contribute to playbooks, runbooks, and operational readiness reviews
Requirements:
3-5 years in an SRE, DevOps, or Platform Engineering role
Strong skills in observability tooling (Prometheus, Grafana, ELK, Splunk, etc.)
Experience with incident management and post-mortem analysis
Proficient with Kubernetes and infrastructure automation (Terraform, Helm)
Solid scripting (Bash, Python, Go)
Minimum 2 years in a banking or highly regulated enterprise environment
Comfortable working with InfoSec, Compliance, and Risk teams
What we offer:
Full-time position with long-term (12-month) scope
Mission-critical role with visibility across engineering and operations
Competitive salary and secondary benefits
Hybrid work setup (2-3 days onsite in Amsterdam)
Budget for tooling, training, and certifications
Note: Immediate availability or short notice (2 weeks) required. Banking experience is a strict must.
Full-Time