Job Title: Site Reliability Engineer (SRE)
Location: Lisbon, Portugal (Hybrid)
Job Type: Contract (6 months)
Role Overview:
We are looking for an experienced Site Reliability Engineer (SRE) to support business-critical systems in the banking and financial services domain. The role focuses on production support, monitoring, automation, CI/CD pipelines, and incident management, ensuring system stability, availability, and operational excellence in a regulated environment.
Key Responsibilities:
Provide end-to-end production support for critical banking applications and platforms.
Monitor system performance, availability, and reliability using monitoring and observability tools.
Proactively identify issues, troubleshoot them, and prevent service disruptions.
Manage incidents and major incidents, including:
Incident triage and resolution
Root cause analysis (RCA)
Incident and post-incident reporting
Design, implement, and maintain CI/CD pipelines for reliable, automated deployments.
Automate infrastructure and operational tasks using Infrastructure as Code (IaC) principles.
Use Ansible for configuration management and automation.
Collaborate closely with development, QA, operations, and security teams.
Ensure systems comply with banking security and regulatory standards.
Continuously improve monitoring, alerting, incident response, and operational processes.
Required Skills & Experience:
5-7 years of experience in Site Reliability Engineering (SRE), DevOps, or a related role.
Strong hands-on experience in production support for high-availability environments.
Proven experience with incident management and handling critical production issues.
Experience creating and maintaining incident reports and RCA documentation.
Solid knowledge of monitoring and observability tools (e.g., Prometheus, Grafana, ELK, Splunk, Datadog, or similar).
Hands-on experience with CI/CD tools such as Jenkins, GitLab CI, Azure DevOps, or equivalent.
Strong expertise in Ansible for automation and configuration management.
Good knowledge of Linux systems and scripting (Bash, Python, or similar).
Experience working in or supporting banking or financial services systems (highly preferred).
Nice to Have:
Experience with cloud platforms (AWS, Azure, or GCP).
Knowledge of containerization and orchestration (Docker, Kubernetes).
Familiarity with security, compliance, and risk controls in regulated environments.
Experience supporting hybrid (on-premises and cloud) infrastructures.
Recruitment Partner: Sperton
This position is exclusively managed by Sperton, a global talent partner connecting high-performing professionals with leading organizations worldwide.