Senior Site Reliability Engineer (SRE)

Leap29

Posted on : 24-05-2025

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

Wokingham - UK

Monthly Salary

Not Disclosed

Salary Not Disclosed

Vacancy

1 Vacancy

Posted on : 24-05-2025

Job Description

Senior Site Reliability Engineer (SRE)

Location: Wokingham (2 days a week onsite)
Type: Inside IR35
Rate: 80.00 an hour DOE

Were seeking a Senior Site Reliability Engineer to play a key role in the stability scalability and performance of critical platforms and applications. This is a leadership-level position suited to individuals who can move seamlessly between code infrastructure incident response and mentoring engineering teams.

Youll work across systems tools and teams to ensure platform reliability and enable continuous improvement in how software is built released and operated.

What Youll Be Responsible For

As a Senior SRE youll lead initiatives that:

Ensure availability latency and performance of mission-critical systems across cloud and hybrid environments.
Architect observability solutions (monitoring logging alerting) that detect and prevent failures before they impact users.
Own and improve incident response workflows including runbooks communications and root cause analysis.
Define and enforce SLIs SLOs and error budgets to balance innovation with operational stability.
Mentor engineers and advise teams on best practices for scalability security deployment and incident readiness.
Automate repetitive work via infrastructure-as-code CI/CD pipelines scripts and custom tooling.
Support and lead platform engineering efforts reliability reviews and cross-functional reliability programs.

Core Responsibilities

Operations Leadership

Act as a senior escalation point for major incidents and production outages.
Lead post-incident reviews coordinate root cause analysis and drive remediation plans.
Communicate platform health risk and improvement plans with technical and non-technical stakeholders.

Engineering & Automation

Design and build robust CI/CD workflows using tools such as Azure DevOps GitHub Actions Jenkins or GitLab.
Lead the design and delivery of resilient scalable infrastructure using IaC (Terraform Bicep etc.).
Develop automation and observability tooling that enables fast feedback loops and minimal manual intervention.

Strategic & Advisory

Define infrastructure architecture to support fault-tolerant applications.
Collaborate with developers architects and product teams to embed reliability into the software lifecycle.
Support implementation of secure scalable deployment patterns (e.g. blue-green canary releases rollback strategies).
Influence reliability culture and DevOps maturity across teams.

Technical Environment

The ideal candidate brings hands-on experience in many of the following areas:

Cloud & Infrastructure: Azure AWS OpenShift Kubernetes Docker App Services IaaS (e.g. EC2 VMs)
Observability: Datadog Prometheus Grafana Splunk ELK Application Insights CloudWatch
Automation & CI/CD: Terraform Bicep Azure DevOps Jenkins GitLab GitHub Actions
Languages & Scripting: Python C# Bash PowerShell
Networking: DNS SSL/TLS load balancing WAF proxies CDN Azure Application Gateway
Databases: MSSQL PostgreSQL MongoDB CosmosDB DynamoDB
OS & Systems: Windows and Linux internals Nginx IIS

Ideal Candidate Profile

Extensive experience (typically 5 years) in Site Reliability Engineering DevOps or Production Engineering roles.
A solid software engineering background with the ability to read write and review production-quality code.
Proven ability to lead incident response influence reliability culture and design for resilience.
Experience operating complex systems in fast-paced high-availability environments.
Strong collaborator who can work across development infrastructure and security disciplines.
Passion for solving operational problems through automation not repetition.

What Youll Bring

Ability to lead technical decisions while balancing risk and velocity.
Strong communication skills across technical and non-technical stakeholders.
A mindset of continuous improvement ownership and mentorship.
Commitment to eliminating toil improving developer experience and delivering reliable platforms at scale.

Ready to make reliability your legacy
Wed love to hear from experienced SREs who can bring stability to change and clarity to complexity.

Required Experience:

Senior IC

Employment Type

Full-Time

Company Industry

Key Skills

Apply Now

About Company

Leap29

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Senior Site Reliability Engineer (SRE)

Leap29

Job Description

Job Description

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Site Design Staff Engineer

On-Site Service Engineer

Site Manager

Senior Systems Engineer

Senior Process Development Engineer

Marketing Manager I (On-site)

Senior Customer Services Technical Specialist ( Senior Support Software Engineer) - Hybrid R0050712

Engineer - Test Engineer II