Site Reliability Engineering Lead – Spring

Experian

Not Interested
Bookmark
Report This Job

profile Job Location:

Nottingham - UK

profile Monthly Salary: Not Disclosed
Posted on: 4 hours ago
Vacancies: 1 Vacancy

Job Summary

We are looking for an enthusiastic SRE Lead to work in Project Spring at the forefront of our cloud modernisation within our Credit & Verification Services.

Background:

This is an incredibly exciting time for the Experian UKI Region as we look to build our presence in the UK and Hyderabad and work on a technology transformation to meet our aspiration to significantly scale our business over the next five years. This an opportunity to join Credit & Verification Services on this journey and be part of a collaborative team that uses Agile DevSecOps principles to deliver value.

Credit and Verification Services comprises nearly 100 engineering teams who deliver over 200 products achieving significant revenue per annum for our UK Business. We pride ourselves in excellence adopting best practices and holding ourselves to the highest standards.

The Domain:

As a member of the Project Spring team within Credit and Verification Services youll be part of a forward-thinking delivery group at the forefront of transforming how credit information is accessed in the UK. Were leading the charge in moving the Experian UK credit report ecosystem to the cloudmodernizing legacy systems and unlocking new possibilities for data-driven insights.

Role Context

Reporting into our Senior Director of Engineering and Delivery you will lead the reliability strategy for mission-critical systems and lead a team of engineers to ensure high availability scalability and performance. You will combine technical expertise with leadership skills to lead operational excellence and foster a culture of reliability across engineering teams.

Main Responsibilities:

  • Leadership & Strategy
    • Develop SRE best practices across the organization.
    • Expertise in production support resilience engineering disaster recovery (DCR) automation and cloud operations
    • Guide a team of SREs encouraging growth and technical excellence.
    • Collaborate with senior stakeholders to align reliability goals with our goals.
  • Reliability & Performance
    • Establish SLIs SLOs and SLAs for critical services and ensure adherence.
    • Lead programs to improve system resilience and reduce operational toil.
    • Excellent in designing systems that detect and improve issues without manual intervention Self Healing systems Runbook automation
    • Exposure to tools like Gremlin Chaos Monkey AWS FIS to simulate outages and improve fault tolerance
  • Incident Management
    • Be a primary point of escalation for critical production issues and lead major incident response root cause analysis and postmortems.
    • Perform detailed post-incident investigations to identify underlying causes. Document findings and share insights to prevent recurrence.
    • Implement preventive measures and continuous improvement processes.
  • Observability
    • Champion monitoring logging and alerting strategies using tools like Prometheus Grafana ELK and AWS CloudWatch.
    • Build real-time dashboards to visualize system health and reliability metrics.
    • Configure intelligent alerting based on anomaly detection and thresholds.
    • Combine metrics logs and trace to allow root cause analysis and reduce Mean Time to Resolution (MTTR).
    • Knowledge of AIOps or ML-based anomaly detection for proactive reliability management.

Qualifications :

Qualified with a degree Computer Science MCA in Computer Science Bachelor of Technology in Engineering or higher.

  • Hands-on technologist with a significant experience working in software development with experience leading an SRE team.
  • Deep expertise with multiple AWS services. Experience of monitoring and observability tools.
  • Experience working with geographically distributed teams promoting inclusive collaboration across diverse cultures and backgrounds.
  • Provide solutions and feedback clearly to both technical and non-technical stakeholders.
  • Adept at managing conflict constructively.
  • Proven track record of building secure mission-critical high-volume transaction web-based software systems in regulated environments (finance and insurance industries).

Additional Information :

  • Hybrid working
  • Great compensation package and discretionary bonus
  • Core benefits include pension bupa healthcare sharesave scheme and more
  • 25 days annual leave with 8 bank holidays and 3 volunteering days. You can purchase additional annual leave.

We take our people agenda very seriously and focus on what matters; DEI work/life balance development authenticity collaboration wellness reward & recognition volunteering... the list goes on. Experians people first approach is award-winning; Worlds Best Workplaces 2024 (Fortune Top 25) Great Place To Work in 24 countries and Glassdoor Best Places to Work 2024 to name a few. Check out Experian Life on social or our Careers Site to understand why.

Experian is proud to be an Equal Opportunity and Affirmative Action employer. Innovation is an important part of Experians DNA and practices and our diverse workforce drives our success. Everyone can succeed at Experian and bring their whole self to work irrespective of their gender ethnicity religion colour sexuality physical ability or age. If you have a disability or special need that requires accommodation please let us know at the earliest opportunity.

Experian Careers - Creating a better tomorrow together

Find out what its like to work for Experian by clicking here

#LI-Hybrid

This is a hybrid remote/in-office role.

Experian Careers - Creating a better tomorrow together

Find out what its like to work for Experian by clicking here


Remote Work :

No


Employment Type :

Full-time

We are looking for an enthusiastic SRE Lead to work in Project Spring at the forefront of our cloud modernisation within our Credit & Verification Services.Background:This is an incredibly exciting time for the Experian UKI Region as we look to build our presence in the UK and Hyderabad and work on ...
View more view more

Key Skills

  • Kubernetes
  • FMEA
  • Continuous Improvement
  • Elasticsearch
  • Go
  • Root cause Analysis
  • Maximo
  • CMMS
  • Maintenance
  • Mechanical Engineering
  • Manufacturing
  • Troubleshooting

About Company

Company Logo

Experian is a global data and technology company, powering opportunities for people and businesses around the world. We help to redefine lending practices, uncover and prevent fraud, simplify healthcare, create marketing solutions, and gain deeper insights into the automotive market, ... View more

View Profile View Profile