Head of Production Engineering & Site Reliability Engineering (SRE)

SS&C

Posted on : 04-06-2025

Employer Active

1 Vacancy

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Send me jobs like this

Job Alert

You will be updated with latest job alerts via email

Valid email field required

Send jobs

Job Location

London - UK

Monthly Salary

Not Disclosed

Salary Not Disclosed

Vacancy

1 Vacancy

Posted on : 04-06-2025

Job Description

As a leading financial services and healthcare technology company based on revenue SS&C is headquartered in Windsor Connecticut and has 27000 employees in 35 countries. Some 20000 financial services and healthcare organizations from the worlds largest companies to small and mid-market firms rely on SS&C for expertise scale and technology.

Job Description

About SS&C Technologies

SS&C is a global provider of investment and financial software-enabled services and software for the global financial services and healthcare industries. The GIDS product suite powers mission-critical investor and distributor services across asset managers insurance companies retirement providers and wealth management platforms.

Job Overview

As the Head of Production Engineering and Site Reliability Engineering (SRE) for the GIDS organisation you will lead a team responsible for the scalability resilience performance and reliability of cloud and hybrid infrastructure powering some of the most critical client-facing applications in financial services.

You will be the strategic and operational leader for platform reliability observability incident response CI/CD modernisation and developer productivity.

Why Join SS&C GIDS

Lead mission-critical infrastructure for a globally recognised financial technology provider.
Influence the technical direction of a high-impact product suite.
Build a modern engineering organisation with a strong culture of innovation ownership and reliability.

Key Responsibilities

Leadership & Strategy

Define and execute the vision and roadmap for Production Engineering and SRE within GIDS.
Build and lead globally distributed high-performance teams with a focus on talent development SRE culture and operational excellence.
Collaborate cross-functionally with Engineering Product Compliance and Infrastructure teams to improve system reliability and efficiency.

Production Operations & Incident Management

Own reliability uptime and performance KPIs for GIDS applications and services.
Implement a comprehensive incident management lifecycle (on-call escalation RCA blameless postmortems).
Reduce Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) through automated observability alerting and playbooks.

CI/CD and Platform Engineering

Oversee the development and evolution of CI/CD pipelines for all GIDS products using GitHub Actions ArgoCD TeamCity Octopus Deploy and GitOps principles.
Integrate static and dynamic code analysis vulnerability scanning artifact promotion and release gating into the SDLC.
Ensure pipeline scalability and governance while maintaining developer velocity.

Observability & Troubleshooting

Lead the implementation and usage of modern observability stacks (e.g. OpenTelemetry Prometheus Grafana Splunk Datadog).
Establish SLOs SLIs and error budgets with product and engineering teams.
Drive root cause identification using distributed tracing advanced log analysis and anomaly detection.

Security Audit & Compliance

Partner with security and compliance teams to embed controls into infrastructure and software delivery.
Automate audit evidence collection change tracking and access management (e.g. HashiCorp Vault OPA AWS IAM).
Ensure all systems meet internal and regulatory audit requirements (SOC2 GDPR etc.).

Infrastructure & Automation

Champion infrastructure-as-code (IaC) using Terraform Helm and Kubernetes for scalable cloud and hybrid deployments.
Optimise infrastructure cost elasticity and resilience through autoscaling canary deployments and chaos testing.
Maintain high SLAs for critical services running on Kubernetes AWS and on-prem hybrid infrastructure.

Talent Management & Culture

Attract retain and mentor top engineering talent with a strong focus on diversity and continuous learning.
Cultivate a culture of ownership transparency blameless accountability and operational excellence.
Drive career development through structured learning paths performance reviews and skills-based mentoring.

Talent Management & Global Operations

Build and scale a globally distributed 24/7 operations team ensuring consistent coverage and operational resilience.
Establish and enforce engineering and operational standards for deployments monitoring and incident response across geographies.
Implement and continuously refine a multi-tiered support structure (L1 L2 L3) with clear escalation paths and accountability.
Drive hiring onboarding and training initiatives that support both site reliability and continuous delivery.
Foster a strong engineering culture rooted in transparency autonomy learning and operational excellence.
Develop strategies to prevent burnout in around-the-clock operations including tooling automation and shift rotation planning.

Qualifications

Required:

10 years of experience in engineering with 5 years in a leadership role in SRE DevOps or Production Engineering.
Proven track record managing reliable scalable systems in a high-compliance environment (e.g. FinTech HealthTech).
Strong understanding of modern software development lifecycle CI/CD IaC and cloud-native technologies.
Expertise in Kubernetes AWS (or Azure/GCP) GitOps workflows observability tools and automation frameworks.
Excellent leadership communication and stakeholder management skills.
Certifications: AWS Certified Solutions Architect CKA/CKAD or relevant DevOps/SRE certs.
Familiarity with ISO/SOC2/GDPR compliance frameworks and evidence collection automation.

Unless explicitly requested or approached by SS&C Technologies Inc. or any of its affiliated companies the company will not accept unsolicited resumes from headhunters recruitment agencies or fee-based recruitment services.

SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race color religious creed gender age marital status sexual orientation national origin disability veteran status or any other classification protected by applicable discrimination laws.

Required Experience:

Director

Employment Type

Full-Time

Company Industry

Key Skills

Apply Now

About Company

SS&C

Report This Job

Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.

Start Now

Dr.Job AutoApply

3X your job search with AutoApply's AI for faster dream job results.

Head of Production Engineering & Site Reliability Engineering (SRE)

SS&C

Job Description

About SS&C Technologies

Job Overview

Why Join SS&C GIDS

Key Responsibilities

Leadership & Strategy

Production Operations & Incident Management

CI/CD and Platform Engineering

Observability & Troubleshooting

Security Audit & Compliance

Infrastructure & Automation

Talent Management & Culture

Talent Management & Global Operations

Qualifications

Required:

Employment Type

Company Industry

Key Skills

About Company

Similar Jobs

Senior Production Scheduler

Director of Mechanical Engineering

Structural Engineering Manager

Head of Finance

Head of Customer Experience

Site Acquisition Manager

Civil Engineer Site Design

Civil Engineer Site Design