Site Reliability Engineer (SRE) AI Platforms
Job Summary
If youre looking for a career that will help you stand out join HSBC and fulfil your potential - whether you want a career that could take you to the top or an exciting new direction we offer opportunities support and rewards that will take you further.
Were one of the largest banking and financial services organisations in the world with a network that covers more than 50 countries and territories. We aim to be where the growth is enabling businesses to thrive and economies to prosper and ultimately helping people fulfil their hopes and realise their ambitions.
We are seeking a Site Reliability Engineer (SRE) AI Platforms
In this fantastic role youll support reliability engineering for the global AI platform powering enterprise-scale adoption. Youll operate and improve cloud-native production services (Kubernetes CI/CD automation observability) in a team building shared AI foundations where engineering rigour responsible controls and scale actually matter.
As an HSBC employee in the UK youll have access to tailored professional development opportunities and a competitive pay and benefits package. This includes private healthcare for all UK-based employees enhanced maternity and adoption pay and support when you return to work and a contributory pension scheme with a generous employer contribution.
In this role you will:
- Run and support production services to meet availability reliability and scalability targets
- Implement core SRE practices: monitoring/alerting SLIs/SLOs and incident response
- Operate container platforms by deploying and managing workloads using Docker and Kubernetes (incl. tooling like Helm/Istio as applicable)
- Improve delivery reliability and speed by building/enhancing CI/CD pipelines.
- Automate operational work using scripting/coding (e.g. Python Bash or Go) and drive continuous improvement via post-incident reviews and documentation
To be successful in this role you should have the following skills:
- Extensive experience in SRE DevOps or production support
- Strong hands-on capability with Docker and Kubernetes (and related ecosystem such as Helm/Istio)
- Experience with a major cloud platform: AWS Azure or GCP
- Proven problem-solving/analytical skills for diagnosing complex production issues end-to-end
- An AI-native mindset (using AI-driven approaches such as coding assistants to improve productivity quality and engineering practices)
Being open to different points of view is important for our business and the communities we serve. At HSBC were dedicated to creating diverse and inclusive workplaces - no matter their gender ethnicity disability religion sexual orientation socio-economic background or age. We are committed to removing barriers and ensuring careers at HSBC are inclusive and accessible for everyone to be at their best. We take pride in being a Disability Confident Leader and will offer an interview to people with disabilities long term conditions or neurodivergent candidates who meet the minimum criteria for the role.
If you have a need that requires accommodations or changes during the recruitment process please get in touch with our Recruitment Helpdesk via .
Required Experience:
IC
About Company
HSBC Holdings plc is a British multinational investment bank and financial services holding company. It was the 7th largest bank in the world by 2018, and the largest in Europe, with total assets of US$2.558 trillion.