Principal SRE Cross-Cluster SRE Lead
Kuala Lumpur - Malaysia
Job Summary
ABOUT US
Were the worlds leading provider of secure financial messaging services headquartered in Belgium. We are the way the world moves value across borders through cities and overseas. No other organisation can address the scale precision pace and trust that this demands and were proud to support the global economy.
Were unique too. We were established to find a better way for the global financial community to move value a reliable safe and secure approach that the community can trust completely. Were always striving to be better and are constantly evolving in an ever-changing landscape without undermining that trust. Five decades on our vibrant community reflects the complexity and diversity of the financial ecosystem. We innovate diligently test exhaustively then implement a connected and exciting era our mission has never been more relevant. Swift now has a presence in 200 countries and legal territories to serve a community of more than 12000 banks and financial institutions.
Key Responsibilities
Cross-Cluster Standardization
- Define and enforce incident management practices
- Standardize alerting monitoring and request handling
- Align workflows across ServiceNow and Jira
- Ensure consistency across all clusters
Reliability Engineering
- Define SLO SLA MTTR MTRS standards
- Identify systemic reliability gaps across clusters
- Drive incident reduction and prevention strategies
- Establish reliability as a measurable discipline
Automation Strategy
- Identify cross-cluster automation opportunities
- Define reusable automation patterns and frameworks
- Eliminate duplicated operational solutions
- Drive reduction of manual toil
Architecture Alignment
- Partner with Solution Architects across clusters
- Ensure operability is built into system design
- Align monitoring alerting and failover strategies
- Prevent conflicting tooling or architectural decisions
Governance and Reviews
- Lead cross-cluster SRE reviews
- Track adoption of standards and practices
- Drive accountability across clusters
- Highlight systemic risks and gaps
Technical Leadership
- Guide SRE leads across clusters
- Raise technical standards within SRE
- Mentor engineers on reliability practices
- Influence engineering teams on operability
Minimum Requirements:
Experience
- 15 years in software engineering platform engineering or SRE
- Experience operating production systems at scale
- Experience across multiple systems or domains
Technical Depth
Strong in at least two areas:
- Distributed systems
- Observability and monitoring
- Infrastructure and cloud platforms
- Automation and software engineering
Capabilities
- Strong debugging and incident analysis skills
- Ability to design automation solutions
- Strong systems thinking across complex environments
- Ability to influence without direct authority
Success Indicators
- Standardized SRE practices across all clusters
- Reduced incident recurrence
- Improved MTTR and operational efficiency
- Increased automation coverage
- Reduced duplication across teams
What we offer
We give you the freedom to be yourself. We are creating an environment of unique individuals like you with different perspectives on the financial industry and the world. A diverse and inclusive environment in which everyones voice counts and where you can reach your full potential.
We are committed to an inclusive and accessible recruitment process. If you require a reasonable accommodation related to accessibility during your application or interview please contact or indicate this in your application.
Please note that this mailbox is not monitored for general recruitment enquiries and should only be used for accessibility or accommodation-related requests (for example related to vision hearing or neurodiversity).
All requests are confidential and will not affect your candidacy.
Dont meet every single requirement At Swift we are dedicated to building a workplace where people can bring their full selves and ideas to the team so if you are excited about this role we encourage you to apply even if you do not meet every single qualification.
Required Experience:
Staff IC
About Company
SWIFT is a global member-owned cooperative and the world’s leading provider of secure financial messaging services. We provide our community with a platform for messaging and standards for communicating, and we offer products and services to facilitate access and integration, identifi ... View more