About Man Group
Man Group is a global alternative investment management firm focused on pursuing outperformance for sophisticated clients via our Systematic, Discretionary and Solutions offerings. Powered by talent and advanced technology, our single and multi-manager investment strategies are underpinned by deep research and span public and private markets, across all major asset classes, with a significant focus on alternatives. Man Group takes a partnership approach to working with clients, establishing deep connections and creating tailored solutions to meet their investment goals and those of the millions of retirees and savers they represent.
Headquartered in London, we manage $213.9 billion* and operate across multiple offices globally. Man Group plc is listed on the London Stock Exchange under the ticker and is a constituent of the FTSE 250 Index. Further information can be found at
* As at 30 September 2025
Purpose of the Role
Join our high-performing Site Reliability Engineering (SRE) team and play a pivotal role in ensuring the reliability, scalability and performance of the technology powering Man Group's hedge funds. You'll have the autonomy, tools and support to innovate and shape the future of our platform. This is an opportunity to work on cutting-edge projects, gain mentorship from senior leaders and develop a deep understanding of both technology and the business.
As an SRE, you'll take ownership of service reliability and deliver solutions that make a real impact. Your initial focus will include leveraging AI to accelerate incident diagnosis and resolution, and improving observability, capacity planning and automation. Over time, you'll work across our entire infrastructure stack, operating at scale and driving continuous improvement.
Specific responsibilities
- Ensure reliability and performance of critical systems across global infrastructure through proactive monitoring and rapid incident response.
- Design and implement observability solutions using tools like Prometheus, Grafana, ELK and Loki to provide deep insights into system health.
- Automate operational tasks and build self-service capabilities to eliminate toil and improve efficiency.
- Develop and maintain SLIs, SLOs and error budgets to guide reliability improvements and inform engineering priorities.
- Participate in incident response efforts and blameless post-mortems, and implement preventive measures to reduce recurrence.
- Collaborate with development teams to improve system design, deployment practices and operational excellence.
- Operate at scale, managing petabyte-level storage, large CPU/GPU deployments and high-throughput distributed systems.
- Contribute to capacity planning and performance tuning, ensuring systems meet business demands.
- Manage multiple ELK clusters hosting hundreds of terabytes of logs, telemetry and APM data.
Key competencies
- Strong understanding of SRE principles, including SLIs, SLOs, error budgets and reliability best practices.
- Hands-on experience with observability and monitoring tools (Prometheus, Grafana, ELK, Loki or similar).
- Proficiency with automation tools (Ansible, Terraform) and scripting/programming languages (Python, Go, PowerShell).
- Strong troubleshooting and debugging skills across distributed systems, with the ability to diagnose complex production issues under pressure.
- Experience with incident management, on-call rotations and post-incident reviews.
- Familiarity with Kubernetes and container orchestration.
- A proactive mindset and ability to take ownership of reliability initiatives.
Advantageous
- Experience with CI/CD pipelines and source control workflows (Git, Jenkins, TeamCity).
- Administration of Linux and Windows systems and exposure to cloud technologies (AWS/Azure).
- Understanding of networking concepts, load balancing and distributed architectures.
- Knowledge of AI/LLM concepts (context windows, prompt tuning, MCP servers).
- Interest in FinOps principles and a desire to understand the true cost of our decisions.
- Excellent communication and collaboration skills.
Benefits
- Modern office located in the OfficeX campus with easy access to transport and amenities.
- Hybrid working model
- Competitive compensation package
- 25 days holiday allowance
- Premium health insurance
- Employee assistance program
- Referral bonus
- Additional days off for long service and volunteering
- Multisport card
- Opportunities for professional development, including internal tech talks
- Conference attendance and engagement with the open-source community.
Inclusion, Work-Life Balance and Benefits at Man Group
You'll thrive in our working environment that champions equality of opportunity. Your unique perspective will contribute to our success, joining a workplace where inclusion is fundamental and deeply embedded in our culture and values. Through our external and internal initiatives, partnerships and programmes, you'll find opportunities to grow, develop your talents and help foster an inclusive environment for all across our firm and industry. Learn more at
You'll have opportunities to make a difference through our charitable and global initiatives while advancing your career through professional development, with flexible working arrangements available too. Like all our people, you'll receive two annual Mankind days of paid leave for community volunteering.
Our comprehensive benefits package includes competitive holiday entitlements, pension/401k, life and long-term disability coverage, group sick pay, enhanced parental leave and long-service leave. Depending on your location, you may also enjoy additional benefits such as private medical coverage, discounted gym membership options and pet insurance.
Equal Employment Opportunity Policy
Man Group provides equal employment opportunities to all applicants and all employees without regard to race, color, creed, national origin, ancestry, religion, disability, sex, gender identity and expression, marital status, sexual orientation, military or veteran status, age or any other legally protected category or status, in accordance with applicable federal, state and local laws.
Man Group is a Disability Confident Committed employer; if you require help or information on reasonable adjustments as you apply for roles with us, please contact.