Site Reliability Engineer III

Not Interested
Bookmark
Report This Job

profile Job Location:

London - UK

profile Monthly Salary: Not Disclosed
Posted on: 2 days ago
Vacancies: 1 Vacancy

Job Summary

This opportunity is part of the Global Technology Infrastructure & Operations team (GTIO) where our mission is to deliver modern and relevant technology that supports the way McDonalds works. We provide outstanding foundational technology products and services including Global Networking Cloud End User Computing and IT Service Management. Its our goal to always provide an engaging relevant and simple experience for our customers.

The Site Reliability Engineer (SRE) Edge Platform is a key member of the Edge Operations and SRE team within Global Technology Infrastructure & Operations. This role is responsible for ensuring the reliability scalability and operational excellence of the Edge computing platform that supports McDonalds global restaurant technology ecosystem.

You will work closely with Architecture Platform Engineering Security teams to implement observability automation and incident response strategies that ensure the Edge platform is resilient and maintainable. This is a unique opportunity to influence the operational maturity of a global platform and drive continuous improvement across infrastructure and services.

Responsibilities & Accountabilities:

  • Operate and maintain Edge platform infrastructure to ensure 24x7x365 availability reliability and performance.
  • Design and implement observability frameworks using tools such as Prometheus Grafana Jaeger and Datadog.
  • Collaborate with Platform Engineering and Edge Solution Delivery teams to ensure platform features are operable maintainable and supportable in production environments.
  • Develop and maintain runbooks playbooks and automation scripts to streamline operations and reduce manual effort.
  • Develop and maintain runbooks playbooks and automation scripts to streamline operations and reduce manual toil.
  • Lead incident response root cause analysis and post-incident reviews to drive continuous improvement.
  • Participate in capacity planning performance tuning and disaster recovery exercises.
  • Implement and manage CI/CD pipelines and Infrastructure-as-Code (IaC) for operational tooling and automation.
  • Architect and maintain self-healing and auto-scaling capabilities across Edge clusters.
  • Partner with security teams to ensure compliance with enterprise standards and implement secure operational practices.
  • Contribute to platform architecture discussions with a focus on operational readiness and supportability.
  • Stay current with industry trends in SRE edge computing and distributed systems.

Skills and experience required:

  • Experience in Site Reliability Engineering DevOps or Platform Operations.
  • Experience supporting Edge computing or hybrid cloud environments.
  • Strong expertise in observability tools (Prometheus Grafana Jaeger Datadog ELK).
  • Experience with container orchestration platforms (Kubernetes GKE) and virtualization technologies.
  • Proficiency in scripting and automation (Python Bash PowerShell).
  • Hands-on experience with CI/CD tools (GitHub Actions Jenkins ArgoCD) and IaC (Terraform).
  • Solid understanding of cloud platforms (GCP AWS) and distributed systems.
  • Strong problem-solving skills and ability to work in a fast-paced collaborative environment.
  • Excellent communication and documentation skills.
  • GCP or AWS certification preferred.
  • Experience with Agile methodologies is a plus.

Qualifications :

  • Bachelors degree in Computer Science Engineering or related field; or equivalent experience.

Additional Information :

At McDonalds we are People from all Walks of Life... 

People are at the heart of everything we do and they make the McDonalds experience. We embrace diversity and are committed to creating an inclusive culture that means people can be their best authentic self in our restaurants and offices which helps us to better serve our customers. We have a strong heritage of diversity and representation within our communities which we are proud of. The diversity of our people customers Franchisees and suppliers gives us strength.

We do not tolerate inequality injustice or discrimination of any kind.  These are hugely important issues and a brand with our reach and relevance means we have a very meaningful role to play.

We also recognise our responsibility as a large employer to continue being active in our communities helping to develop skills and drive aspirations that will help people to be more aware of the world of work and more successful within it whether with McDonalds or elsewhere.


Remote Work :

No


Employment Type :

Full-time

This opportunity is part of the Global Technology Infrastructure & Operations team (GTIO) where our mission is to deliver modern and relevant technology that supports the way McDonalds works. We provide outstanding foundational technology products and services including Global Networking Cloud End U...
View more view more

About Company

McDonald’s growth strategy, Accelerating the Arches, encompasses all aspects of our business as the leading global omni-channel restaurant brand. As the consumer landscape shifts we are using our competitive advantages to further strengthen our brand. One of our core growth strategies ... View more

View Profile View Profile