Principal DevOps Engineer
Job Summary
About the role
We are looking for a Principal DevOps Engineer to own and evolve the core infrastructure that underpins our cloudnative AIenabled SaaS platforms.
This role is about building platforms that scale not firefighting. You will design operate and continuously improve a secure highly available Kubernetesbased platform that enables product engineering teams to deploy operate and evolve services safely and independently.
You will work closely with software engineers product teams and security stakeholders to embed bestinclass DevOps and platform engineering practices across the organisation.
Mission
- Build and operate secure scalable highly available cloud infrastructure
- Enable product teams through automation selfservice and clear standards
- Raise the bar on reliability security observability and deployment quality
- Act as a technical leader across platform and infrastructure initiatives
What success looks like
You will be accountable for outcomes such as:
Highly available faulttolerant platforms
- All containerised services are deployed with appropriate replication resilience and resource limits
- Workloads are designed for multizone availability and safe failure modes
Zerodowntime highquality delivery
- CI/CD pipelines support safe deployment patterns (e.g. rolling canary fast rollback)
- Deploymentrelated incidents are eliminated or rapidly mitigated
Empowered engineering teams
- Engineers can diagnose and resolve the majority of platformrelated issues independently
- Clear standards tooling and automation reduce cognitive load and friction
Strong security posture
- Infrastructure and workloads follow securitybydefault principles
- Vulnerabilities are proactively identified prioritised and remediated
- Platform security tooling is continuously maintained and improved
Comprehensive observability
- All critical services are monitored with meaningful alerts and dashboards
- Teams have access to selfservice monitoring and alerting capabilities
Key responsibilities
- Design build and operate cloud infrastructure using Infrastructure as Code
- Own and evolve Kubernetes platforms including workload standards and deployment models
- Develop and maintain CI/CD pipelines and GitOps workflows
- Embed security best practices across infrastructure pipelines and runtime environments
- Improve platform reliability monitoring and incident response workflows
- Act as a technical leader and mentor for engineers using the platform
- Partner with product and engineering teams to anticipate future platform needs
Why join us
- Own and shape a modern platform engineering capability
- Work on real production systems supporting AIenabled SaaS products
- High trust high autonomy engineering culture
- Opportunity to influence platform strategy as the organisation scales
Qualifications :
Essential skills and experience
- Proven experience building and operating cloudnative platforms at scale
- Strong handson experience with:
- Kubernetes & containerised workloads
- Infrastructure as Code (e.g. Terraform)
- CI/CD pipelines and GitOpsstyle delivery
- Deep understanding of:
- High availability fault tolerance and scaling strategies
- Secure infrastructure design and operational security practices
- Experience running production platforms on public cloud (GCP preferred; AWS acceptable)
- Strong troubleshooting skills across distributed systems
- Ability to explain complex technical concepts to nonspecialist audiences
- Exposure to AI/ML or LLMbased workloads in production environments
Technologies youll work with
- Google Cloud Platform (GCP)
- Kubernetes (GKE) Docker
- Terraform
- Gitbased CI/CD pipelines
- GitOps tooling (e.g. Argo CD)
- Observability tooling (metrics logging alerting)
- Modern AIenabled workloads and services
Nice to have
- Experience with service mesh technologies (e.g. Istio)
- Experience with Kubernetes Gateway API or modern ingress patterns
- Familiarity with Redis PostgreSQL or managed cloud data services
Additional Information :
We embrace flexibility and hybrid work opportunities to support diverse needs and lifestyles while also valuing inclusive workplace experiences. By fostering a sense of community we drive innovation strengthen connections and nurture belonging. Our commitment ensures you can work in a way that suits you best while also engaging with colleagues to share ideas and build meaningful relationships.
Remote Work :
No
Employment Type :
Full-time
About Company
We are growing! At IFS we are constantly growing to deliver award-winning solutions to hundreds of partners and thousands of customers worldwide! We help companies who want to be their best when it matters most at their #momentofservice. Visit https://ifs.link/IzM0px to find out mo ... View more