Principal DevOps Engineer

IFS

Job Location:

Colombo - Sri Lanka

Monthly Salary: Not Disclosed
Posted on: 4 days ago
Vacancies: 1 Vacancy

Job Summary

About the role

We are looking for a Principal DevOps Engineer to own and evolve the core infrastructure that underpins our cloud-native, AI-enabled SaaS platforms.

This role is about building platforms that scale, not firefighting. You will design, operate, and continuously improve a secure, highly available, Kubernetes-based platform that enables product engineering teams to deploy, operate, and evolve services safely and independently.

You will work closely with software engineers, product teams, and security stakeholders to embed best-in-class DevOps and platform engineering practices across the organisation.

Mission

  • Build and operate secure, scalable, highly available cloud infrastructure
  • Enable product teams through automation, self-service, and clear standards
  • Raise the bar on reliability, security, observability, and deployment quality
  • Act as a technical leader across platform and infrastructure initiatives

What success looks like

You will be accountable for outcomes such as:

  • Highly available, fault-tolerant platforms

    • All containerised services are deployed with appropriate replication, resilience, and resource limits
    • Workloads are designed for multi-zone availability and safe failure modes
  • Zero-downtime, high-quality delivery

    • CI/CD pipelines support safe deployment patterns (e.g. rolling, canary, fast rollback)
    • Deployment-related incidents are eliminated or rapidly mitigated
  • Empowered engineering teams

    • Engineers can diagnose and resolve the majority of platform-related issues independently
    • Clear standards, tooling, and automation reduce cognitive load and friction
  • Strong security posture

    • Infrastructure and workloads follow security-by-default principles
    • Vulnerabilities are proactively identified, prioritised, and remediated
    • Platform security tooling is continuously maintained and improved
  • Comprehensive observability

    • All critical services are monitored with meaningful alerts and dashboards
    • Teams have access to self-service monitoring and alerting capabilities

Key responsibilities

  • Design, build, and operate cloud infrastructure using Infrastructure as Code
  • Own and evolve Kubernetes platforms, including workload standards and deployment models
  • Develop and maintain CI/CD pipelines and GitOps workflows
  • Embed security best practices across infrastructure, pipelines, and runtime environments
  • Improve platform reliability, monitoring, and incident response workflows
  • Act as a technical leader and mentor for engineers using the platform
  • Partner with product and engineering teams to anticipate future platform needs

Why join us

  • Own and shape a modern platform engineering capability
  • Work on real production systems supporting AI-enabled SaaS products
  • High-trust, high-autonomy engineering culture
  • Opportunity to influence platform strategy as the organisation scales

Qualifications:

Essential skills and experience

  • Proven experience building and operating cloud-native platforms at scale
  • Strong hands-on experience with:
    • Kubernetes & containerised workloads
    • Infrastructure as Code (e.g. Terraform)
    • CI/CD pipelines and GitOps-style delivery
  • Deep understanding of:
    • High availability, fault tolerance, and scaling strategies
    • Secure infrastructure design and operational security practices
  • Experience running production platforms on public cloud (GCP preferred; AWS acceptable)
  • Strong troubleshooting skills across distributed systems
  • Ability to explain complex technical concepts to non-specialist audiences
  • Exposure to AI/ML or LLM-based workloads in production environments

Technologies you'll work with

  • Google Cloud Platform (GCP)
  • Kubernetes (GKE), Docker
  • Terraform
  • Git-based CI/CD pipelines
  • GitOps tooling (e.g. Argo CD)
  • Observability tooling (metrics, logging, alerting)
  • Modern AI-enabled workloads and services

Nice to have

  • Experience with service mesh technologies (e.g. Istio)
  • Experience with Kubernetes Gateway API or modern ingress patterns
  • Familiarity with Redis, PostgreSQL, or managed cloud data services

Additional Information:

We embrace flexibility and hybrid work opportunities to support diverse needs and lifestyles, while also valuing inclusive workplace experiences. By fostering a sense of community, we drive innovation, strengthen connections, and nurture belonging. Our commitment ensures you can work in a way that suits you best while also engaging with colleagues to share ideas and build meaningful relationships.


Remote Work:

No


Employment Type:

Full-time

About Company

We are growing! At IFS, we are constantly growing to deliver award-winning solutions to hundreds of partners and thousands of customers worldwide! We help companies who want to be their best when it matters most – at their #momentofservice. Visit https://ifs.link/IzM0px to find out more ...
