Site Reliability Engineer


Job Location:

Bengaluru - India

Monthly Salary: Not Disclosed
Posted on: 8 days ago
Vacancies: 1 Vacancy

Job Summary

We are seeking a highly skilled Site Reliability Engineer (SRE) to design, build, and operate scalable, reliable, and secure cloud-native platforms. The ideal candidate will have strong experience with Kubernetes ecosystems, cloud infrastructure, automation, observability, and GitOps practices.

Key Responsibilities

  • Manage and optimize Kubernetes-based platforms, including Cilium, Istio, Ingress Controllers, and related ecosystem components.
  • Design, deploy, and maintain infrastructure on Google Cloud Platform (GCP).
  • Automate infrastructure provisioning and lifecycle management using Terraform.
  • Implement and manage GitOps workflows using ArgoCD and GitLab.
  • Deploy and maintain Helm charts for Kubernetes applications.
  • Manage secrets, service discovery, and distributed systems using Vault and Consul.
  • Build and maintain monitoring, logging, and observability platforms using Prometheus Operator and the Grafana Stack (Grafana, Mimir, Loki, Alloy, Tempo, and Pyroscope).
  • Collaborate with development teams to improve platform reliability, performance, scalability, and operational excellence.
  • Develop CI/CD pipelines and automation to support modern cloud-native deployments.

Required Skills

  • Strong hands-on experience with Kubernetes (K8s) and cloud-native technologies.
  • Experience with GCP, Terraform, Helm, and ArgoCD.
  • Knowledge of service mesh technologies, particularly Istio and Cilium.
  • Experience with Vault, Consul, and infrastructure security best practices.
  • Strong expertise in observability tools including Prometheus and the Grafana ecosystem.
  • Proficiency with GitOps, GitLab CI/CD pipelines, and automation.
  • Good understanding of Linux systems, networking, and troubleshooting in distributed environments.

Preferred Qualifications

  • Experience operating large-scale production environments.
  • Knowledge of SRE principles, incident management, capacity planning, and reliability engineering.
  • Relevant cloud-native certifications (CKA, GCP, Terraform, etc.) are a plus.

Required Experience:

IC
