Job Description - SRE (GCP)
We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP), Terraform, Terragrunt, and Kubernetes. The ideal candidate has a solid Java development background and experience managing scalable cloud infrastructure and production environments.
Key Responsibilities
- Design, implement, and manage cloud infrastructure on GCP
- Develop and maintain Infrastructure as Code (IaC) using Terraform and Terragrunt
- Deploy, manage, and troubleshoot Kubernetes clusters and containerized applications
- Monitor system reliability, performance, and availability
- Automate operational tasks and CI/CD workflows
- Collaborate with development teams to improve application reliability and scalability
- Troubleshoot production issues and perform root cause analysis
- Support security, compliance, and best practices across cloud environments
Required Skills
- Strong hands-on experience with GCP services
- Excellent knowledge of Terraform and Terragrunt
- Strong experience with Kubernetes administration and troubleshooting
- Good understanding of DevOps and SRE principles
- Java development background with an understanding of application architecture
- Experience with CI/CD tools and automation
- Knowledge of monitoring and logging tools such as Prometheus, Grafana, or ELK
- Familiarity with Linux systems and scripting