Senior DevOps/SRE/Cloud Engineering Lead
Role Summary
Lead the design automation and reliability of large-scale cloud-native systems across multi-cloud environments. Drive platform stability CI/CD efficiency while mentoring engineering teams.
Key Responsibilities
- Architect and manage multi-cloud multi-region Kubernetes platforms to improve uptime and reduce outages.
- Build and scale AWS infrastructure using Terraform Puppet and automation best practices.
- Implement GitOps-driven CI/CD with ArgoCD Argo Workflows and Helm to accelerate deployments.
- Establish robust observability using Prometheus Grafana AlertManager Splunk ELK improving MTTR and alerting quality.
- Use laC to automate Databricks and cloud resource provisioning eliminating config drift.
- Collaborate with product teams to define SLOs and drive reliability improvements
- Mentor engineers on DevOps automation and SRE principles.
Technical Skills Cloud: AWS Azure OpenStack OpenShift Containers: Kubernetes Docker
laC & Automation: Terraform Vault Puppet Ansible Chef
Monitoring: Prometheus Grafana Splunk ELK Nagios Sensu New Relic Catchpoint CI/CD: ArgoCD Argo Workflows Helm Git Jenkins Gerrit Bitbucket
Data: MongoDB Kafka MySQL PostgreSQL Snowflake Couchbase
Big Data: Spark Hadoop Hive
Programming: Python Go Bash Java (Spring Boot) Spark JUnit Ops & Security: PagerDuty ServiceNow Security Hardening Certifications: CCNA RHCE CKA
Senior DevOps/SRE/Cloud Engineering Lead Role Summary Lead the design automation and reliability of large-scale cloud-native systems across multi-cloud environments. Drive platform stability CI/CD efficiency while mentoring engineering teams. Key Responsibilities Architect and manage multi-cloud mu...
Senior DevOps/SRE/Cloud Engineering Lead
Role Summary
Lead the design automation and reliability of large-scale cloud-native systems across multi-cloud environments. Drive platform stability CI/CD efficiency while mentoring engineering teams.
Key Responsibilities
- Architect and manage multi-cloud multi-region Kubernetes platforms to improve uptime and reduce outages.
- Build and scale AWS infrastructure using Terraform Puppet and automation best practices.
- Implement GitOps-driven CI/CD with ArgoCD Argo Workflows and Helm to accelerate deployments.
- Establish robust observability using Prometheus Grafana AlertManager Splunk ELK improving MTTR and alerting quality.
- Use laC to automate Databricks and cloud resource provisioning eliminating config drift.
- Collaborate with product teams to define SLOs and drive reliability improvements
- Mentor engineers on DevOps automation and SRE principles.
Technical Skills Cloud: AWS Azure OpenStack OpenShift Containers: Kubernetes Docker
laC & Automation: Terraform Vault Puppet Ansible Chef
Monitoring: Prometheus Grafana Splunk ELK Nagios Sensu New Relic Catchpoint CI/CD: ArgoCD Argo Workflows Helm Git Jenkins Gerrit Bitbucket
Data: MongoDB Kafka MySQL PostgreSQL Snowflake Couchbase
Big Data: Spark Hadoop Hive
Programming: Python Go Bash Java (Spring Boot) Spark JUnit Ops & Security: PagerDuty ServiceNow Security Hardening Certifications: CCNA RHCE CKA
View more
View less