We are looking for a Senior Platform Engineer with hands-on experience in deploying and operating production-grade infrastructure with a strong focus on Kubernetes in on-premise and customer-owned environments. The role involves supporting highly available scalable and fault-tolerant platforms in restricted or air-gapped environments while driving automation CI/CD observability and platform improvements end-to-end.
Key Responsibilities
- Deploy and operate vanilla Kubernetes on on-premise customer-owned infrastructure
- Run Kubernetes in restricted or air-gapped environments including upgrades and dependency management
- Manage Helm-based application deployments upgrades and day-2 Kubernetes operations
- Design highly available fault-tolerant and scalable platforms for edge or sovereign environments
- Implement infrastructure automation using Infrastructure as Code tools such as Terraform or CloudFormation
- Design and operate CI/CD pipelines using GitHub Actions ArgoCD or Jenkins
- Operate monitoring and logging stacks such as Prometheus Grafana and ELK/OpenSearch
- Utilize configuration management tools such as Ansible Puppet or Chef
- Take ownership of platform operations end-to-end and drive continuous improvements
Skills Knowledge and Expertise
- 5 years of hands-on experience in DevOps / Platform / SRE roles owning production infrastructure end to end
- Strong experience with on-premise Kubernetes including clusters deployed and operated inside customer or third-party data centers
- Hands-on experience with Rancher (including Rancher Government Edition preferred) for Kubernetes cluster management and Fleet for multi-cluster GitOps
- Experience supporting fully air-gapped Kubernetes environments including strategies for shipping artifacts images and Helm charts into isolated networks
- Practical experience with private container registries (e.g. Harbor) and image mirroring for offline environments
- Solid understanding of Helm-based deployments upgrades and day-2 Kubernetes operations
- Working knowledge of enterprise networking fundamentals including firewalls local DNS network policies and security constraints within customer-owned IT environments
- Ability to troubleshoot Kubernetes and application issues caused by network restrictions DNS behavior or firewall rules
- Familiarity with tools used in restricted environments such as WireGuard Teleport and Velero for secure access connectivity and backup/restore
- Experience operating Kubernetes platforms where external connectivity is limited or completely unavailable including handling upgrades and disaster recovery
- Exposure to hybrid scenarios where parts of the stack (e.g. AI components) may connect to cloud services with a clear understanding of separation and risk
- Self-driven engineer with a growth and leadership mindset comfortable working in long-running platform enablement engagements and collaborating closely with customers
Required Experience:
Senior IC
We are looking for a Senior Platform Engineer with hands-on experience in deploying and operating production-grade infrastructure with a strong focus on Kubernetes in on-premise and customer-owned environments. The role involves supporting highly available scalable and fault-tolerant platforms in re...
We are looking for a Senior Platform Engineer with hands-on experience in deploying and operating production-grade infrastructure with a strong focus on Kubernetes in on-premise and customer-owned environments. The role involves supporting highly available scalable and fault-tolerant platforms in restricted or air-gapped environments while driving automation CI/CD observability and platform improvements end-to-end.
Key Responsibilities
- Deploy and operate vanilla Kubernetes on on-premise customer-owned infrastructure
- Run Kubernetes in restricted or air-gapped environments including upgrades and dependency management
- Manage Helm-based application deployments upgrades and day-2 Kubernetes operations
- Design highly available fault-tolerant and scalable platforms for edge or sovereign environments
- Implement infrastructure automation using Infrastructure as Code tools such as Terraform or CloudFormation
- Design and operate CI/CD pipelines using GitHub Actions ArgoCD or Jenkins
- Operate monitoring and logging stacks such as Prometheus Grafana and ELK/OpenSearch
- Utilize configuration management tools such as Ansible Puppet or Chef
- Take ownership of platform operations end-to-end and drive continuous improvements
Skills Knowledge and Expertise
- 5 years of hands-on experience in DevOps / Platform / SRE roles owning production infrastructure end to end
- Strong experience with on-premise Kubernetes including clusters deployed and operated inside customer or third-party data centers
- Hands-on experience with Rancher (including Rancher Government Edition preferred) for Kubernetes cluster management and Fleet for multi-cluster GitOps
- Experience supporting fully air-gapped Kubernetes environments including strategies for shipping artifacts images and Helm charts into isolated networks
- Practical experience with private container registries (e.g. Harbor) and image mirroring for offline environments
- Solid understanding of Helm-based deployments upgrades and day-2 Kubernetes operations
- Working knowledge of enterprise networking fundamentals including firewalls local DNS network policies and security constraints within customer-owned IT environments
- Ability to troubleshoot Kubernetes and application issues caused by network restrictions DNS behavior or firewall rules
- Familiarity with tools used in restricted environments such as WireGuard Teleport and Velero for secure access connectivity and backup/restore
- Experience operating Kubernetes platforms where external connectivity is limited or completely unavailable including handling upgrades and disaster recovery
- Exposure to hybrid scenarios where parts of the stack (e.g. AI components) may connect to cloud services with a clear understanding of separation and risk
- Self-driven engineer with a growth and leadership mindset comfortable working in long-running platform enablement engagements and collaborating closely with customers
Required Experience:
Senior IC
View more
View less