Job Description:
- We are seeking a DevOps Engineer with strong hands-on experience across Kubernetes cloud and hybrid infrastructure CI/CD GitOps observability and Infrastructure-as-Code. This role supports the design automation deployment operation and modernization of production-grade Kubernetes and cloud-native platforms across on-premises and public cloud environments.
- The ideal candidate has a DevOps SRE or platform engineering background; is comfortable working with application security and operations teams; and can support customers through reliable repeatable and secure delivery of infrastructure and application platforms. Exposure to Spectro Cloud Palette or similar Kubernetes management platforms is preferred but training and enablement may be provided where needed.
Key Responsibilities:
Kubernetes and Platform Operations
- Deploy manage upgrade and troubleshoot Kubernetes clusters across on-premises public cloud and hybrid environments.
- Support Kubernetes lifecycle operations using tools such as kubeadm EKS AKS GKE Rancher OpenShift
Anthos or Spectro Cloud Palette.
- Implement and maintain Kubernetes RBAC network policies ingress controllers storage integrations and security frameworks.
- Verify cluster health workload readiness node status resource utilization and platform reliability after changes or deployments.
Automation Infrastructure-as-Code and GitOps
- Automate infrastructure provisioning and platform operations using Terraform Helm Ansible shell scripting and Git-based workflows.
- Build reusable automation modules templates and runbooks that enable consistent infrastructure delivery across staging and production environments.
- Implement and manage GitOps workflows using tools such as Argo CD or Flux for cluster platform and application configuration.
- Maintain disciplined branch pull request and change sequencing practices especially for high-impact or destructive operational actions.
CI/CD and Application Delivery
- Integrate Kubernetes platforms with modern CI/CD workflows for safe repeatable and auditable application delivery.
- Support CI/CD systems such as GitHub Actions GitLab CI Jenkins Argo CD or Flux.
- Partner with development teams to improve deployment reliability rollback processes release automation and environment consistency.
Monitoring Logging and Troubleshooting
- Implement and support observability for Kubernetes and cloud platforms using tools such as Prometheus Grafana Loki Fluentd/Fluent Bit ELK/EFK SignalFX Splunk
Cloud or equivalent platforms.
- Troubleshoot failed pods workload performance issues memory leaks failed deployments network connectivity storage failures and cluster degradation events.
- Create and maintain operational dashboards alerts runbooks and escalation procedures for production environments.
Cloud Hybrid and Infrastructure Support
- Operate workloads across AWS Azure GCP VMware and hybrid/on-premises infrastructure.
- Support cloud networking identity security compute storage backup/restore and platform reliability requirements.
- Use backup and restore tools such as Velero Kasten Stash or equivalent solutions where appropriate.
- Assist with cloud modernization migration support environment validation and production readiness activities.
Collaboration and Documentation
- Collaborate with applications DevOps SRE security and operations teams to align platform capabilities with business and delivery goals.
- Serve as a technical advisor on cloud-native architecture containerization practices Kubernetes operations and automation patterns.
- Create and maintain technical documentation operational runbooks implementation notes and reference architecture.
Required Qualifications
- Strong hands-on experience in DevOps SRE Cloud Engineering or Platform Engineering roles.
- Solid Kubernetes fundamentals including cluster operations networking storage security workload management and troubleshooting.
- Practical experience with CI/CD systems and modern software delivery workflows.
- Strong experience with Git GitOps principles branch/PR workflows and change-control discipline.
- Proficiency with Infrastructure-as-Code and automation tools such as Terraform Helm Ansible and shell scripting.
- Experience operating infrastructure in at least one major cloud provider: AWS Azure or GCP.
- Experience with Linux administration including SSH root-level operations environment variables package management service troubleshooting and scripting.
- Familiarity with cloud networking DNS ingress load balancing security identity and observability concepts.
- Ability to troubleshoot complex issues spanning Kubernetes containerized workloads cloud infrastructure networking storage CI/CD and platform tooling.
- Ability to learn new platforms quickly and support customers through cloud platform and application modernization journeys.
What we need to see from you
Requirements:
Preferred Qualifications
- Exposure to Spectro Cloud Palette or similar Kubernetes management platforms such as Rancher OpenShift Anthos or Tanzu.
- Experience with managed Kubernetes services such as EKS AKS or GKE.
- Experience with VMware vSphere/ESXi or other private cloud/on-premises platforms.
- Experience with observability stacks such as Prometheus Grafana Loki Fluentd/Fluent Bit ELK/EFK SignalFX or Splunk Cloud.
- Experience with service mesh technologies such as Istio or Linkerd.
- Experience with backup/restore and disaster recovery strategies for Kubernetes platforms.
- Open-source contributions or active participation in Kubernetes DevOps cloud-native or infrastructure automation communities.
- Exposure to Incus Platform9 OpenStack Cloud 9 or other private cloud and virtualization platforms is a plus.
Certifications:
At least one of the following certifications is preferred; CKA is highly preferred for Kubernetes-focused engagements:
- Certified Kubernetes Administrator (CKA) - highly preferred
- Certified Kubernetes Application Developer (CKAD)
- Certified Kubernetes Security Specialist (CKS)
- AWS Certified DevOps Engineer - Professional
- Google Professional Cloud DevOps Engineer
- Microsoft Azure DevOps Engineer Expert or Azure Administrator Associate
- HashiCorp Certified: Terraform Associate
- Linux Foundation Certified System Administrator (LFCS)
Working Conditions / Terms and Conditions:
- This position is primarily performed during regular business hours but may require occasional work outside standard hours including evenings and weekends based on business demands client requirements production issues and project deadlines.
- This role may require limited travel for client meetings on-site engagements internal meetings training or other business purposes. Travel is expected to be light and intermittent unless otherwise defined by the engagement.
- The successful candidate must be able to work a flexible schedule as needed to meet business objectives and support client project and operational needs.
- Reasonable accommodation may be made to enable qualified individuals with disabilities to perform the essential functions of this position.
Job Description: We are seeking a DevOps Engineer with strong hands-on experience across Kubernetes cloud and hybrid infrastructure CI/CD GitOps observability and Infrastructure-as-Code. This role supports the design automation deployment operation and modernization of production-grade Kubernetes a...
Job Description:
- We are seeking a DevOps Engineer with strong hands-on experience across Kubernetes cloud and hybrid infrastructure CI/CD GitOps observability and Infrastructure-as-Code. This role supports the design automation deployment operation and modernization of production-grade Kubernetes and cloud-native platforms across on-premises and public cloud environments.
- The ideal candidate has a DevOps SRE or platform engineering background; is comfortable working with application security and operations teams; and can support customers through reliable repeatable and secure delivery of infrastructure and application platforms. Exposure to Spectro Cloud Palette or similar Kubernetes management platforms is preferred but training and enablement may be provided where needed.
Key Responsibilities:
Kubernetes and Platform Operations
- Deploy manage upgrade and troubleshoot Kubernetes clusters across on-premises public cloud and hybrid environments.
- Support Kubernetes lifecycle operations using tools such as kubeadm EKS AKS GKE Rancher OpenShift
Anthos or Spectro Cloud Palette.
- Implement and maintain Kubernetes RBAC network policies ingress controllers storage integrations and security frameworks.
- Verify cluster health workload readiness node status resource utilization and platform reliability after changes or deployments.
Automation Infrastructure-as-Code and GitOps
- Automate infrastructure provisioning and platform operations using Terraform Helm Ansible shell scripting and Git-based workflows.
- Build reusable automation modules templates and runbooks that enable consistent infrastructure delivery across staging and production environments.
- Implement and manage GitOps workflows using tools such as Argo CD or Flux for cluster platform and application configuration.
- Maintain disciplined branch pull request and change sequencing practices especially for high-impact or destructive operational actions.
CI/CD and Application Delivery
- Integrate Kubernetes platforms with modern CI/CD workflows for safe repeatable and auditable application delivery.
- Support CI/CD systems such as GitHub Actions GitLab CI Jenkins Argo CD or Flux.
- Partner with development teams to improve deployment reliability rollback processes release automation and environment consistency.
Monitoring Logging and Troubleshooting
- Implement and support observability for Kubernetes and cloud platforms using tools such as Prometheus Grafana Loki Fluentd/Fluent Bit ELK/EFK SignalFX Splunk
Cloud or equivalent platforms.
- Troubleshoot failed pods workload performance issues memory leaks failed deployments network connectivity storage failures and cluster degradation events.
- Create and maintain operational dashboards alerts runbooks and escalation procedures for production environments.
Cloud Hybrid and Infrastructure Support
- Operate workloads across AWS Azure GCP VMware and hybrid/on-premises infrastructure.
- Support cloud networking identity security compute storage backup/restore and platform reliability requirements.
- Use backup and restore tools such as Velero Kasten Stash or equivalent solutions where appropriate.
- Assist with cloud modernization migration support environment validation and production readiness activities.
Collaboration and Documentation
- Collaborate with applications DevOps SRE security and operations teams to align platform capabilities with business and delivery goals.
- Serve as a technical advisor on cloud-native architecture containerization practices Kubernetes operations and automation patterns.
- Create and maintain technical documentation operational runbooks implementation notes and reference architecture.
Required Qualifications
- Strong hands-on experience in DevOps SRE Cloud Engineering or Platform Engineering roles.
- Solid Kubernetes fundamentals including cluster operations networking storage security workload management and troubleshooting.
- Practical experience with CI/CD systems and modern software delivery workflows.
- Strong experience with Git GitOps principles branch/PR workflows and change-control discipline.
- Proficiency with Infrastructure-as-Code and automation tools such as Terraform Helm Ansible and shell scripting.
- Experience operating infrastructure in at least one major cloud provider: AWS Azure or GCP.
- Experience with Linux administration including SSH root-level operations environment variables package management service troubleshooting and scripting.
- Familiarity with cloud networking DNS ingress load balancing security identity and observability concepts.
- Ability to troubleshoot complex issues spanning Kubernetes containerized workloads cloud infrastructure networking storage CI/CD and platform tooling.
- Ability to learn new platforms quickly and support customers through cloud platform and application modernization journeys.
What we need to see from you
Requirements:
Preferred Qualifications
- Exposure to Spectro Cloud Palette or similar Kubernetes management platforms such as Rancher OpenShift Anthos or Tanzu.
- Experience with managed Kubernetes services such as EKS AKS or GKE.
- Experience with VMware vSphere/ESXi or other private cloud/on-premises platforms.
- Experience with observability stacks such as Prometheus Grafana Loki Fluentd/Fluent Bit ELK/EFK SignalFX or Splunk Cloud.
- Experience with service mesh technologies such as Istio or Linkerd.
- Experience with backup/restore and disaster recovery strategies for Kubernetes platforms.
- Open-source contributions or active participation in Kubernetes DevOps cloud-native or infrastructure automation communities.
- Exposure to Incus Platform9 OpenStack Cloud 9 or other private cloud and virtualization platforms is a plus.
Certifications:
At least one of the following certifications is preferred; CKA is highly preferred for Kubernetes-focused engagements:
- Certified Kubernetes Administrator (CKA) - highly preferred
- Certified Kubernetes Application Developer (CKAD)
- Certified Kubernetes Security Specialist (CKS)
- AWS Certified DevOps Engineer - Professional
- Google Professional Cloud DevOps Engineer
- Microsoft Azure DevOps Engineer Expert or Azure Administrator Associate
- HashiCorp Certified: Terraform Associate
- Linux Foundation Certified System Administrator (LFCS)
Working Conditions / Terms and Conditions:
- This position is primarily performed during regular business hours but may require occasional work outside standard hours including evenings and weekends based on business demands client requirements production issues and project deadlines.
- This role may require limited travel for client meetings on-site engagements internal meetings training or other business purposes. Travel is expected to be light and intermittent unless otherwise defined by the engagement.
- The successful candidate must be able to work a flexible schedule as needed to meet business objectives and support client project and operational needs.
- Reasonable accommodation may be made to enable qualified individuals with disabilities to perform the essential functions of this position.
View more
View less