Azure Kubernetes Service (AKS) Operations
Job Summary
Key Responsibilities
GitOps & Kubernetes Orchestration
- Manage and scale production AKS clusters using Flux CD for automated reconciliation of cluster state.
- Develop, version, and maintain complex Helm charts to standardize application deployments across multiple environments.
- Maintain and evolve Infrastructure as Code (IaC) using Terraform to ensure cloud resource consistency.
- Serve as a technical expert for complex production incidents, using Dynatrace's AI-driven root cause analysis (Davis) to reduce MTTR.
- Conduct performance tuning and capacity planning for containerised workloads to balance cost efficiency and performance.
- Dynatrace: Configure deep-stack monitoring, including OneAgent deployment, custom dashboards, and alerting profiles for Kubernetes workloads.
- Refine SLIs and SLOs within Dynatrace to provide real-time visibility into service health and business impact.
- Integrate Dynatrace monitoring with GitOps workflows to enable automated canary analysis or rollbacks.
- Optimize GitHub Actions or Azure DevOps pipelines to integrate with the Flux/Helm deployment model.
Technical Qualifications
- Cloud Platform: 4 years of deep experience with Azure (Identity, Networking, AKS).
- GitOps & Tooling: Proven experience implementing and managing Flux CD and Helm for Kubernetes delivery.
- Observability: Strong proficiency in Dynatrace (Dashboards, Synthetics, and Log Management).
- Infrastructure as Code: Advanced proficiency in Terraform.
- Scripting: Strong skills in Python or Go for automation.
- CI/CD: Experience with GitHub Actions or Azure DevOps.