About the Role
Build production cloud infrastructure for Fortune 500 clients in healthcare finance and manufacturing. Architect Kubernetes platforms running enterprise applications and AI workloadsanalytics systems GPU inference servers and autonomous deployment pipelines.
The difference: Youll create infrastructure where AI agents deploy code run automated reviews and handle complex operationsdelivering the 60-90% efficiency gains companies like Spotify already achieve.
Experience required: 5-10 years DevOps/SRE
What Youll Do
Platform Operations (40%)
- Design and operate Azure Kubernetes clusters for production workloads
- Implement Infrastructure-as-Code with Terraform and Crossplane
- Deploy platforms using GitOps (ArgoCD/Flux)
- Build CI/CD pipelines with AI-powered code reviews and testing
- Manage multi-cluster environments with Azure Arc
Site Reliability Engineering (30%)
- Build observability stack: Prometheus OpenTelemetry Jaeger Loki Grafana
- Define SLIs SLOs error budgets (99.9% uptime target)
- Deploy automated incident response and root cause analysis
- Implement DevSecOps: SBOM generation policy-as-code container scanning
- Lead post-incident reviews and preventive measures
FinOps & Cost Engineering (15%)
- Deploy OpenCost/Kubecost for cost attribution
- Build cost dashboards with team showback/chargeback
- Optimize cloud spending (target: 20-30% reduction)
Platform Engineering (15%)
- Build internal developer platforms (Backstage)
- Create golden path templates and self-service tools
- Track DORA metrics and developer productivity
- Develop automation for infrastructure tasks
Qualifications :
Must Have:
Kubernetes & Containers
5 years production Kubernetes experience
Cluster design RBAC networking troubleshooting
Performance tuning and optimization
Infrastructure & Automation
Terraform for infrastructure provisioning
GitOps workflows (ArgoCD or Flux)
CI/CD pipelines (GitHub Actions Azure DevOps Jenkins GitLab CI)
Python OR Go for automation
Bash scripting
Cloud Platforms
Observability
Prometheus and Grafana (required)
Log aggregation (Loki ELK Splunk or similar)
Distributed tracing concepts
Alerting and on-call experience
Foundation
Strong Linux/Unix administration
Git and code review workflows
English proficiency
Technical documentation skills
Nice-to-Have
Helm Kustomize Crossplane
Container security and policy-as-code
Service mesh (Istio Linkerd)
Chaos engineering
FinOps tools and practices
Certifications: Azure (AZ-104 AZ-305) Kubernetes (CKA CKS) Terraform
AI Skills Well Train You
No AI experience required. We provide comprehensive training:
Claude Code and Cursor for development
AI agent integration in CI/CD
Multi-agent workflow orchestration
Additional Information :
Perks Youll Enjoy
- Working in one of the Best Places to Work in Vietnam
- Building large-scale & global software products
- Working & growing with Passionate & Talented Team
- Diverse careers opportunities with Software Outsourcing Software Product Development IT Solutions & Consulting
- Attractive Salary and Benefits
- Performance appraisals every year and performance bonus
- Onsite opportunities: short-term and long-term assignments in North American (U.S Canada) Europe Asia.
- Flexible working time
- Various training on hot-trend technologies best practices and soft skills
- Premium healthcare insurance for you and your loved ones
- Company trip big annual year-end party every year team building etc.
- Fitness & sport activities: football tennis table-tennis badminton yoga swimming
- Joining community development activities: 1% Pledge charity every quarter blood donation public seminars career orientation talks
- Free in-house entertainment facilities (foosball ping pong gym) coffee and snack (instant noodles cookies candies)
And much more join us and let yourself explore other fantastic things!
Remote Work :
No
Employment Type :
Full-time
About the RoleBuild production cloud infrastructure for Fortune 500 clients in healthcare finance and manufacturing. Architect Kubernetes platforms running enterprise applications and AI workloadsanalytics systems GPU inference servers and autonomous deployment pipelines.The difference: Youll create...
About the Role
Build production cloud infrastructure for Fortune 500 clients in healthcare finance and manufacturing. Architect Kubernetes platforms running enterprise applications and AI workloadsanalytics systems GPU inference servers and autonomous deployment pipelines.
The difference: Youll create infrastructure where AI agents deploy code run automated reviews and handle complex operationsdelivering the 60-90% efficiency gains companies like Spotify already achieve.
Experience required: 5-10 years DevOps/SRE
What Youll Do
Platform Operations (40%)
- Design and operate Azure Kubernetes clusters for production workloads
- Implement Infrastructure-as-Code with Terraform and Crossplane
- Deploy platforms using GitOps (ArgoCD/Flux)
- Build CI/CD pipelines with AI-powered code reviews and testing
- Manage multi-cluster environments with Azure Arc
Site Reliability Engineering (30%)
- Build observability stack: Prometheus OpenTelemetry Jaeger Loki Grafana
- Define SLIs SLOs error budgets (99.9% uptime target)
- Deploy automated incident response and root cause analysis
- Implement DevSecOps: SBOM generation policy-as-code container scanning
- Lead post-incident reviews and preventive measures
FinOps & Cost Engineering (15%)
- Deploy OpenCost/Kubecost for cost attribution
- Build cost dashboards with team showback/chargeback
- Optimize cloud spending (target: 20-30% reduction)
Platform Engineering (15%)
- Build internal developer platforms (Backstage)
- Create golden path templates and self-service tools
- Track DORA metrics and developer productivity
- Develop automation for infrastructure tasks
Qualifications :
Must Have:
Kubernetes & Containers
5 years production Kubernetes experience
Cluster design RBAC networking troubleshooting
Performance tuning and optimization
Infrastructure & Automation
Terraform for infrastructure provisioning
GitOps workflows (ArgoCD or Flux)
CI/CD pipelines (GitHub Actions Azure DevOps Jenkins GitLab CI)
Python OR Go for automation
Bash scripting
Cloud Platforms
Observability
Prometheus and Grafana (required)
Log aggregation (Loki ELK Splunk or similar)
Distributed tracing concepts
Alerting and on-call experience
Foundation
Strong Linux/Unix administration
Git and code review workflows
English proficiency
Technical documentation skills
Nice-to-Have
Helm Kustomize Crossplane
Container security and policy-as-code
Service mesh (Istio Linkerd)
Chaos engineering
FinOps tools and practices
Certifications: Azure (AZ-104 AZ-305) Kubernetes (CKA CKS) Terraform
AI Skills Well Train You
No AI experience required. We provide comprehensive training:
Claude Code and Cursor for development
AI agent integration in CI/CD
Multi-agent workflow orchestration
Additional Information :
Perks Youll Enjoy
- Working in one of the Best Places to Work in Vietnam
- Building large-scale & global software products
- Working & growing with Passionate & Talented Team
- Diverse careers opportunities with Software Outsourcing Software Product Development IT Solutions & Consulting
- Attractive Salary and Benefits
- Performance appraisals every year and performance bonus
- Onsite opportunities: short-term and long-term assignments in North American (U.S Canada) Europe Asia.
- Flexible working time
- Various training on hot-trend technologies best practices and soft skills
- Premium healthcare insurance for you and your loved ones
- Company trip big annual year-end party every year team building etc.
- Fitness & sport activities: football tennis table-tennis badminton yoga swimming
- Joining community development activities: 1% Pledge charity every quarter blood donation public seminars career orientation talks
- Free in-house entertainment facilities (foosball ping pong gym) coffee and snack (instant noodles cookies candies)
And much more join us and let yourself explore other fantastic things!
Remote Work :
No
Employment Type :
Full-time
View more
View less