Join our newly launched Roland Berger AI Lab to design, automate, and operate secure cloud infrastructure for (Gen)AI applications. You will build CI/CD and IaC foundations and implement observability and incident response. Working closely with Architects and Backend teams, you will productionize agentic/LLM services, enable safe model fine-tuning pipelines, and uphold service readiness with SLOs, cost controls, and performance guardrails. Expect hands-on delivery in Azure, AWS, and GCP, working from PoV to scaled rollout.
Qualifications :
- Degree in Computer Science/Engineering or equivalent experience
- 4 years of experience in DevOps/SRE on Azure, AWS, or GCP
- Expertise in CI/CD at scale (e.g., GitHub, GitLab, Azure DevOps), artifact management, and environment promotion
- Proficiency in Infrastructure as Code tools like Terraform, Bicep, or CloudFormation, including secrets and configuration management
- Strong experience with Kubernetes/containers (AKS, EKS, GKE), autoscaling, service mesh/ingress, and image security
- Familiarity with end-to-end observability: metrics, logs, and tracing; dashboards and alerts
- Experience in incident management and on-call readiness, including SLO/SLA design, runbooks, and post-mortems
- Understanding of cloud networking & security: VNet/VPC, IAM policies, Key Vault/KMS, and private endpoints
- Familiarity with MLOps/LLMOps workflows, including MLflow, model registries, evaluation/fine-tuning pipelines, and GPU workflows
- Excellent communication skills in English
Additional Information :
Do you have an entrepreneurial mindset with a winning personality? If so, we look forward to receiving your application (CV, high school diploma, certificates of all academic degrees, work certificates including internships, as well as proof of semesters abroad) via our online portal. If you have any questions, don't hesitate to contact me.
Remote Work :
No
Employment Type :
Full-time