Are you ready to develop a greenfield stack spanning Google Cloud Azure and our own GPU cluster in Leipzig At Cyber Insight we develop DARA an AI-powered cyber-risk platform that is trusted by original equipment manufacturers (OEMs) and enterprises. We guarantee >99.95% uptime and release new features in hours not days. Join our 15-person hybrid team and take ownership of the infrastructure that keeps our customers safe.
Tasks
- Automate end-to-end deployments with GitOps (e.G. Argo CD/Flux) and CI/CD pipelines (e.G. GitHub Actions Google Cloud Build).
- Operate & scale a on-prem environment (NVIDIA GPU servers).
- Build an observability stack (e.G. Prometheus Grafana OpenTelemetry) and define SLOs for latency error budget and uptime.
- Harden Kubernetes implement Zero-Trust networking and manage secrets with Vault.
- Collaborate with Data & ML engineers so new AI models reach production in minutes.
Requirements
- 3 years as a DevOps/SRE in cloud or hybrid environments.
- Solid hands-on with Terraform Kubernetes Docker and Linux networking.
- Experience running CI/CD for microservices; GitOps mindset preferred.
- Security & hardening know-how (IAM least privilege vulnerability patching).
- Nice to have: Ceph Ansible Argo Workflow or Kubeflow.
- Fluent in English (our team language); German is a plus.
Benefits
- Ownership from Day 1: blueprint and build the infra you always wanted.
- Cutting-edge tech: GPU cluster MLOps IaCno legacy.
- VSOP possible hardware of your choice.
- Flat hierarchy 30 days vacation.
If you love automating everything care about secure observable systems and want your work to stop real-world cyber threats lets talk. Apply now with your rsum and tell us your salary expectations. We aim to move from first call to offer within two weeks.