Your mission
Reporting directly to the CTO, you'll join a team of two responsible for everything that runs our Adtech solution, from DevEx, CI/CD, and cloud infrastructure to incident response and production reliability.
We're not firefighters: everyone on the team owns decisions, drives improvements, and gets plenty of room to make things better. We work proactively with developers, data engineers, and other stakeholders, not just to put out fires but to build better, more resilient infrastructure as we go.
What you will work on:
- Work with our core tech stack: AWS (multi-account, multi-region), Terraform, EKS (Kubernetes), complex GitLab CI pipelines, Helm, RDS, S3, RabbitMQ, Lambda, Python, and more, plus observability tools like Prometheus, Loki, Datadog, and OpenTelemetry.
- Troubleshoot complex issues across distributed systems and apply SRE principles to drive root cause analysis, long-term fixes, and platform-wide reliability improvements.
- Design and implement robust backup and disaster recovery strategies for both stateless and stateful services.
- Collaborate with engineers, stakeholders, and DevOps teammates to design, evolve, and maintain a scalable and secure cloud platform.
- Continuously improve our tooling, automation, and operational workflows to reduce friction, enhance developer experience, and enable faster, safer shipping.
- Stay current with the evolving DevOps and cloud-native ecosystem, not just to grow your own skill set but to help elevate the team's knowledge, challenge assumptions, and introduce better ways of thinking and working.
Your profile
- You have 3 to 5 years of hands-on experience in DevOps, SRE, or platform engineering roles.
- You are confident working with AWS at scale, including IAM, networking, and security best practices, and you know how to balance cost, performance, and simplicity.
- You are fluent in Terraform (or OpenTofu) and use it to build clean, modular, and scalable infrastructure.
- You have solid experience managing Kubernetes (we use EKS) in production, with a strong grasp of workload security, CI/CD patterns, and day-2 operations.
- You are comfortable building and maintaining modern CI/CD pipelines using tools like GitLab CI or ArgoCD, and you care about developer experience.
- You are proficient in Python, Go, or Bash for automation, tooling, and optimising engineering workflows.
- You use observability tools such as Prometheus, Loki, OpenTelemetry, or Datadog to build insight into systems and debug issues quickly.
- You understand and apply SLIs/SLOs, and you know how to turn monitoring into actionable alerting.
- You stay calm during incidents, troubleshoot effectively, and know when to roll back or escalate.
- You communicate clearly, write good documentation, and support your decisions with reasoning.
- You are curious: you enjoy learning, questioning the status quo, and improving the platform for everyone around you.
Why us
- Work-Life Balance: 30 days of paid vacation.
- Commuter Benefits: Public transportation tickets provided.
- Professional Development: Annual education budget of 1500.
- Workation Opportunities: Combine work and vacation annually.
- Wellness: Access to over 7,000 gyms and spas in Germany through Wellpass.
- Catering: Monthly team lunches, plus daily fruit, vegetables, and a variety of beverages.
- Flexibility: Flexible working hours and hybrid work model.
- Corporate Benefits: Exclusive discounts for major brands and platforms.
- Diversity: Join an international team with diverse cultural backgrounds.
- Fun and Games: Socializing area for relaxing activities during the workday.
Please note: this is not a remote-only position. We offer a flexible hybrid model here in Hamburg, Germany: working from home on Mondays and Fridays, and coming to the office on Tuesdays, Wednesdays, and Thursdays.