Senior SRE Engineer Cloud Operations

Berlin - Germany

Monthly Salary: Not Disclosed

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

Qdrant is a cutting-edge vector database company on a mission to revolutionize how organizations manage and query unstructured data. Our open-source engine and managed cloud solutions power AI-driven search recommendation and data discovery at scale. We are a remote-first company building a global team of passionate engineers to push the boundaries of database infrastructure.

As a Senior DevOps / SRE Engineer on the Cloud Operations team you will focus on keeping Qdrant Cloud reliable observable and secure as usage and infrastructure complexity grow. Your primary responsibility is operational excellence: stability incident response and continuous improvement of production systems.

This role is operations-heavy ideal for engineers who thrive in owning reliability and reducing operational risk at scale.

Tasks

Operate and maintain production cloud infrastructure at scale
Own Kubernetes infrastructure networking and deployment pipelines
Improve monitoring logging alerting and operational visibility
Lead incident response root cause analysis and follow-up actions
Reduce operational toil through automation and better tooling
Improve reliability security and performance of production systems
Collaborate closely with Platform and Regions & Clusters teams
Maintain and evolve runbooks operational procedures and alerts
Participate in on-call rotations and continuous reliability improvements

Requirements

Must have

5 years of experience in DevOps SRE or infrastructure operations roles
Strong hands-on experience operating Kubernetes in production
Solid knowledge of Linux systems networking and cloud infrastructure
Experience working with AWS GCP or Azure
Strong understanding of monitoring alerting and incident management
Experience with infrastructure-as-code and automation tooling
Comfortable owning on-call responsibilities and production incidents
Strong operational mindset and clear communication skills

Nice to have

Experience with Terraform or similar IaC tools
Familiarity with Prometheus Grafana Loki or OpenTelemetry
Exposure to security compliance or hardening initiatives
Scripting experience in Python Bash or Go
Experience in SaaS cloud or data infrastructure environments

Benefits

Competitive salary equity and benefits
Fully remote setup with flexible working hours
Clear ownership of reliability and operational excellence
Opportunity to work on mission-critical customer-facing infrastructure
Strong collaboration with platform and engineering teams

If you enjoy keeping complex systems reliable and improving operations through automation and discipline wed love to hear from you.

Recruiting Agencies and Headhunters please only via 𝙝𝙞𝙧𝙚𝙗𝙪𝙛𝙛𝙚𝙧.𝙘𝙤𝙢refqdrant

This role is operations-heavy ideal for engineers who thrive in owning reliability and reducing operational risk at scale.

Tasks

Operate and maintain production cloud infrastructure at scale
Own Kubernetes infrastructure networking and deployment pipelines
Improve monitoring logging alerting and operational visibility
Lead incident response root cause analysis and follow-up actions
Reduce operational toil through automation and better tooling
Improve reliability security and performance of production systems
Collaborate closely with Platform and Regions & Clusters teams
Maintain and evolve runbooks operational procedures and alerts
Participate in on-call rotations and continuous reliability improvements

Requirements

Must have

5 years of experience in DevOps SRE or infrastructure operations roles
Strong hands-on experience operating Kubernetes in production
Solid knowledge of Linux systems networking and cloud infrastructure
Experience working with AWS GCP or Azure
Strong understanding of monitoring alerting and incident management
Experience with infrastructure-as-code and automation tooling
Comfortable owning on-call responsibilities and production incidents
Strong operational mindset and clear communication skills

Nice to have

Experience with Terraform or similar IaC tools
Familiarity with Prometheus Grafana Loki or OpenTelemetry
Exposure to security compliance or hardening initiatives
Scripting experience in Python Bash or Go
Experience in SaaS cloud or data infrastructure environments

Benefits

Competitive salary equity and benefits
Fully remote setup with flexible working hours
Clear ownership of reliability and operational excellence
Opportunity to work on mission-critical customer-facing infrastructure
Strong collaboration with platform and engineering teams

If you enjoy keeping complex systems reliable and improving operations through automation and discipline wed love to hear from you.

Recruiting Agencies and Headhunters please only via 𝙝𝙞𝙧𝙚𝙗𝙪𝙛𝙛𝙚𝙧.𝙘𝙤𝙢refqdrant

Key Skills

Change Management
Software Deployment
Cloud Infrastructure
High Availability
IaaS
Firewall
Linux
Middleware
Jboss
Network Architecture
Scripting
Technical Support

Apply Now

About Company

Qdrant

Qdrant is powering the next generation of AI applications with advanced, high-performant vector similarity search technology. Our flagship product is the leading open-source Vector Search Engine. https://github.com/qdrant/qdrant

View Profile View Profile

AI AutoApply

Apply to 100+ jobs with one click

AI Resume Builder

Create an ATS-ready CV in minutes

AI Cover Letter

Write a personalized letter instantly

Senior SRE Engineer Cloud Operations

Berlin - Germany

Job Summary

Tasks

Requirements

Benefits

Tasks

Requirements

Benefits

Key Skills

About Company

Related Jobs