Senior Site Reliability Engineer (m/f/d)

Berlin - Germany

Yearly Salary: EUR 60 - 70

Posted on: 30+ days ago

Vacancies: 1 Vacancy

Job Summary

Are you passionate about building reliable, scalable systems and ensuring seamless operations in the healthcare industry? We are seeking an experienced Site Reliability Engineer (m/f/d) like you to join our team!

Since its founding in 2019, Famedly has been committed to digitizing medical communication processes in compliance with data protection regulations, thereby revolutionizing the healthcare system. Famedly has launched the first gematik-certified TI messenger to improve communication and collaboration within the healthcare sector. Famedly enables medical teams to share sensitive patient information, images, and other files in real time and from any location, from medication schedules and lab results to X-rays. As a dynamic, remote-first startup based in Berlin with a growing and experienced team, we work together every day towards our vision of a healthcare system without information barriers.

We are seeking a Site Reliability Engineer (m/f/d) to join our infrastructure team. In this role, you will ensure the reliability, scalability, and performance of our healthcare-critical systems. You'll design and implement SRE practices, build robust infrastructure, and collaborate with development teams to embed operational excellence throughout the software lifecycle.

Key Responsibilities:

Take ownership of the reliability, observability, and performance of our backend systems, spanning Rust microservices, containerised deployments on Kubernetes (K8s), and production operations in healthcare-critical environments.
Design, implement, and evolve SRE practices: define service-level indicators (SLIs), service-level objectives (SLOs), and error budgets; conduct blameless post-mortems and capacity planning; and develop disaster-recovery strategies.
Build and maintain our infrastructure as code, CI/CD pipelines, and configuration management, and simplify deployment workflows to support rapid, safe product iteration.
Collaborate closely with development teams to embed reliability and operational thinking early in the development lifecycle.
Lead automation of incident detection, alerting, diagnostics, and remediation, including instrumentation of services with structured logs, metrics, tracing, and dashboards.
Work cross-functionally to drive technical standards, share best practices, and mentor engineers on operational maturity.
Participate in incident response and root-cause investigations. Drive improvements based on findings.
Contribute to the architecture and roadmap of our platform: propose and evaluate new technologies and services, and help evolve our Kubernetes footprint and cloud strategy to meet healthcare-grade compliance and scalability demands.

Requirements:

Excellent German and English communication skills, both written and spoken.
Good understanding of modern software architecture, APIs, and system integrations.
5 years of experience in SRE/DevOps or infrastructure engineering at scale (preferably with SaaS or B2B products in regulated industries).
Strong hands-on experience with Kubernetes, container orchestration, service meshes (e.g. Istio), and microservice architecture.
Strong hands-on experience with observability tools and practices: metrics, logging, tracing, and alerting (e.g. Prometheus, Grafana, Tempo).
Proficiency in infrastructure as code, GitOps practices, and CI/CD pipelines.
Experience with incident management, on-call rotations, and conducting post-mortems to drive continuous improvement.
Understanding of reliability engineering principles: SLIs/SLOs, error budgets, availability modelling, and capacity planning.

Nice-to-haves:

Familiarity with cloud environments, networking, security, and compliance in regulated spaces is a strong plus.
Self-starter mindset with an ability to influence and uplift engineering culture in a fast-growing startup context.
Experience with Rust backends, multi-tenant architectures, Kubernetes operators, or service meshes in regulated production.
Experience with Nix and related tooling.

Why you should work at Famedly:

Work in an ambitious startup in an exciting growth phase - We have grown very quickly since 2019 and we still have big plans! Famedly has launched the first gematik-certified TI messenger, and we are developing it further with your help!
Attractive conditions - A permanent employment contract with appropriate remuneration (60-75k depending on your experience), a professional development budget, work equipment of your choice, and 28 vacation days per year.
Flexible benefits - You receive a monthly wellbeing budget that you can use flexibly for offers relating to fitness, nutrition, and mental health. In addition, you benefit from exclusive corporate discounts.
Your perfect work-life balance - With remote work, you decide where you are most productive. At the same time, you have the chance to work in our office in Berlin. The choice is yours: equip your home office or use a co-working space!
Responsibility and scope for action - At Famedly, you have the chance to bring your own ideas and wishes into the company through a defined process.
A diverse and international team - We value an open mindset and a diverse and inclusive culture where everyone feels welcome.
Regular team meetings and events - Workshops, team-building activities, virtual game nights, and company events such as the bi-annual Famedly Summit and our Office Weeks.

Ready to go? Then what are you waiting for? Apply now! We are looking forward to your CV! You don't meet all the requirements? We look forward to getting to know your personal path!

We are dedicated to promoting diversity, and this is a key value in our selection process. We are happy to welcome you to Famedly.

Do you have any questions? Feel free to send them to us:
