Tasks
Positions: Senior Site Reliability Engineer
Location: Madrid, Spain
Duration: 12 Months + Possible extension
About the role
As a Site Reliability Engineer you will be part of a squad of very dynamic, highly motivated and diverse engineers with 9 different nationalities, developing and operating state-of-the-art logging, monitoring & event management platforms. With your open mind, you will be providing consultancy around logging and monitoring to application, product and service owners as well as developers across client, for them to better understand their workloads running in multi-cloud platforms and improve our cloud infrastructure and application oversight and stability, as well as our service resilience.
About you
We are looking forward to your application, particularly when you possess:
- 5+ years' software development, continuous integration/deployment and system engineering experience in cloud-native ecosystems.
- Hands on expertise in open-source application and infrastructure monitoring tools, e.g. Influx stack (TICK), elastic stack (ELK), Prometheus and Grafana, as well as how these are deployed and operated in modern hybrid cloud environment, i.e. through container orchestration system such as Kubernetes running in a cloud environment such as Azure, including the performance, network and security implications.
Requirements
- Experience in a modern language e.g. Camunda, Golang, Java and in scripting languages (Shell, PowerShell, Python).
- Passion for sharing knowledge, through interactive sessions as well as documentation.
- Strong analytical and problem-solving skills, as well as the ability to focus on details without losing track of the bigger picture.
- Excellent oral and written English skills, additional language skills are a plus.