* 3 years of experience in an Observability, SRE, DevOps, or similar role, with a strong focus on open-source tools.
* Expertise in Prometheus:
  * Strong understanding of PromQL for querying and alerting.
  * Experience with Prometheus architecture, exporters, and Alertmanager.
  * Demonstrable experience configuring and implementing Prometheus deployments from scratch or for new use cases.
* Proficiency in Grafana:
  * Extensive experience creating complex dashboards, panels, and data sources.
  * Knowledge of Grafana alerting and templating.
* Proficiency with Logstash for designing, configuring, and implementing data parsing and transformation pipelines.
* Ability to create effective Kibana dashboards and visualizations.
* Strong understanding of and practical experience with OpenTelemetry:
  * Knowledge of OpenTelemetry concepts (metrics, traces, logs).
  * Experience with the OpenTelemetry Collector and with instrumenting applications.
  * Demonstrated ability to implement and configure OpenTelemetry across various services and integrate it with backend systems.
* Scripting and Automation:
* Experience with cloud platforms (e.g., AWS, Azure, GCP) and their monitoring services.
* Familiarity with containerization (Docker) and orchestration (Kubernetes) technologies, and their impact on observability.
* Troubleshooting and Problem-Solving:
  * Excellent analytical and problem-solving skills with a methodical approach to troubleshooting complex issues.