This is a remote position.
We are seeking a Backend Engineer (Grafana and Prometheus) to join our team. You'll help build the core platform powering LLM-native development.
Responsibilities
- Design and ship features that give users deep insight into their LLM usage and performance.
- Integrate, proxy, cache, and collect data from OpenAI, Anthropic, Gemini, and other major model providers that our customers rely on.
- Develop and evolve robust, efficient open source libraries for tracing and evaluating LLM calls inside customer applications.
- Build highly available, real-time data pipelines to ingest, store, and query large volumes of structured and semi-structured AI usage data.
- Own and operate backend systems designed to run reliably in both our SaaS and customer-managed environments.
- Collaborate closely with frontend and product engineers to ship polished, end-to-end features that solve real problems for our customers.
- Improve our observability, operability, and internal tooling to help us move fast without breaking things.
Requirements
- 5+ years of backend engineering experience across multiple parts of the stack.
- Strong systems thinking: you've owned or contributed to infrastructure projects and know how to design for scale, uptime, and data integrity.
- Proficient in modern backend technologies (e.g., Go, Python, Rust, Postgres, Redis, Terraform, Docker, AWS).
- Experience with observability tooling (e.g., Grafana, Prometheus, Datadog, or similar) and a passion for making systems operable, available, and rock solid.
- Comfortable with ambiguous problems and excited to partner across the company to ship impactful work.
- Clear communicator who documents decisions, shares context, and helps lift the team.
Benefits
- Work Location: Remote
- Five-day work week