Optimizes cloud resources for Stateful Kubernetes HPC and AI workloadsincluding GPUs.
End users are DevOps CloudOps managing complex distributed workloads across hundreds of clusters and hundreds of thousands of nodes.
Dashboards provide real-time observabilitymetrics andalertsat hyperscale.
The Role
Design and implementation of a modern UI for a multi-tenant infrastructure-scale platform including dashboards for real-time observability across 100k nodes
Develop Develop backend APIs and Integrations with Golang
Lead two software engineers (they are full stack with an emphasis on the UI)
Collaborate with two other dev teams (both are full stack with emphasis on platform and backend)
Experience & Skills
10 years of software engineering experience
Expertise in Go
Proven experience building scalable UI for multi-cluster multi-tenant environment modern frontend architecture and patternsUI State management and tooling
Extensive Golang skills including API design data flow optimization
Python Redis caching methods
Experience with Kubernetes (configuring pods operators CRDs) is a plus
Prometheus Grafana Loki OpenTelemetry
Authentication patterns and IAM (Keycloak or similar) is a plus
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.