Staff Software Engineer - Observability Platform

Job Location

Mountain View, CA - USA

Monthly Salary

Not Disclosed

Vacancy

1 Vacancy

Job Description

RDQ225R576

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems, from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business. Founded by engineers and customer obsessed, we leap at every opportunity to tackle technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started.

We develop and operate one of the largest-scale software platforms. The fleet consists of millions of virtual machines generating terabytes of logs and processing exabytes of data per day. At our scale, we observe cloud, hardware, network, and operating system faults, and our software must gracefully shield our customers from any of the above.

As a software engineer on the Observability Platform team, you will develop observability solutions that provide insights into the health and performance of our products and infrastructure.

The impact you will have:

  • You will build the next generation of observability platforms that support billions of active time series and process petabytes of logs daily.
  • You will manage infrastructure across nearly a hundred cloud regions, enabling all Databricks engineers and customers to monitor the reliability of our product.
  • You will develop advanced workflows that accelerate incident diagnosis for Bricksters, allowing engineers to quickly derive insights from logs and metrics. You will leverage the powerful capabilities of Databricks' own data intelligence platform to push the boundaries of troubleshooting practices in the industry.
  • You will uplevel monitoring and reliability practices across Databricks engineering, developing opinionated tools that set common standards for managing structured logs, metrics, alerts, dashboards, and on-call rotations.
  • You will mentor and uplevel engineers, fostering a culture of technical excellence within the team and the broader observability community.

What we look for:

  • BS (or higher) in Computer Science or a related field.
  • 7 years of production-level experience in one of: Go, Python, Java, Scala, Rust, C, or similar languages.
  • Experience in software development in large-scale distributed systems.
  • Experience driving large projects involving multiple teams.
  • Experience with cloud technologies, e.g., AWS, Azure, GCP, Docker, or Kubernetes.
  • Familiarity with observability infrastructure, monitoring patterns, and reliability practices.

Required Experience:

Staff IC

Employment Type

Full-Time
