drjobs Manager II, Engineering - APM Root Cause Analysis

Manager II, Engineering - APM Root Cause Analysis

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

New York City, NY - USA

Monthly Salary drjobs

Not Disclosed

drjobs

Salary Not Disclosed

Vacancy

1 Vacancy

Job Description

The APM Root Cause Analysis teams mission is to help engineers and SREs respond rapidly and effectively to incidents affecting their production systems. During an incident one of the first questions a responder asks is What is the change that caused the incidentand thats exactly what this team aims to answer.

To answer that question the team is building several impactful systems:

  • A platform to ingest interesting changes from across our customer environments (Deployments DB changes Feature Flag changes K8s changes etc.)
  • A system to process past incidents in our environment and label the faulty changes that led to the incidents enabling us to build a high quality evaluation dataset for faulty change detection
  • A system that uses LLM ML and statistical models to assess whether a specific change is the cause of an incident
  • A product experience to expose those faulty changes in strategic locations in the product in a way that aids incident response and reduces MTTR

As a manager you will play an active role in shaping the roadmap for automated root cause analysis through collaboration with multiple stakeholder teams. You will have a deep and immediate impact in guiding the product through your design and engineering decisions.

At Datadog we place value in our office culture - the relationships that it builds the creativity it brings to the table and the collaboration of being together. We operate as a hybrid workplace to ensure our employees can create a work-life harmony that best fits them.


What Youll Do:

  • Solve challenging and ambiguous problems of automating root cause analysis through faulty change detection using latest agentic AI approaches as well as ML anomaly detection and statistical methods
  • Evaluate and benchmark the quality and real-world performance of the automated faulty change detection model
  • Lead and mentor a team of experienced software engineers fostering their career growth while ensuring high team performance
  • Drive the technical roadmap in collaboration with your team product managers and design teams

Who You Are:

  • An experienced software engineering leader with a track record of successfully delivering GenAI/ML products at scale
  • Experienced working with high scale distributed systems as well as participating in and structuring on-call processes for them
  • You are passionate about building products that solve real user problems you are adept at formulating an opinion on the product direction and how we should structure our execution strategy
  • You have a BS/MS/PhD in a Computer Science Engineering or related scientific field or equivalent experience

Datadog values people from all walks of life. We understand not everyone will meet all the above qualifications on day one. Thats okay. If youre passionate about technology and want to grow your skills we encourage you to apply.

Benefits and Growth:

  • New hire stock equity (RSUs) and employee stock purchase plan (ESPP)
  • Continuous professional development product training and career pathing
  • Intradepartmental mentor and buddy program for in-house networking
  • An inclusive company culture ability to join our Community Guilds (Datadog employee resource groups)
  • Access to Inclusion Talks our Internal panel discussions
  • Free global mental health benefits for employees and dependents age 6
  • Competitive global benefits


Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.


Required Experience:

Manager

Employment Type

Full Time

Company Industry

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.