Software Engineering Manager, Triage Services and Infrastructure

Apple


Job Location:

Cupertino, CA - USA

Monthly Salary: Not Disclosed
Posted on: 12 hours ago
Vacancies: 1 Vacancy

Job Summary

The Core OS team is seeking an exceptional engineering manager to lead the team responsible for enabling Apples operating systems to achieve world-class reliability. This team develops and owns mission-critical tools and services that detect analyze and classify kernel panics and low-level crashes across all Apple platforms. You will be partnering with engineering teams across Software Hardware and Silicon groups to drive and deliver the rock-solid OS reliability for over 2 billion currently active Apple devices and shape the future of system reliability across Apples entire product ecosystem.

Lead a team of engineers triaging kernel panics and critical system-level issues across all Apple platforms (macOS iOS watchOS tvOS). Build intelligent automation pipelines that analyze group and prioritize failure signatures based on their reliability impact. Mentor engineers to design and develop advanced systems diagnostic and at-scale debug services to realize the vision of zero-iteration debugging and fully automated triage and root cause analysis. Develop telemetry-based dashboards to monitor at-scale panic/crash triage and analysis services to ensure they are working as expected and efficiently. Collaborate with Core OS Hardware Silicon and other engineering teams to champion and advance improvements in debuggability panic data quality symbolication and automation of triage and debug workflows.

Build and manage a world-class Panic Triage u0026 Tools team developing senior systems engineers into technical leadersnDefine and execute the multi-year technical roadmap for platform triage and reliability partnering with senior cross-functional leaders to align with Apples quality standardsnAttract develop and retain top-tier talent while fostering a culture of technical innovation collaborative problem-solving and engineering excellencenDrive engineering quality scalability and reliability for debug and triage services handling large scale of daily events across Apples ecosystemnEnsure the teams tools and processes directly contribute to the stability and reliability that defines the Apple user experience

Demonstrated track record of building and scaling high-performing engineering teamsnPassion for solving challenging technical problems that directly impact millions of usersnStrong communication skills with ability to influence technical direction across organizational boundariesnExperience managing complex multi-platform technical initiatives with measurable reliability improvementsnStrong technical depth in operating system internals will be helpfulnBS/MS in Computer Science Compute Engineering Electrical Engineering or equivalent experience

Experience applying AI/ML for automated triage and reliability services is preferrednExperience with large-scale telemetry systems processing millions of events daily is preferred

Required Experience:

Manager

The Core OS team is seeking an exceptional engineering manager to lead the team responsible for enabling Apples operating systems to achieve world-class reliability. This team develops and owns mission-critical tools and services that detect analyze and classify kernel panics and low-level crashes a...

About Company

Company Logo

Ask Siri to name the most successful company in the world and it might respond: Apple. And it's not just out of familial pride. Apple consistently ranks highly in profit, revenue, market capitalization, and consumer cachet. In 2018, the company became the first reach a trillion dollar ... View more

View Profile View Profile