drjobs Engineering Manager, Fleet Clusters

Engineering Manager, Fleet Clusters

Employer Active

1 Vacancy
drjobs

Job Alert

You will be updated with latest job alerts via email
Valid email field required
Send jobs
Send me jobs like this
drjobs

Job Alert

You will be updated with latest job alerts via email

Valid email field required
Send jobs
Job Location drjobs

San Francisco, CA - USA

Yearly Salary drjobs

USD 300000 - 450000

Vacancy

1 Vacancy

Job Description

About the Team

Our team runs the GPU fleet that serves the models backing ChatGPT and the API. We build automation to provision and manage one of the largest cutting edge GPU inference fleets in the world exposing it as a singular platform for other OpenAI teams to seamlessly run production applied AI workloads.

We seek to learn from deployment and distribute the benefits of AI while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.

About the Role

We are looking for an experienced engineering manager to help lead our Fleet Clusters team. Youll be responsible for building scaling and operating the massive GPU fleet clusters that power AI inference and general purpose training at OpenAI. This role focuses on designing and managing largescale highavailability GPU clusters across multiple environments ensuring reliability scalability and efficiency. You will partner closely with product research and infrastructure teams to rapidly ship and support advanced AI products at global scale.

In this role you will:

  • Manage and build a diverse team of high performing infrastructure engineers

  • Guide the roadmap for automation for a fleet that can grow an order of magnitude in size or more

  • Build a worldclass secure compute fleet that serves users at scale

  • Set technical direction on evolving our compute and abstractions to support a growing business

  • Collaborate closely with a broad set of stakeholders including product engineering inference security research and finance

  • Work with external partners to unlock bleeding edge compute and making it available as a turnkey resource for scheduling workloads

  • Coach and nurture engineers to accelerate their growth and learning

You might thrive in this role if you:

  • 10 years of experience in infrastructure software engineering including 5 years in engineering management.

  • Proven track record of building highperformance computing infrastructure teams at scale.

  • Handson experience provisioning baremetal server data centers interconnected across WANs.

  • Experience designing and operating hybridcloud platforms.

  • Strong commitment to diversity equity and inclusion with a history of building inclusive teams.

  • Ownership mentality: willing to pick up new skills and knowledge to solve problems endtoend. Comfortable being handson when needed to help debug systems and support the team.

  • Ability to operate effectively in fastpaced environments with loosely defined priorities and competing deadlines.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that generalpurpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core and to achieve our mission we must encompass and value the many different perspectives voices and experiences that form the full spectrum of humanity.

We are an equal opportunity employer and do not discriminate on the basis of race religion national origin gender sexual orientation age veteran status disability or any other legally protected status.

OpenAI Affirmative Action and Equal Employment Opportunity Policy Statement

For US Based Candidates: Pursuant to the San Francisco Fair Chance Ordinance we will consider qualified applicants with arrest and conviction records.

We are committed to providing reasonable accommodations to applicants with disabilities and requests can be made via thislink.

OpenAI Global Applicant Privacy Policy

At OpenAI we believe artificial intelligence has the potential to help people solve immense global challenges and we want the upside of AI to be widely shared. Join us in shaping the future of technology.


Required Experience:

Manager

Employment Type

Full-Time

About Company

Report This Job
Disclaimer: Drjobpro.com is only a platform that connects job seekers and employers. Applicants are advised to conduct their own independent research into the credentials of the prospective employer.We always make certain that our clients do not endorse any request for money payments, thus we advise against sharing any personal or bank-related information with any third party. If you suspect fraud or malpractice, please contact us via contact us page.