Technical Program Manager, RL Infrastructure & Reliability

Google DeepMind

Not Interested
Bookmark
Report This Job

profile Job Location:

Mountain View, CA - USA

profile Monthly Salary: $ 156000 - 229000
Posted on: 30+ days ago
Vacancies: 1 Vacancy

Job Summary

The Role
As a Technical Program Manager for Reinforcement Learning (RL) Infrastructure & Reliability you will focus on a critical rapidly evolving area: the post-training stack that refines and improves Gemini models. You will be a hands-on driver of technical programs embedding with engineering teams to enhance the reliability performance and scalability of the infrastructure that powers our most advanced RL workloads.

This role is for a TPM who thrives on ambiguity and technical depth. You will lead concrete engineering initiatives from driving performance optimization programs to owning the execution of reliability roadmaps. Your work will have a direct and measurable impact on the quality of our models and the velocity of our research.

Responsibilities

  • Performance & Efficiency Optimization: Drive technical programs focused on optimizing the performance and efficiency of post-training and RL workloads. This includes quantitative analysis developing shared dashboards and guiding engineering execution on improvements.
  • Reliability Roadmap Execution: Execute key projects from the post-training reliability roadmap such as improving monitoring tools and centralizing core services to enhance the stability of the entire stack.
  • Code Health Initiatives: Own the technical project management for initiatives aimed at improving the long-term health testability and maintainability of the RL infrastructure codebases.
  • Roadmap & Backlog Management: Manage the engineering backlog and tactical execution for core RL framework development ensuring progress is tracked and aligned with the teams strategic roadmap.
  • Cross-Functional Coordination: Build effective working relationships with engineering teams guiding alignment on project goals managing interdependencies and ensuring clear communication and risk management.
  • Program Governance: Contribute to the broader program management of the Frameworks and Infrastructure team providing clear stakeholder updates and supporting team-wide events.

Minimum Qualifications

  • Bachelors degree in a technical field or equivalent practical experience.
  • 5 years of experience in program or project management in a technical software environment.
  • Experience working directly with engineering teams on the software development lifecycle.

Preferred Qualifications

  • 5 years of relevant work experience.
  • Experience with machine learning workflows particularly in training post-training or MLOps. Direct experience with Reinforcement Learning (RL) is a strong plus.
  • Strong analytical skills with experience in performance analysis reliability engineering (SRE) or technical efficiency projects.
  • Proficiency with project management and development tools (e.g. Jira Gantt charts) for managing technical backlogs.
  • Excellent interpersonal and communication skills with a demonstrated ability to work effectively in ambiguous fast-paced R&D environments.

The US base salary range for this full-time position is between $156000 - $229000 bonus equity benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application deadline: 12pm PST Friday 14th Nov 2025

Note: In the event your application is successful and an offer of employment is made to you any offer of employment will be conditional on the results of a background check performed by a third party acting on our behalf. For more information on how we handle your data please see our Applicant and Candidate Privacy Policy.

At Google DeepMind we value diversity of experience knowledge backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunities regardless of sex race religion or belief ethnic or national origin disability age citizenship marital domestic or civil partnership status sexual orientation gender identity pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation please do not hesitate to let us know.


Required Experience:

Manager

The RoleAs a Technical Program Manager for Reinforcement Learning (RL) Infrastructure & Reliability you will focus on a critical rapidly evolving area: the post-training stack that refines and improves Gemini models. You will be a hands-on driver of technical programs embedding with engineering team...
View more view more

Key Skills

  • Lean Manufacturing
  • Six Sigma
  • Food Industry
  • Root cause Analysis
  • SAP
  • CMMS
  • Conflict Management
  • Maintenance Management
  • Maintenance
  • Supplier Management
  • Team Management
  • Programmable Logic Controllers

About Company

Company Logo

Artificial intelligence could be one of humanity’s most useful inventions. We research and build safe artificial intelligence systems. We're committed to solving intelligence, to advance science and benefit humanity.

View Profile View Profile