Employer Active
Job Alert
You will be updated with latest job alerts via emailJob Alert
You will be updated with latest job alerts via email$ 166000 - 291000
1 Vacancy
Artificial Intelligence could be one of humanitys most useful inventions. At DeepMind were a team of scientists engineers machine learning experts and more working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery and collaborate with others on critical challenges ensuring safety and ethics are the highest priority.
The current paradigm for Large Language Models (LLMs) is largely one-size-fits-all. While powerful this approach fails to capture the diverse implicit and evolving preferences of individual users. A users true intent is often revealed not in a single prompt but over the course of a long interactive conversation. The next frontier in AI is to move beyond static instruction-following and create models that dynamically learn and adapt to each user personalizing their behavior to maximize helpfulness and satisfaction over the long term.
Our team is focused on this challenge: teaching Gemini to personalize itself through interaction. We frame this problem as a multi-turn imperfect information game where the model must learn to infer a users latent goals and preferences from conversational cues. Our aim is to leverage advanced Reinforcement Learning techniques to optimize for long-horizon user satisfaction this involves tackling complex credit assignment problems in stateful interactive environments.
The techniques you develop will have a direct impact on a wide range of Gemini applications like complex multi-step tool use and agentic workflows.
Key responsibilities:
To make this effort successful we need a strong RS who can help us deliver state-of-the-art personalized models. We are looking for a candidate with deep expertise in reinforcement learning and large-scale ML systems. You should be passionate about solving complex long-horizon problems and excited by the challenge of building truly adaptive and intelligent agents.
In order to set you up for success as a Research Scientist at DeepMind we look for the following skills and experience:
In addition the following would be an advantage:
The US base salary range for this full-time position is between $166000 - $291000 bonus equity benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
Application Deadline: September 9 2025
At Google DeepMind we value diversity of experience knowledge backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex race religion or belief ethnic or national origin disability age citizenship marital domestic or civil partnership status sexual orientation gender identity pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation please do not hesitate to let us know.
Full Time