data
Posted Mar 14Research Scientist, Reinforcement Learning
at DeepMind
London, United KingdomOn-site
You are nearing today's limit. Upgrade for unlimited access.
Responsibilities
- A research track record in RL, including peer-reviewed publications.
Requirements
- At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact.
- If you have a disability or additional need that requires accommodation, please do not hesitate to let us know. Snapshot
- We're looking for talented Research Scientists to push forward fundamental research and technology in Artificial Intelligence, as part of our interdisciplinary and collaborative Reinforcement Learning team. About Us
- Over the past decade, members of the RL team have been instrumental in building DQN, AlphaGo, Rainbow, AlphaZero, MuZero, AlphaStar, AlphaProof and Gemini.
- As a Research Scientist, you'll use machine learning knowledge and technical know-how to innovate, drive research projects, as well as apply research to impactful problems.
- Our projects span the full range of state-of-the-art machine learning and AI fields, including large language models, distributed machine learning techniques, and much more, but with an emphasis on reinforcement learning.
- In order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience:
- PhD in ML preferred, or equivalent practical experience.
- Experience with RL for sequence models, post-training, preference-based learning, or agentic systems. •
- Experience with modern research stacks (e.g., JAX/Flax or PyTorch) and scaling experiments.
Additional details
- DeepMind’s RL team is a long-standing and tight-knit team of collaborative scientists and engineers, led by Tom Schaul.
- We tackle large scale research challenges in reinforcement learning.
- We design, refine, and scale RL algorithms and deliver meaningful scientific or product impact.
- You will be expected to implement code, run experiments, own results end-to-end, communicate them internally or externally, as well as collaborate with and empower others.