Research Scientist, Reinforcement Learning

DeepMind
London

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Snapshot

We're looking for talented Research Scientists to push forward fundamental research and technology in Artificial Intelligence, as part of our interdisciplinary and collaborative Reinforcement Learning team.

About Us

DeepMind’s RL team is a long-standing and tight-knit team of collaborative scientists and engineers, led by Tom Schaul. We tackle large-scale research challenges in reinforcement learning. We design, refine, and scale RL algorithms and deliver meaningful scientific or product impact. Over the past decade, members of the RL team have been instrumental in building DQN, AlphaGo, Rainbow, AlphaZero, MuZero, AlphaStar, AlphaProof and Gemini. Join us to build the next big thing!

The Role

As a Research Scientist, you'll use machine learning knowledge and technical know-how to innovate, drive research projects, and apply research to impactful problems. You will be expected to implement code, run experiments, own results end-to-end, communicate them internally or externally, and collaborate with and empower others.

Your work may involve:

  • Initiating or pursuing novel research directions, by proposing and testing research hypotheses.
  • Implementing algorithmic ideas and running end-to-end experiments, including setup, execution, analysis, and iteration.
  • Sharing your skills and knowledge with other researchers.
  • Building or improving infrastructure for research at scale.
  • Designing evaluations and ablations that answer real questions and change minds.
  • Analyzing results carefully, including debugging and failure analysis.
  • Communicating clearly through plots, writeups, and paper-ready narratives and figures.
  • Contributing to a culture of first-principles thinking, high standards, and direct, constructive feedback.

Our projects span the full range of state-of-the-art machine learning and AI fields, including large language models, distributed machine learning techniques, and much more, but with an emphasis on reinforcement learning.

We take a holistic view of people's backgrounds, and do not expect you to be an expert in all areas. We do expect you to proactively and quickly adopt new technologies and systems, but we also invest a lot of time in training and helping people to continually learn as part of their role.

About You

In order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience:

  • A passion for reinforcement learning.
  • A research track record in RL, including peer-reviewed publications.
  • Strong implementation ability and comfort working in research codebases.
  • Evidence of owning experiments end-to-end, including analysis and interpretation.
  • Strong communication skills and a bias toward clarity and honesty regarding results.
  • High agency and drive: You push projects forward, prioritize effectively, and take initiative.
  • PhD in ML preferred, or equivalent practical experience.

In addition, the following would be an advantage:

  • Experience with RL for sequence models, post-training, preference-based learning, or agentic systems.
  • Experience with modern research stacks (e.g., JAX/Flax or PyTorch) and scaling experiments.
  • Strong experimental taste: Good judgment regarding baselines, ablations, and what is worth testing.
  • Comfort with scaling, evaluation methodologies, and diagnosing complex failure modes.
  • A focus on craft: You care about doing excellent work while maintaining a high velocity.
Posted 2026-06-21
