Google DeepMind: The Podcast

Apr 10 2025 49 mins 475 1 0

Other Episodes

In this episode of Google DeepMind: The Podcast, VP of Reinforcement Learning, David Silver, describes his vision for the future of AI, exploring the concept of the "era of experience" versus the current "era of human data". Using AlphaGo and AlphaZero as examples, he highlights how these systems surpassed human capabilities by engaging in reinforcement learning without prior human knowledge. This approach contrasts with large language models, which depend on human data and feedback. Silver emphasizes the need to explore this path to drive AI progress and achieve artificial superintelligence.

Timestamps

00:00 Introduction
01:50 Era of experience
03:45 AlphaZero
10:19 Move 37
15:20 Reinforcement learning and human feedback
24:30 AlphaProof
29:50 Math Olympiads
35:00 Experience based methods
42:56 Hannah's reflections
44:00 Fan Hui joins

___

Thanks to everyone who made this possible, including but not limited to:

Presenter: Professor Hannah Fry
Series Producer: Dan Hardoon
Series Editor: Rami Tzabar
Commissioner & Producer: Emma Yousif
Music Composition: Eleni Shaw
Audio Engineer: Richard Courtice
Production Manager: Dan Lazard
Video Director and Editor: Bernardo Resende
Video Studio Production: Nicholas Duke
Video Editor: Bilal Merhi
Audio Engineer: Perry Rogantin
Camera and Lighting Operator: Robert Messere
Production Coordination: Zoey Roberts, Sarah Ellen Morton
Visual Identity and Design: Rob Ashley
Commissioned by Google DeepMind

Please like and subscribe on your preferred podcast platform. Want to share feedback? Or have a suggestion for a guest that we should have on next? Leave us a comment on YouTube and stay tuned for future episodes.

Download episode Share

Copy URL

Listen on Podcast Addict

Subscribe on Podcast Addict

Is Human Data Enough? With David Silver

Apr 10 2025 49 mins 475 1 0