Stephen Casper on Technical and Sociotechnical AI Safety Research


Episode Artwork
1.0x
0% played 00:00 00:00
Aug 02 2024 59 mins   7

Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.


Our music is by Micah Rubin (Producer) and John Lisi (Composer).


For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.