TalkRL: The Reinforcement Learning Podcast

by Robin Ranjit Singh Chauhan

TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Rob...

Insights from recent episode analysis

Audience Interest

Estimated Reach: 1K to 10K

Listeners across platforms

Podcast Focus

Categories: technology

Publishing Consistency

Frequency: ~Weekly

50+ episodes since 2019

Platform Reach

Insights are generated by CastFox AI using publicly available data, episode content, and proprietary models.

Most discussed topics

reinforcement learning

research

awards

decision making

ai research

dreamer v4

Brands & references

University of Alberta3

Google Scholar3

Openmind Research Institute2

Google DeepMind2

DeepMind2

the University of Edinburgh2

The Alberta Plan for AI Research1

Dreamer v41

Generic platforms filtered out.

Low Confidence

Total monthly reach

1K to 10K

Estimated from 1 chart position in 1 market.

By chart position

🇯🇵
JP · Technology
#108
1K to 10K

Per-Episode Audience
Est. listeners per new episode within ~30 days
500 to 5K
🎙 Weekly cadence·74 episodes·Last published 5mo ago
Monthly Reach
Unique listeners across all episodes (30 days)
1K to 10K
🇯🇵100%
Active Followers
Loyal subscribers who consistently listen
300 to 3K

Market Insights

This ShowCategory Avg

No category insights available.

📡

Platform Distribution

Reach across major podcast platforms, updated hourly

Total Followers

—

Total Plays

—

Total Reviews

—

YouTube

Subscribers

—

Views

—

Videos

—

Castbox

Followers

—

Plays

—

Reviews

—

Podcast App

Followers

—

Plays

—

Reviews

—

Podcast Republic

Followers

—

Plays

—

Reviews

—

TuneIn

Followers

—

Plays

—

Reviews

—

* Data sourced directly from platform APIs and aggregated hourly across all major podcast directories.

On the show

From 10 eps

Hosts

Not detected.

Recent guests

20 across last 10 eps

Joseph Modayil

1 ep

Danijar Hafner

1 ep

David Abel

1 ep

Jake Beck

1 ep

Alex Goldie

1 ep

Cornelius Braun

1 ep

Ayush Jain

1 ep

Norio Kosaka

1 ep

Xinhu Li

1 ep

Kyung-Min Kim

1 ep

Erdem Biyik

1 ep

Joseph J Lim

1 ep

William Solow

1 ep

Sandhya Saisubramanian

1 ep

Alan Fern

1 ep

Calarina Muslimani

1 ep

Kerrick Johnstonbaugh

1 ep

Suyog Chandramouli

1 ep

Serena Booth

1 ep

W. Bradley Knox

1 ep

Recent episodes

Joseph Modayil of Openmind Research Institute @ RLC 2025

Jan 3, 2026

4m 27s

Danijar Hafner on Dreamer v4

Nov 10, 2025

1h 40m 52s

David Abel on the Science of Agency @ RLDM 2025

Sep 8, 2025

59m 42s

Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels @ RLC 2025

Aug 19, 2025

12m 20s

Outstanding Paper Award Winners - 2/2 @ RLC 2025

Aug 18, 2025

14m 18s

🔗

Social Links & Contact

Official channels & resources

🌐

Official Website

📡

RSS Feed

Episodes

monthly

Avg length

23m 43s

4m 27s – 1h 40m 52s

Range

Aug 2020 – Jan 2026

Topics

reinforcement learning, research +20

Guests

Joseph Modayil +51 · last 10 eps

25 of 25

Date	Episode	Topics	Guests	Brands	Places	Keywords	Sponsor	Length
1/3/26	Joseph Modayil of Openmind Research Institute @ RLC 2025✨	Reinforcement LearningAI Research	Joseph Modayil	Openmind Research InstituteThe Alberta Plan for AI Research+1	—	Openmind Research InstituteAI+1	—	4m 27s
11/10/25	Danijar Hafner on Dreamer v4✨	Reinforcement LearningDreamer v4+2	Danijar Hafner	Dreamer v4Dreamer v3+4	—	Google DeepMindResearch Scientist+3	—	1h 40m 52s
9/8/25	David Abel on the Science of Agency @ RLDM 2025✨	Reinforcement LearningAgency+2	David Abel	DeepMindAgency+3	—	DeepMindUniversity of Edinburgh+1	—	59m 42s
8/19/25	Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels @ RLC 2025✨	OaK ArchitectureMetalearning+2	Jake BeckAlex Goldie+1	OaK ArchitectureAlberta Plan+4	EdmontonAlberta+1	Reinforcement Learning ConferenceUniversity of Alberta+1	—	12m 20s
8/18/25	Outstanding Paper Award Winners - 2/2 @ RLC 2025✨	Reinforcement LearningResearch+1	Ayush JainNorio Kosaka+16	WOFOSTGymCrop Simulator+1	EdmontonAlberta+1	Empirical Reinforcement LearningCrop Management+2	—	14m 18s
8/15/25	Outstanding Paper Award Winners - 1/2 @ RLC 2025✨	Reinforcement LearningResearch+1	Alexander David GoldieZilin Wang+17	PufferLib 2.0RLC+1	EdmontonAlberta+1	RLC 2025Outstanding Paper+3	—	6m 46s
8/4/25	Thomas Akam on Model-based RL in the Brain✨	Model-based Reinforcement LearningNeuroscience+1	Thomas Akam	pyPhotometrypyControl+5	—	brain architectureadaptive behavior+1	—	52m 06s
7/22/25	Stefano Albrecht on Multi-Agent RL @ RLDM 2025✨	Multi-Agent Reinforcement LearningAI+2	Stefano V. Albrecht	Multi-Agent Reinforcement Learning: Foundations and Modern ApproachesEPyMARL+4	DublinIreland	AIDeepflow+3	—	31m 34s
6/25/25	Satinder Singh: The Origin Story of RLDM @ RLDM 2025✨	Reinforcement LearningDecision Making+1	Satinder Singh	RLDM 2025Google DeepMind+5	IrelandDublin	Google DeepMindU of Michigan+1	—	5m 57s
3/9/25	NeurIPS 2024 - Posters and Hallways 3✨	Reinforcement LearningNeurIPS 2024+1	Claire Bizon MonrocAndrew Wagenmaker+4	InriaUC Berkeley+4	VancouverBC+1	Wind Farm ControlSim-to-Real Gap+3	—	10m 01s
Want analysis for the episodes below?Free for Pro Submit a request, we'll have your selected episodes analyzed within an hour. Free, at no cost to you, for Pro users.
3/5/25	NeurIPS 2024 - Posters and Hallways 2	Posters and Hallway episodes are short interviews and poster summaries. Recorded at NeurIPS 2024 in Vancouver BC Canada. Featuring Jonathan Cook from University of Oxford: Artificial Generational Intelligence: Cultural Accumulation in Reinforcement Learning Yifei Zhou from Berkeley AI Research: DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Rory Young from University of Glasgow: Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach Glen Berseth from MILA: Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn Alexander Rutherford from University of Oxford: JaxMARL: Multi-Agent RL Environments and Algorithms in JAX						8m 48s
3/3/25	NeurIPS 2024 - Posters and Hallways 1	Posters and Hallway episodes are short interviews and poster summaries. Recorded at NeurIPS 2024 in Vancouver BC Canada. Featuring Jiaheng Hu of University of Texas: Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning Skander Moalla of EPFL: No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO Adil Zouitine of IRT Saint Exupery/Hugging Face : Time-Constrained Robust MDPs Soumyendu Sarkar of HP Labs : SustainDC: Benchmarking for Sustainable Data Center Control Matteo Bettini of Cambridge University: BenchMARL: Benchmarking Multi-Agent Reinforcement Learning Michael Bowling of U Alberta : Beyond Optimism: Exploration With Partially Observable Rewards						9m 32s
2/10/25	Abhishek Naik on Continuing RL & Average Reward	Abhishek Naik was a student at University of Alberta and Alberta Machine Intelligence Institute, and he just finished his PhD in reinforcement learning, working with Rich Sutton. Now he is a postdoc fellow at the National Research Council of Canada, where he does AI research on Space applications. Featured References Reinforcement Learning for Continuing Problems Using Average Reward Abhishek Naik Ph.D. dissertation 2024 Reward Centering Abhishek Naik, Yi Wan, Manan Tomar, Richard S. Sutton 2024 Learning and Planning in Average-Reward Markov Decision Processes Yi Wan, Abhishek Naik, Richard S. Sutton 2020 Discounted Reinforcement Learning Is Not an Optimization Problem Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton 2019 Additional References Explaining dopamine through prediction errors and beyond, Gershman et al 2024 (proposes Differential-TD-like learning mechanism in the brain around Box 4)						1h 21m 40s
12/23/24	Neurips 2024 RL meetup Hot takes: What sucks about RL?	What do RL researchers complain about after hours at the bar? In this "Hot takes" episode, we find out! Recorded at The Pearl in downtown Vancouver, during the RL meetup after a day of Neurips 2024. Special thanks to "David Beckham" for the inspiration :)						17m 45s
9/20/24	RLC 2024 - Posters and Hallways 5	Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring: 0:01 David Radke of the Chicago Blackhawks NHL on RL for professional sports 0:56 Abhishek Naik from the National Research Council on Continuing RL and Average Reward 2:42 Daphne Cornelisse from NYU on Autonomous Driving and Multi-Agent RL 08:58 Shray Bansal from Georgia Tech on Cognitive Bias for Human AI Ad hoc Teamwork 10:21 Claas Voelcker from University of Toronto on Can we hop in general? 11:23 Brent Venable from The Institute for Human & Machine Cognition on Cooperative information dissemination						13m 17s
9/19/24	RLC 2024 - Posters and Hallways 4	Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring: 0:01 David Abel from DeepMind on 3 Dogmas of RL 0:55 Kevin Wang from Brown on learning variable depth search for MCTS 2:17 Ashwin Kumar from Washington University in St Louis on fairness in resource allocation 3:36 Prabhat Nagarajan from UAlberta on Value overestimation						4m 52s
9/18/24	RLC 2024 - Posters and Hallways 3	Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring: 0:01 Kris De Asis from Openmind on Time Discretization 2:23 Anna Hakhverdyan from U of Alberta on Online Hyperparameters 3:59 Dilip Arumugam from Princeton on Information Theory and Exploration 5:04 Micah Carroll from UC Berkeley on Changing preferences and AI alignment						6m 43s
9/16/24	RLC 2024 - Posters and Hallways 2	Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring: 0:01 Hector Kohler from Centre Inria de l'Université de Lille with "Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning" 2:29 Quentin Delfosse from TU Darmstadt on "Interpretable Concept Bottlenecks to Align Reinforcement Learning Agents" 4:15 Sonja Johnson-Yu from Harvard on "Understanding biological active sensing behaviors by interpreting learned artificial agent policies" 6:42 Jannis Blüml from TU Darmstadt on "OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments" 8:20 Cameron Allen from UC Berkeley on "Resolving Partial Observability in Decision Processes via the Lambda Discrepancy" 9:48 James Staley from Tufts on "Agent-Centric Human Demonstrations Train World Models" 14:54 Jonathan Li from Rensselaer Polytechnic Institute						15m 52s
9/10/24	RLC 2024 - Posters and Hallways 1	Posters and Hallway episodes are short interviews and poster summaries. Recorded at RLC 2024 in Amherst MA. Featuring: 0:01 Ann Huang from Harvard on Learning Dynamics and the Geometry of Neural Dynamics in Recurrent Neural Controllers 1:37 Jannis Blüml from TU Darmstadt on HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning 3:13 Benjamin Fuhrer from NVIDIA on Gradient Boosting Reinforcement Learning 3:54 Paul Festor from Imperial College London on Evaluating the impact of explainable RL on physician decision-making in high-fidelity simulations: insights from eye-tracking metrics						5m 46s
9/2/24	Finale Doshi-Velez on RL for Healthcare @ RCL 2024	Finale Doshi-Velez is a Professor at the Harvard Paulson School of Engineering and Applied Sciences. This off-the-cuff interview was recorded at UMass Amherst during the workshop day of RL Conference on August 9th 2024. Host notes: I've been a fan of some of Prof Doshi-Velez' past work on clinical RL and hoped to feature her for some time now, so I jumped at the chance to get a few minutes of her thoughts -- even though you can tell I was not prepared and a bit flustered tbh. Thanks to Prof Doshi-Velez for taking a moment for this, and I hope to cross paths in future for a more in depth interview. References Finale Doshi-Velez Homepage @ Harvard Finale Doshi-Velez on Google Scholar						7m 35s
8/28/24	David Silver 2 - Discussion after Keynote @ RCL 2024	Thanks to Professor Silver for permission to record this discussion after his RLC 2024 keynote lecture. Recorded at UMass Amherst during RCL 2024.Due to the live recording environment, audio quality varies. We publish this audio in its raw form to preserve the authenticity and immediacy of the discussion. References AlphaProof announcement on DeepMind's blogDiscovering Reinforcement Learning Algorithms, Oh et al -- His keynote at RLC 2024 referred to more recent update to this work, yet to be published Reinforcement Learning Conference 2024 David Silver on Google Scholar						16m 17s
8/26/24	David Silver @ RCL 2024	David Silver is a principal research scientist at DeepMind and a professor at University College London. This interview was recorded at UMass Amherst during RLC 2024. References Discovering Reinforcement Learning Algorithms, Oh et al -- His keynote at RLC 2024 referred to more recent update to this work, yet to be published Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, Silver et al 2017 -- the AlphaZero algo was used in his recent work on AlphaProof AlphaProof on the DeepMind blog AlphaFold on the DeepMind blog Reinforcement Learning Conference 2024 David Silver on Google Scholar						11m 27s
4/8/24	Vincent Moens on TorchRL	Dr. Vincent Moens is an Applied Machine Learning Research Scientist at Meta, and an author of TorchRL and TensorDict in pytorch. Featured References TorchRL: A data-driven decision-making library for PyTorch Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni De Fabritiis, Vincent Moens Additional References TorchRL on github TensorDict Documentation						40m 14s
3/25/24	Arash Ahmadian on Rethinking RLHF	Arash Ahmadian is a Researcher at Cohere and Cohere For AI focussed on Preference Training of large language models. He’s also a researcher at the Vector Institute of AI.Featured ReferenceBack to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMsArash Ahmadian, Chris Cremer, Matthias Gallé, Marzieh Fadaee, Julia Kreutzer, Olivier Pietquin, Ahmet Üstün, Sara HookerAdditional ReferencesSelf-Rewarding Language Models, Yuan et al 2024 Reinforcement Learning: An Introduction, Sutton and Barto 1992Learning from Delayed Rewards, Chris Watkins 1989Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning, Williams 1992						33m 30s
3/11/24	Glen Berseth on RL Conference	Glen Berseth is an assistant professor at the Université de Montréal, a core academic member of the Mila - Quebec AI Institute, a Canada CIFAR AI chair, member l'Institute Courtios, and co-director of the Robotics and Embodied AI Lab (REAL). Featured Links Reinforcement Learning Conference Closing the Gap between TD Learning and Supervised Learning--A Generalisation Point of View Raj Ghugare, Matthieu Geist, Glen Berseth, Benjamin Eysenbach						21m 38s