The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

by Sam Charrington

Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy busi...

Insights from recent episode analysis

Audience Interest

Estimated Reach: 133K to 510K

Listeners across platforms

Podcast Focus

Categories: technology · news

+1 more

Publishing Consistency

Frequency: ~3-4 / Week

750+ episodes since 2016

Platform Reach

Insights are generated by CastFox AI using publicly available data, episode content, and proprietary models.

Most discussed topics

multi-agent systemsgenerative aitechnical challengesreinforcement learningcustomer experienceenterprise aiai governancedeveloper experiencediffusion language modelstext generationcode generationinference speeds

Brands & references

Stanford University2

Sphere1

TRAM (Tax Review and Assessment Model)1

Kumo1

Stanford1

DoorDash1

Coinbase1

AI Virtual Cell1

Generic platforms filtered out.

High Confidence

Total monthly reach

133K to 510K

Estimated from 25 chart positions in 25 markets.

By chart position

🇺🇸
US · Technology
#141
5K to 30K
🇬🇧
GB · Technology
#194
5K to 30K
🇦🇺
AU · Technology
#196
5K to 30K
🇪🇸
ES · Technology
#38
30K to 100K
🇮🇳
IN · Technology
#42
30K to 100K

Per-Episode Audience
Est. listeners per new episode within ~30 days
67K to 255K
🎙 ~2x weekly·783 episodes·Last published 1w ago
Monthly Reach
Unique listeners across all episodes (30 days)
133K to 510K
🇪🇸20%🇮🇳20%🇫🇷20%+22 more
Active Followers
Loyal subscribers who consistently listen
53K to 204K

Market Insights

This ShowCategory Avg

No category insights available.

📡

Platform Distribution

Reach across major podcast platforms, updated hourly

Total Followers

—

Total Plays

—

Total Reviews

—

YouTube

Subscribers

—

Views

—

Videos

—

Castbox

Followers

—

Plays

—

Reviews

—

Podcast App

Followers

—

Plays

—

Reviews

—

Podcast Republic

Followers

—

Plays

—

Reviews

—

TuneIn

Followers

—

Plays

—

Reviews

—

* Data sourced directly from platform APIs and aggregated hourly across all major podcast directories.

On the show

From 14 eps

Host

Sam Charrington

14 eps

Recent guests

14 across last 14 eps

Rashmi Shetty

1 ep

Stefano Ermon

1 ep

Siddhant Pardeshi

1 ep

Sebastian Raschka

1 ep

Yejin Choi

1 ep

Nikita Rudin

1 ep

Aakanksha Chowdhery

1 ep

Munawar Hayat

1 ep

Zain Asgar

1 ep

Devi Parikh

1 ep

Philip Kiely

1 ep

Scott Clark

1 ep

Jure Leskovec

1 ep

Alex Bowcut

1 ep

Recent episodes

Why AI Agents Break the GenAI Security Model with Devvret Rishi - #770

Jun 16, 2026

56m 18s

Is RAG Dead? Lessons from Building AI for Tax Law with Alex Bowcut - #769

Jun 9, 2026

51m 32s

Relational Foundation Models for Enterprise Data with Jure Leskovec - #768

May 21, 2026

1h 06m 23s

How to Find the Agent Failures Your Evals Miss with Scott Clark - #767

May 7, 2026

53m 19s

How to Engineer AI Inference Systems with Philip Kiely - #766

Apr 30, 2026

54m 51s

🔗

Social Links & Contact

Official channels & resources

🌐

Official Website

📡

RSS Feed

Episodes

788

monthly

Avg length

1h 00m 15s

48m 44s – 1h 18m 55s

Range

Sep 2020 – Apr 2026

Topics

multi-agent systems, generative ai +78

Guests

Alex Bowcut +13 · last 14 eps

25 of 25

Date	Episode	Topics	Guests	Brands	Places	Keywords	Sponsor	Length
6/16/26	Why AI Agents Break the GenAI Security Model with Devvret Rishi - #770	In this episode, Sam talks with Dev Rishi, GM of AI at Rubrik, about what happens when agents move beyond answering questions and start taking action across tools, systems, and business processes. We explore why the enterprise playbook of static guardrails plus human approval starts to break down in the agent era. Agents are useful because they can plan, call tools, update systems, write code, send messages, and operate across workflows at machine speed, but those same capabilities make them difficult to govern with rules written in advance or approval prompts reviewed one at a time. Dev explains why tool access increases blast radius, why agents can route around controls in surprising ways, and why human-in-the-loop review can become security theater when agents operate at scale. We also discuss what enterprises need instead: better visibility, runtime enforcement, policy-aware governance, agent observability, and recovery mechanisms for when something goes wrong. Along the way, we dig into MCP and tool sprawl, small language models for policy enforcement, defense in depth, agent rewind, and why AI may be needed to help secure AI. 🗒️ Full show notes: https://twimlai.com/go/770.						56m 18s
6/9/26	Is RAG Dead? Lessons from Building AI for Tax Law with Alex Bowcut - #769✨	retrieval-augmented generationAI in tax law+4	Alex Bowcut	SphereTRAM (Tax Review and Assessment Model)	—	retrieval-augmented generationAI+5	—	51m 32s
5/21/26	Relational Foundation Models for Enterprise Data with Jure Leskovec - #768✨	AI for sciencerelational deep learning+4	Jure Leskovec	AI Virtual CellESM+8	—	relational deep learningenterprise data+5	—	1h 06m 23s
5/7/26	How to Find the Agent Failures Your Evals Miss with Scott Clark - #767✨	LLM systemsobservability+4	Scott Clark	Distributional	—	LLMobservability+5	—	53m 19s
4/30/26	How to Engineer AI Inference Systems with Philip Kiely - #766✨	AI inference systemsGPU programming+4	Philip Kiely	vLLMSGLang+2	—	inference engineeringGPU programming+5	—	54m 51s
4/16/26	How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765✨	multi-agent systemsgenerative AI+4	Rashmi Shetty	Chat ConciergeCapital One	—	multi-agent systemsgenerative AI+6	—	54m 18s
3/26/26	The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764✨	diffusion language modelstext generation+4	Stefano Ermon	Mercury 2Stanford University+1	—	diffusion modelslanguage models+5	—	1h 03m 18s
3/10/26	Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi - #763✨	autonomous software developmentAI-assisted coding+4	Siddhant Pardeshi	Blitzy	—	autonomous systemsAI agents+5	—	1h 16m 14s
2/26/26	AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka - #762✨	AI trendsLLM landscape+4	Sebastian Raschka	Build A Reasoning Model (From Scratch)	—	LLMreasoning+6	—	1h 18m 55s
1/29/26	The Evolution of Reasoning in Small Language Models with Yejin Choi - #761✨	small language modelsreasoning capabilities+4	Yejin Choi	Stanford UniversityInstitute for Human-Centered AI+1	—	language modelsreasoning+5	—	1h 06m 21s
Want analysis for the episodes below?Free for Pro Submit a request, we'll have your selected episodes analyzed within an hour. Free, at no cost to you, for Pro users.
1/8/26	Intelligent Robots in 2026: Are We There Yet? with Nikita Rudin - #760✨	roboticsautonomous robots+4	Nikita Rudin	Flexion Robotics	—	roboticsautonomous robots+4	—	1h 06m 37s
12/17/25	Rethinking Pre-Training for Agentic AI with Aakanksha Chowdhery - #759✨	agentic AIpre-training+5	Aakanksha Chowdhery	PaLMGemini+2	—	agentic AIpre-training+5	—	52m 54s
12/9/25	Why Vision Language Models Ignore What They See with Munawar Hayat - #758✨	Vision-Language Modelsmultimodal AI+4	Munawar Hayat	Qualcomm AI Research	—	Vision-Language Modelsobject hallucination+4	—	57m 40s
12/2/25	Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757✨	AI inferenceheterogeneous compute+3	Zain Asgar	Gimlet Labs	—	AI workloadsGPUs+3	—	48m 44s
11/19/25	Proactive Agents for the Web with Devi Parikh - #756✨	proactive agentsweb interaction+4	Devi Parikh	Yutori	—	proactive agentsbrowser use models+4	—	56m 04s
11/12/25	AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755	Today, we're joined by Robin Braun, VP of AI business development for hybrid cloud at HPE, and Luke Norris, co-founder and CEO of Kamiwaza, to discuss how AI systems can be used to automate complex workflows and unlock value from legacy enterprise data. Robin and Luke detail high-impact use cases from HPE and Kamiwaza’s collaboration on an “Agentic Smart City” project for Vail, Colorado, including remediation and automation of website accessibility for 508 compliance, digitization and understanding of deed restrictions, and combining contextual information with camera feeds for fire detection and risk assessment. Additionally, we discuss the role of private cloud infrastructure in overcoming challenges like cost, data privacy, and compliance. Robin and Luke also share their lessons learned, including the importance of fresh data, and the value of a "mud puddle by mud puddle" approach in achieving practical AI wins. The complete show notes for this episode can be found at https://twimlai.com/go/755.						54m 46s
11/4/25	Building an AI Mathematician with Carina Hong - #754	In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a pivotal moment for AI in mathematics, citing a convergence of three key areas: the advanced reasoning capabilities of modern LLMs, the rise of formal proof languages like Lean, and breakthroughs in code generation. We explore the core technical challenges, including the massive data gap between general-purpose code and formal math code, and the difficult problem of "autoformalization," or translating natural language proofs into a machine-verifiable format. Carina also shares Axiom's vision for a self-improving system that uses a self-play loop of conjecturing and proving to discover new mathematical knowledge. Finally, we discuss the broader applications of this technology in areas like formal verification for high-stakes software and hardware. The complete show notes for this episode can be found at https://twimlai.com/go/754.						55m 52s
10/28/25	High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753	In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the technical challenges of deploying these models, which are powerful but computationally expensive due to their iterative sampling process. Hung details his team's work on SwiftBrush and SwiftEdit, which enable high-quality text-to-image generation and editing in a single inference step. He explains their novel distillation framework, where a multi-step teacher model guides the training of an efficient, single-step student model. We explore the architecture and training, including the use of a secondary 'coach' network that aligns the student's denoising function with the teacher's, allowing the model to bypass the iterative process entirely. Finally, we discuss how these efficiency breakthroughs pave the way for personalized on-device agents and the challenges of running reasoning models with techniques like inference-time scaling under a fixed compute budget. The complete show notes for this episode can be found at https://twimlai.com/go/753.						52m 23s
10/22/25	Vibe Coding's Uncanny Valley with Alexandre Pesant - #752	Today, we're joined by Alexandre Pesant, AI lead at Lovable, who joins us to discuss the evolution and practice of vibe coding. Alex shares his take on how AI is enabling a shift in software development from typing characters to expressing intent, creating a new layer of abstraction similar to how high-level code compiles to machine code. We explore the current capabilities and limitations of coding agents, the importance of context engineering, and the practices that separate successful vibe coders from frustrated ones. Alex also shares Lovable’s technical journey, from an early, complex agent architecture that failed, to a simpler workflow-based system, and back again to an agentic approach as foundation models improved. He also details the company's massive scaling challenges—like accidentally taking down GitHub—and makes the case for why robust evaluations and more expressive user interfaces are the most critical components for AI-native development tools to succeed in the near future. The complete show notes for this episode can be found at https://twimlai.com/go/752.						1h 12m 36s
10/14/25	Dataflow Computing for AI Inference with Kunle Olukotun - #751	In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at Sambanova Systems, to discuss reconfigurable dataflow architectures for AI inference. Kunle explains the core idea of building computers that are dynamically configured to match the dataflow graph of an AI model, moving beyond the traditional instruction-fetch paradigm of CPUs and GPUs. We explore how this architecture is well-suited for LLM inference, reducing memory bandwidth bottlenecks and improving performance. Kunle reviews how this system also enables efficient multi-model serving and agentic workflows through its large, tiered memory and fast model-switching capabilities. Finally, we discuss his research into future dynamic reconfigurable architectures, and the use of AI agents to build compilers for new hardware. The complete show notes for this episode can be found at https://twimlai.com/go/751.						57m 37s
10/7/25	Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750	Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques to overcome them, including windowed attention, grouped query attention, and latent space attention. We explore the idea of weight-state balance and the weight-state FLOP ratio as a way of reasoning about the optimality of compute architectures, and we dig into the Power Retention architecture, which blends the parallelization of attention with the linear scaling of recurrence and promises speedups of >10x during training and >100x during inference. We review Manifest AI’s recent open source projects as well: Vidrial—a custom CUDA framework for building highly optimized GPU kernels in Python, and PowerCoder—a 3B-parameter coding model fine-tuned from StarCoder to use power retention. Our chat also covers the use of metrics like in-context learning curves and negative log likelihood to measure context utility, the implications of scaling laws, and the future of long context lengths in AI applications. The complete show notes for this episode can be found at https://twimlai.com/go/750.						57m 23s
9/30/25	The Decentralized Future of Private AI with Illia Polosukhin - #749	In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vision for building private, decentralized, and user-owned AI. Illia shares his unique journey from developing the Transformer architecture at Google to building the NEAR Protocol blockchain to solve global payment challenges, and now applying those decentralized principles back to AI. We explore how Near AI is creating a decentralized cloud that leverages confidential computing, secure enclaves, and the blockchain to protect both user data and proprietary model weights. Illia also shares his three-part approach to fostering trust: open model training to eliminate hidden biases and "sleeper agents," verifiability of inference to ensure the model runs as intended, and formal verification at the invocation layer to enforce composable guarantees on AI agent actions. Finally, Illia shares his perspective on the future of open research, the role of tokenized incentive models, and the need for formal verification in building compliance and user trust. The complete show notes for this episode can be found at https://twimlai.com/go/749.						1h 05m 03s
9/23/25	Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748	Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newly released frontier vision-language model, beginning with the broader shift from specialized image generators to general-purpose multimodal agents that can use both visual and textual data for a variety of tasks. Oliver explains how Nano Banana can generate and iteratively edit images while maintaining consistency, and how its integration with Gemini’s world knowledge expands creative and practical use cases. We discuss the tension between aesthetics and accuracy, the relative maturity of image models compared to text-based LLMs, and scaling as a driver of progress. Oliver also shares surprising emergent behaviors, the challenges of evaluating vision-language models, and the risks of training on AI-generated data. Finally, we look ahead to interactive world models and VLMs that may one day “think” and “reason” in images. The complete show notes for this episode can be found at https://twimlai.com/go/748.						1h 03m 39s
9/16/25	Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747	Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to discuss the limitations of LLMs and how we can build more adaptable and creative models. We dig into her ICML 2025 Outstanding Paper Award winner, “Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction,” which examines why LLMs struggle with generating truly novel ideas. We dig into the "Roll the dice" approach, which encourages structured exploration by injecting randomness at the start of generation, and the "Look before you leap" concept, which trains models to take "leaps of thought" using alternative objectives to create more diverse and structured outputs. We also discuss Aditi’s papers exploring the counterintuitive phenomenon of "catastrophic overtraining," where training models on more data improves benchmark performance but degrades their ability to be fine-tuned for new tasks, and dig into her lab's work on creating more controllable and reliable models, including the concept of "memorization sinks," an architectural approach to isolate and enable the targeted unlearning of specific information. The complete show notes for this episode can be found at https://twimlai.com/go/747.						58m 26s
9/9/25	Building an Immune System for AI Generated Software with Animesh Koratana - #746	Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero to discuss his team’s approach to making agentic and AI-assisted coding tools production-ready at scale. Animesh explains how rapid advances in AI-assisted coding have created an “asymmetry” where the speed of code output outpaces the maturity of processes for maintenance and support. We explore PlayerZero’s debugging and code verification platform, which uses code simulations to build a "memory bank" of past bugs and leverages an ensemble of LLMs and agents to proactively simulate and verify changes, predicting potential failures. Animesh also unpacks the underlying technology, including a semantic graph that analyzes code bases, ticketing systems, and telemetry to trace and reason through complex systems, test hypotheses, and apply reinforcement learning techniques to create an “immune system” for software. Finally, Animesh shares his perspective on the future of the software development lifecycle (SDLC), rethinking organizational workflows, and ensuring security as AI-driven tools continue to mature. The complete show notes for this episode can be found at https://twimlai.com/go/746.						1h 05m 11s