
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
by Sam Charrington
Is this your podcast?Insights from recent episode analysis
Audience Interest
Podcast Focus
Publishing Consistency
Platform Reach
Insights are generated by CastFox AI using publicly available data, episode content, and proprietary models.
Most discussed topics
Brands & references
Total monthly reach
Estimated from 25 chart positions in 25 markets.
By chart position
- 🇺🇸US · Technology#1415K to 30K
- 🇬🇧GB · Technology#1945K to 30K
- 🇦🇺AU · Technology#1965K to 30K
- 🇪🇸ES · Technology#3830K to 100K
- 🇮🇳IN · Technology#4230K to 100K
- Per-Episode Audience
Est. listeners per new episode within ~30 days
67K to 255K🎙 ~2x weekly·783 episodes·Last published 1w ago - Monthly Reach
Unique listeners across all episodes (30 days)
133K to 510K🇪🇸20%🇮🇳20%🇫🇷20%+22 more - Active Followers
Loyal subscribers who consistently listen
53K to 204K
Market Insights
Platform Distribution
Reach across major podcast platforms, updated hourly
Total Followers
—
Total Plays
—
Total Reviews
—
* Data sourced directly from platform APIs and aggregated hourly across all major podcast directories.
On the show
From 14 epsHost
Recent guests
Recent episodes
Why AI Agents Break the GenAI Security Model with Devvret Rishi - #770
Jun 16, 2026
56m 18s
Is RAG Dead? Lessons from Building AI for Tax Law with Alex Bowcut - #769
Jun 9, 2026
51m 32s
Relational Foundation Models for Enterprise Data with Jure Leskovec - #768
May 21, 2026
1h 06m 23s
How to Find the Agent Failures Your Evals Miss with Scott Clark - #767
May 7, 2026
53m 19s
How to Engineer AI Inference Systems with Philip Kiely - #766
Apr 30, 2026
54m 51s
Social Links & Contact
Official channels & resources
Official Website
Login
RSS Feed
Login
| Date | Episode | Topics | Guests | Brands | Places | Keywords | Sponsor | Length | |
|---|---|---|---|---|---|---|---|---|---|
| 6/16/26 | ![]() Why AI Agents Break the GenAI Security Model with Devvret Rishi - #770 | In this episode, Sam talks with Dev Rishi, GM of AI at Rubrik, about what happens when agents move beyond answering questions and start taking action across tools, systems, and business processes. We explore why the enterprise playbook of static guardrails plus human approval starts to break down in the agent era. Agents are useful because they can plan, call tools, update systems, write code, send messages, and operate across workflows at machine speed, but those same capabilities make them difficult to govern with rules written in advance or approval prompts reviewed one at a time. Dev explains why tool access increases blast radius, why agents can route around controls in surprising ways, and why human-in-the-loop review can become security theater when agents operate at scale. We also discuss what enterprises need instead: better visibility, runtime enforcement, policy-aware governance, agent observability, and recovery mechanisms for when something goes wrong. Along the way, we dig into MCP and tool sprawl, small language models for policy enforcement, defense in depth, agent rewind, and why AI may be needed to help secure AI. 🗒️ Full show notes: https://twimlai.com/go/770. | 56m 18s | ||||||
| 6/9/26 | ![]() Is RAG Dead? Lessons from Building AI for Tax Law with Alex Bowcut - #769✨ | retrieval-augmented generationAI in tax law+4 | Alex Bowcut | SphereTRAM (Tax Review and Assessment Model) | — | retrieval-augmented generationAI+5 | — | 51m 32s | |
| 5/21/26 | ![]() Relational Foundation Models for Enterprise Data with Jure Leskovec - #768✨ | AI for sciencerelational deep learning+4 | Jure Leskovec | AI Virtual CellESM+8 | — | relational deep learningenterprise data+5 | — | 1h 06m 23s | |
| 5/7/26 | ![]() How to Find the Agent Failures Your Evals Miss with Scott Clark - #767✨ | LLM systemsobservability+4 | Scott Clark | Distributional | — | LLMobservability+5 | — | 53m 19s | |
| 4/30/26 | ![]() How to Engineer AI Inference Systems with Philip Kiely - #766✨ | AI inference systemsGPU programming+4 | Philip Kiely | vLLMSGLang+2 | — | inference engineeringGPU programming+5 | — | 54m 51s | |
| 4/16/26 | ![]() How Capital One Delivers Multi-Agent Systems with Rashmi Shetty - #765✨ | multi-agent systemsgenerative AI+4 | Rashmi Shetty | Chat ConciergeCapital One | — | multi-agent systemsgenerative AI+6 | — | 54m 18s | |
| 3/26/26 | ![]() The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764✨ | diffusion language modelstext generation+4 | Stefano Ermon | Mercury 2Stanford University+1 | — | diffusion modelslanguage models+5 | — | 1h 03m 18s | |
| 3/10/26 | ![]() Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi - #763✨ | autonomous software developmentAI-assisted coding+4 | Siddhant Pardeshi | Blitzy | — | autonomous systemsAI agents+5 | — | 1h 16m 14s | |
| 2/26/26 | ![]() AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka - #762✨ | AI trendsLLM landscape+4 | Sebastian Raschka | Build A Reasoning Model (From Scratch) | — | LLMreasoning+6 | — | 1h 18m 55s | |
| 1/29/26 | ![]() The Evolution of Reasoning in Small Language Models with Yejin Choi - #761✨ | small language modelsreasoning capabilities+4 | Yejin Choi | Stanford UniversityInstitute for Human-Centered AI+1 | — | language modelsreasoning+5 | — | 1h 06m 21s | |
Want analysis for the episodes below?Free for Pro Submit a request, we'll have your selected episodes analyzed within an hour. Free, at no cost to you, for Pro users. | |||||||||
| 1/8/26 | ![]() Intelligent Robots in 2026: Are We There Yet? with Nikita Rudin - #760✨ | roboticsautonomous robots+4 | Nikita Rudin | Flexion Robotics | — | roboticsautonomous robots+4 | — | 1h 06m 37s | |
| 12/17/25 | ![]() Rethinking Pre-Training for Agentic AI with Aakanksha Chowdhery - #759✨ | agentic AIpre-training+5 | Aakanksha Chowdhery | PaLMGemini+2 | — | agentic AIpre-training+5 | — | 52m 54s | |
| 12/9/25 | ![]() Why Vision Language Models Ignore What They See with Munawar Hayat - #758✨ | Vision-Language Modelsmultimodal AI+4 | Munawar Hayat | Qualcomm AI Research | — | Vision-Language Modelsobject hallucination+4 | — | 57m 40s | |
| 12/2/25 | ![]() Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757✨ | AI inferenceheterogeneous compute+3 | Zain Asgar | Gimlet Labs | — | AI workloadsGPUs+3 | — | 48m 44s | |
| 11/19/25 | ![]() Proactive Agents for the Web with Devi Parikh - #756✨ | proactive agentsweb interaction+4 | Devi Parikh | Yutori | — | proactive agentsbrowser use models+4 | — | 56m 04s | |
| 11/12/25 | ![]() AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755 | Today, we're joined by Robin Braun, VP of AI business development for hybrid cloud at HPE, and Luke Norris, co-founder and CEO of Kamiwaza, to discuss how AI systems can be used to automate complex workflows and unlock value from legacy enterprise data. Robin and Luke detail high-impact use cases from HPE and Kamiwaza’s collaboration on an “Agentic Smart City” project for Vail, Colorado, including remediation and automation of website accessibility for 508 compliance, digitization and understanding of deed restrictions, and combining contextual information with camera feeds for fire detection and risk assessment. Additionally, we discuss the role of private cloud infrastructure in overcoming challenges like cost, data privacy, and compliance. Robin and Luke also share their lessons learned, including the importance of fresh data, and the value of a "mud puddle by mud puddle" approach in achieving practical AI wins. The complete show notes for this episode can be found at https://twimlai.com/go/755. | 54m 46s | ||||||
| 11/4/25 | ![]() Building an AI Mathematician with Carina Hong - #754 | In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a pivotal moment for AI in mathematics, citing a convergence of three key areas: the advanced reasoning capabilities of modern LLMs, the rise of formal proof languages like Lean, and breakthroughs in code generation. We explore the core technical challenges, including the massive data gap between general-purpose code and formal math code, and the difficult problem of "autoformalization," or translating natural language proofs into a machine-verifiable format. Carina also shares Axiom's vision for a self-improving system that uses a self-play loop of conjecturing and proving to discover new mathematical knowledge. Finally, we discuss the broader applications of this technology in areas like formal verification for high-stakes software and hardware. The complete show notes for this episode can be found at https://twimlai.com/go/754. | 55m 52s | ||||||
| 10/28/25 | ![]() High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753 | In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the technical challenges of deploying these models, which are powerful but computationally expensive due to their iterative sampling process. Hung details his team's work on SwiftBrush and SwiftEdit, which enable high-quality text-to-image generation and editing in a single inference step. He explains their novel distillation framework, where a multi-step teacher model guides the training of an efficient, single-step student model. We explore the architecture and training, including the use of a secondary 'coach' network that aligns the student's denoising function with the teacher's, allowing the model to bypass the iterative process entirely. Finally, we discuss how these efficiency breakthroughs pave the way for personalized on-device agents and the challenges of running reasoning models with techniques like inference-time scaling under a fixed compute budget. The complete show notes for this episode can be found at https://twimlai.com/go/753. | 52m 23s | ||||||
| 10/22/25 | ![]() Vibe Coding's Uncanny Valley with Alexandre Pesant - #752 | Today, we're joined by Alexandre Pesant, AI lead at Lovable, who joins us to discuss the evolution and practice of vibe coding. Alex shares his take on how AI is enabling a shift in software development from typing characters to expressing intent, creating a new layer of abstraction similar to how high-level code compiles to machine code. We explore the current capabilities and limitations of coding agents, the importance of context engineering, and the practices that separate successful vibe coders from frustrated ones. Alex also shares Lovable’s technical journey, from an early, complex agent architecture that failed, to a simpler workflow-based system, and back again to an agentic approach as foundation models improved. He also details the company's massive scaling challenges—like accidentally taking down GitHub—and makes the case for why robust evaluations and more expressive user interfaces are the most critical components for AI-native development tools to succeed in the near future. The complete show notes for this episode can be found at https://twimlai.com/go/752. | 1h 12m 36s | ||||||
| 10/14/25 | ![]() Dataflow Computing for AI Inference with Kunle Olukotun - #751 | In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at Sambanova Systems, to discuss reconfigurable dataflow architectures for AI inference. Kunle explains the core idea of building computers that are dynamically configured to match the dataflow graph of an AI model, moving beyond the traditional instruction-fetch paradigm of CPUs and GPUs. We explore how this architecture is well-suited for LLM inference, reducing memory bandwidth bottlenecks and improving performance. Kunle reviews how this system also enables efficient multi-model serving and agentic workflows through its large, tiered memory and fast model-switching capabilities. Finally, we discuss his research into future dynamic reconfigurable architectures, and the use of AI agents to build compilers for new hardware. The complete show notes for this episode can be found at https://twimlai.com/go/751. | 57m 37s | ||||||
| 10/7/25 | ![]() Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750 | Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques to overcome them, including windowed attention, grouped query attention, and latent space attention. We explore the idea of weight-state balance and the weight-state FLOP ratio as a way of reasoning about the optimality of compute architectures, and we dig into the Power Retention architecture, which blends the parallelization of attention with the linear scaling of recurrence and promises speedups of >10x during training and >100x during inference. We review Manifest AI’s recent open source projects as well: Vidrial—a custom CUDA framework for building highly optimized GPU kernels in Python, and PowerCoder—a 3B-parameter coding model fine-tuned from StarCoder to use power retention. Our chat also covers the use of metrics like in-context learning curves and negative log likelihood to measure context utility, the implications of scaling laws, and the future of long context lengths in AI applications. The complete show notes for this episode can be found at https://twimlai.com/go/750. | 57m 23s | ||||||
| 9/30/25 | ![]() The Decentralized Future of Private AI with Illia Polosukhin - #749 | In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vision for building private, decentralized, and user-owned AI. Illia shares his unique journey from developing the Transformer architecture at Google to building the NEAR Protocol blockchain to solve global payment challenges, and now applying those decentralized principles back to AI. We explore how Near AI is creating a decentralized cloud that leverages confidential computing, secure enclaves, and the blockchain to protect both user data and proprietary model weights. Illia also shares his three-part approach to fostering trust: open model training to eliminate hidden biases and "sleeper agents," verifiability of inference to ensure the model runs as intended, and formal verification at the invocation layer to enforce composable guarantees on AI agent actions. Finally, Illia shares his perspective on the future of open research, the role of tokenized incentive models, and the need for formal verification in building compliance and user trust. The complete show notes for this episode can be found at https://twimlai.com/go/749. | 1h 05m 03s | ||||||
| 9/23/25 | ![]() Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748 | Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newly released frontier vision-language model, beginning with the broader shift from specialized image generators to general-purpose multimodal agents that can use both visual and textual data for a variety of tasks. Oliver explains how Nano Banana can generate and iteratively edit images while maintaining consistency, and how its integration with Gemini’s world knowledge expands creative and practical use cases. We discuss the tension between aesthetics and accuracy, the relative maturity of image models compared to text-based LLMs, and scaling as a driver of progress. Oliver also shares surprising emergent behaviors, the challenges of evaluating vision-language models, and the risks of training on AI-generated data. Finally, we look ahead to interactive world models and VLMs that may one day “think” and “reason” in images. The complete show notes for this episode can be found at https://twimlai.com/go/748. | 1h 03m 39s | ||||||
| 9/16/25 | ![]() Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747 | Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to discuss the limitations of LLMs and how we can build more adaptable and creative models. We dig into her ICML 2025 Outstanding Paper Award winner, “Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction,” which examines why LLMs struggle with generating truly novel ideas. We dig into the "Roll the dice" approach, which encourages structured exploration by injecting randomness at the start of generation, and the "Look before you leap" concept, which trains models to take "leaps of thought" using alternative objectives to create more diverse and structured outputs. We also discuss Aditi’s papers exploring the counterintuitive phenomenon of "catastrophic overtraining," where training models on more data improves benchmark performance but degrades their ability to be fine-tuned for new tasks, and dig into her lab's work on creating more controllable and reliable models, including the concept of "memorization sinks," an architectural approach to isolate and enable the targeted unlearning of specific information. The complete show notes for this episode can be found at https://twimlai.com/go/747. | 58m 26s | ||||||
| 9/9/25 | ![]() Building an Immune System for AI Generated Software with Animesh Koratana - #746 | Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero to discuss his team’s approach to making agentic and AI-assisted coding tools production-ready at scale. Animesh explains how rapid advances in AI-assisted coding have created an “asymmetry” where the speed of code output outpaces the maturity of processes for maintenance and support. We explore PlayerZero’s debugging and code verification platform, which uses code simulations to build a "memory bank" of past bugs and leverages an ensemble of LLMs and agents to proactively simulate and verify changes, predicting potential failures. Animesh also unpacks the underlying technology, including a semantic graph that analyzes code bases, ticketing systems, and telemetry to trace and reason through complex systems, test hypotheses, and apply reinforcement learning techniques to create an “immune system” for software. Finally, Animesh shares his perspective on the future of the software development lifecycle (SDLC), rethinking organizational workflows, and ensuring security as AI-driven tools continue to mature. The complete show notes for this episode can be found at https://twimlai.com/go/746. | 1h 05m 11s | ||||||
Showing 25 of 788
Pitch Fit is a Pro feature
See how bookable this show is for guests, which brands already advertise, the per-episode ad value, and the best-fit guest and sponsor profile. The numbers are blurred on the free plan.
How readily this show books outside guests like you.
How proven this show is for host-read sponsorships.
For Guests
ProFor Advertisers
ProUpgrade to Pro to unlock guest cadence, sponsor categories, fit scores, and per-episode ad value for this show.
Chart Positions
25 placements across 25 markets.
Chart Positions
25 placements across 25 markets.

























