
Insights from recent episode analysis
Audience Interest
Podcast Focus
Publishing Consistency
Platform Reach
Insights are generated by CastFox AI using publicly available data, episode content, and proprietary models.
Most discussed topics
Brands & references
Est. Listeners
Insufficient chart data. Estimates will improve as the show charts.
- Per-Episode Audience
Est. listeners per new episode within ~30 days
N/A🎙 ~2x weekly·136 episodes·Last published 3w ago - Monthly Reach
Unique listeners across all episodes (30 days)
N/A - Active Followers
Loyal subscribers who consistently listen
N/A
Market Insights
Platform Distribution
Reach across major podcast platforms, updated hourly
Total Followers
—
Total Plays
—
Total Reviews
—
* Data sourced directly from platform APIs and aggregated hourly across all major podcast directories.
On the show
From 13 epsHosts
Recent guests
Recent episodes
Knowledge Engineering with Bradley Allen - Weaviate Podcast #139!
Jun 1, 2026
1h 00m 39s
Booking.com and Weaviate with Başak Eskili - Weaviate Podcast #138!
May 18, 2026
41m 40s
Search Agents with Nandan Thakur - Weaviate Podcast #137!
May 5, 2026
1h 01m 16s
AgentIR with Zijian Chen and Xueguang Ma - Weaviate Podcast #136!
Apr 27, 2026
1h 03m 16s
Data Agents with Shreya Shankar - Weaviate Podcast #135!
Apr 6, 2026
57m 23s
Social Links & Contact
Official channels & resources
Official Website
Login
RSS Feed
Login
| Date | Episode | Topics | Guests | Brands | Places | Keywords | Sponsor | Length | |
|---|---|---|---|---|---|---|---|---|---|
| 6/1/26 | ![]() Knowledge Engineering with Bradley Allen - Weaviate Podcast #139!✨ | knowledge engineeringneurosymbolic AI+4 | Dr. Bradley Allen | Weaviate | — | AI historyknowledge graphs+4 | — | 1h 00m 39s | |
| 5/18/26 | ![]() Booking.com and Weaviate with Başak Eskili - Weaviate Podcast #138!✨ | vector searchsemantic retrieval+4 | Başak Eskili | Booking.comWeaviate+3 | — | vector searchsemantic retrieval+5 | — | 41m 40s | |
| 5/5/26 | ![]() Search Agents with Nandan Thakur - Weaviate Podcast #137!✨ | neural retrievalagentic search+4 | Dr. Nandan Thakur | OrbitDeepSeek+3 | — | search agentsneural retrieval+6 | — | 1h 01m 16s | |
| 4/27/26 | ![]() AgentIR with Zijian Chen and Xueguang Ma - Weaviate Podcast #136!✨ | AI agentsretrieval systems+4 | Zijian ChenXueguang Ma | ChatGPTUniversity of Waterloo+2 | — | retrieval algorithmsreasoning traces+3 | — | 1h 03m 16s | |
| 4/6/26 | ![]() Data Agents with Shreya Shankar - Weaviate Podcast #135!✨ | data agentsData Agent Benchmark+4 | Shreya Shankar | UC Berkeley | — | data agentsbenchmark+5 | — | 57m 23s | |
| 3/23/26 | ![]() Multi-Vector Search with Amélie Chatelain and Antoine Chaffin - Weaviate Podcast #134!✨ | Multi-Vector SearchLate Interaction+5 | Amélie ChatelainAntoine Chaffin | ColGrepMaxSim+5 | — | Multi-Vector representationsLate Interaction+5 | — | 1h 21m 13s | |
| 3/1/26 | ![]() AI-Powered Search with Doug Turnbull and Trey Grainger [#133]✨ | AI-Powered Searchquery understanding+4 | Doug TurnbullTrey Grainger | WeaviateGoogle+1 | — | AIsearch experience+4 | — | 53m 20s | |
| 12/8/25 | ![]() Pyversity with Thomas van Dongen - Weaviate Podcast #132!✨ | AI engineeringdiversification+3 | Thomas van Dongen | PyversitySpringer Nature | — | Pyversitydiversification strategies+5 | — | 1h 00m 31s | |
| 11/18/25 | ![]() Semantic Query Engines with Matthew Russo - Weaviate Podcast #131!✨ | AIDatabase Systems+4 | Matthew Russo | MITWeaviate | — | Semantic Query EnginesAI_WHERE+3 | — | 1h 02m 25s | |
| 11/3/25 | ![]() REFRAG with Xiaoqiang Lin - Weaviate Podcast #130!✨ | RAG-based DecodingLLM inference+3 | Xiaoqiang Lin | REFRAGNational University of Singapore+1 | — | REFRAGRAG systems+6 | — | 1h 00m 00s | |
Want analysis for the episodes below?Free for Pro Submit a request, we'll have your selected episodes analyzed within an hour. Free, at no cost to you, for Pro users. | |||||||||
| 10/13/25 | ![]() Weaviate and SAS with Saurabh Mishra and Bob van Luijt - Weaviate Podcast #129!✨ | WeaviateSAS+4 | Saurabh Mishra | SAS Retrieval Agent ManagerWeaviate+1 | — | WeaviateSAS+5 | — | 43m 55s | |
| 9/22/25 | ![]() Weaviate's Query Agent with Charles Pierse - Weaviate Podcast #128!✨ | Weaviate Query Agentproduct design+5 | Charles Pierse | Weaviate Query AgentWeaviate+1 | — | WeaviateQuery Agent+7 | — | 1h 01m 32s | |
| 8/13/25 | ![]() GEPA with Lakshya A. Agrawal - Weaviate Podcast #127!✨ | GEPALarge Language Models+5 | Lakshya A. Agrawal | GEPADSPy+2 | — | GEPALarge Language Models+6 | — | 1h 01m 55s | |
| 7/9/25 | ![]() Agentic Topic Modeling with Maarten Grootendorst - Weaviate Podcast #126! | Maarten Grootendorst is a psychologist turned AI engineer who has created BERTopic and authored "Hands-On Large Language Models" with Jay Alammar. The rise of LLMs and Agents are transforming many areas of software! This podcast dives deep into their impact on Topic Modeling! Maarten designed BERTopic from the start with modularity in mind -- letting you ablate embedding models, dimensionality reduction, clustering algorithms, and more. This early insight to prioritize modularity makes BERTopic incredibly well structured to become more "Agentic". An "Agentic" Topic Modeling algorithm can use LLMs to generate topics or topic descriptions, as well as contrast them with other topics. It can decide which topics to subdivide, and it can integrate human feedback and evaluate topics in novel ways... I hope you find the podcast interesting! | — | ||||||
| 7/2/25 | ![]() Sufficient Context with Hailey Joren - Weaviate Podcast #125! | Hailey Joren is a Ph.D. student at UCSD! Hailey and collaborators at Duke University and Google have recently published Sufficient Context: A New Lens on Retrieval Augmented Generation Systems in ICLR 2025! There are so many interesting nuggets to this work! Firstly, it really helped me understand the difference between *relevant* search results and sufficient context for answering the question. Armed with this lens of looking at retrieved context, Hailey and collaborators make all sorts of interesting observations about the current state of Hallucination. RAG unfortunately makes the models far less likely to hallucinate, and the existing RAG benchmarks unfortunately do not emphasize retrieval adaptation well enough -- indicated by LLMs outputting correct answers despite insufficient context 35-62% of the time! However, reason for optimism! Hailey and team develop an autorater that can detect insufficient context 93% of the time! There are all sorts of interesting ideas around this paper! I really hope you find the podcast useful! | — | ||||||
| 6/25/25 | ![]() RAG Benchmarks with Nandan Thakur - Weaviate Podcast #124! | Nandan Thakur is a Ph.D. student at the University of Waterloo! Nandan has worked on many of the most impactful works in Retrieval-Augmented Generation (RAG) and Information Retrieval. His work ranges from benchmarks such as BEIR, MIRACLE, TREC, and FreshStack, to improving the training of embedding models and re-rankings, and more! | — | ||||||
| 5/28/25 | ![]() MUVERA with Rajesh Jayaram and Roberto Esposito - Weaviate Podcast #123! | Multi-vector retrieval offers richer, more nuanced search, but often comes with a significant cost in storage and computational overhead. How can we harness the power of multi-vector representations without breaking the bank? Rajesh Jayaram, the first author of the groundbreaking MUVERA algorithm from Google, and Roberto Esposito from Weaviate, who spearheaded its implementation, reveal how MUVERA tackles this critical challenge.Dive deep into MUVERA, a novel compression technique specifically designed for multi-vector retrieval. Rajesh and Roberto explain how it leverages contextualized token embeddings and innovative fixed dimensional encodings to dramatically reduce storage requirements while maintaining high retrieval accuracy. Discover the intricacies of quantization within MUVERA, the interpretability benefits of this approach, and how LSH clustering can play a role in topic modeling with these compressed representations.This conversation explores the core mechanics of efficient multi-vector retrieval, the challenges of benchmarking these advanced systems, and the evolving landscape of vector database schemas designed to handle such complex data. Rajesh and Roberto also share their insights on the future directions in artificial intelligence where efficient, high-dimensional data representation is paramount.Whether you're an AI researcher grappling with the scalability of vector search, an engineer building advanced retrieval systems, or fascinated by the cutting edge of information retrieval and AI frameworks, this episode delivers unparalleled insights directly from the source. You'll gain a fundamental understanding of MUVERA, practical considerations for its application in making multi-vector retrieval feasible, and a clear view of future directions in AI. | — | ||||||
| 5/15/25 | ![]() Patronus AI with Anand Kannappan - Weaviate Podcast #122! | AI agents are getting more complex and harder to debug. How do you know what's happening when your agent makes 20+ function calls? What if you have a Multi-Agent System orchestrating several Agents? Anand Kannappan, co-founder of Patronus AI, reveals how their groundbreaking tool Percival transforms agent debugging and evaluation. Percival can instantly analyze complex agent traces, it pinpoints failures across 60 different modes, and it automatically suggests prompt fixes to improve performance. Anand unpacks several of these common failure modes. This includes the critical challenges of "context explosion" where agents process millions of tokens. He also explains domain adaptation for specific use cases, and the complex challenge of multi-agent orchestration. The paradigm of AI Evals is shifting from static evaluation to dynamic oversight! Also learn how Percival's memory architecture leverages both episodic and semantic knowledge with Weaviate!This conversation explores powerful concepts like process vs. outcome rewards and LLM-as-judge approaches. Anand shares his vision for "agentic supervision" where equally capable AI systems provide oversight for complex agent workflows. Whether you're building AI agents, evaluating LLM systems, or interested in how debugging autonomous systems will evolve, this episode delivers concrete techniques. You'll gain philosophical insights on evaluation and a roadmap for how evaluation must transform to keep pace with increasingly autonomous AI systems. | — | ||||||
| 5/12/25 | ![]() Haize Labs with Leonard Tang - Weaviate Podcast #121! | How do you ensure your AI systems actually do what you expect them to do? Leonard Tang takes us deep into the revolutionary world of AI evaluation with concrete techniques you can apply today. Learn how Haize Labs is transforming AI testing through "scaling judge-time compute" - stacking weaker models to effectively evaluate stronger ones. Leonard unpacks the game-changing Verdict library that outperforms frontier models by 10-20% while dramatically reducing costs. Discover practical insights on creating contrastive evaluation sets that extract maximum signal from human feedback, implementing debate-based judging systems, and building custom reward models that align with enterprise needs. The conversation reveals powerful nuggets like using randomized agent debates to achieve consensus and lightweight guardrail models that run alongside inference. Whether you're developing AI applications or simply fascinated by how we'll ensure increasingly powerful AI systems perform as expected, this episode delivers immediate value with techniques you can implement right away, philosophical perspectives on AI safety, and a glimpse into the future of evaluation that will fundamentally shape how AI evolves. | — | ||||||
| 5/7/25 | ![]() Box AI with Ben Kus and Bob van Luijt | Ben walks us through Box's three-layer infrastructure puzzle: First, the mind-boggling base infrastructure (think millions of interactions per second and trillions of files). Second, their unique multi-tenant security challenge - unlike most SaaS platforms, Box users share content across company boundaries, making traditional tenant isolation impossible. And third, ensuring AI respects all these complex permissions while still delivering value. The podcast then dives further into how vector embeddings can balloon file sizes - a few hundred bytes of text can require 4-6KB of vector data storage! We also dig into why RAG remains essential despite growing context windows, and how Box is developing AI agents that transform painful enterprise processes like RFP responses. | — | ||||||
| 4/9/25 | ![]() Structured Outputs with Will Kurt and Cameron Pfiffer - Weaviate Podcast #119! | Hey everyone! Thanks so much for watching another episode of the Weaviate Podcast! Dive into the fascinating world of structured outputs with Will Kurt and Cameron Pfeiffer, the brilliant minds behind Outlines, the revolutionary open-source library from .txt.ai that's changing how we interact with LLMs. In this episode, we explore how constrained decoding enables predictable, reliable outputs from language models—unlocking everything from perfect JSON generation to guided reasoning processes.Will and Cameron share their journey to founding .txt.ai, explain the technical magic behind Outlines (hint: it involves finite state machines!), and debunk misconceptions around structured generation performance. You'll discover practical applications like knowledge graph construction, metadata extraction, and report generation that simply weren't possible before this technology.Whether you're building AI systems or curious about where the field is heading, you'll gain valuable insights on how structured outputs integrate with inference engines like vLLM, why multi-task inference outperforms single-task approaches, and how this technology enables scalable agent systems that could transform software architecture forever. Join us for this mind-expanding conversation about one of AI's most important but under appreciated innovations—and discover why the future might belong to systems that combine freedom with structure. | — | ||||||
| 3/25/25 | ![]() Synthetic Data with David Berenstein and Ben Burtenshaw - Weaviate Podcast #118! | Synthetic Data: The Building Bocks of AI's Future! Hey everyone! I am SUPER EXCITED to publish the 118th episode of the Weaviate Podcast featuring David Berenstein and Ben Burtenshaw from HuggingFace! This podcast explores the intricacies of synthetic data generation, detailing methodologies such as data augmentation, distillation, and instruction refinement. The conversation delves into persona-driven synthetic data, highlighting applications like Persona Hub, and discusses algorithms to enhance diversity, complexity, and quality of generated data. Additionally, they cover integration with Hugging Face’s ecosystem, including Argilla for annotation, AutoTrain for fine-tuning, and advanced data exploration tools like the Data Studio and SQL console. The podcast also touches upon the potential for synthetic image data generation and the exciting future of AI education and accessibility. | — | ||||||
| 3/3/25 | ![]() Letta AI with Sarah Wooders - Weaviate Podcast #117! | Hey everyone! Thank you so much for watching the 117th episode of the Weaviate podcast! In this episode, we dive deep into the cutting edge of AI agent development with Sarah Wooders, co-founder and CTO of Letta AI. Emerging from Berkeley's Sky Computing Lab, Sarah and her team have pioneered a revolutionary approach to stateful agents - AI systems that genuinely remember both you and themselves across extended conversations. The conversation explores how the groundbreaking MemGPT project evolved into Letta's comprehensive Agent Development Environment (ADE), which empowers developers to build truly persistent AI experiences. Sarah shares powerful insights on context management, memory prioritization, and the critical role of databases in agent architecture. Whether you're building AI systems or simply curious about where conversational AI is heading, this episode illuminates how the future of agents depends not just on their reasoning capabilities, but on their ability to maintain coherent identity and memory over time. | — | ||||||
| 2/27/25 | ![]() Agent Experience with Matt Biilmann, Sebastian Witalec, and Charles Pierse - Weaviate Podcast #116! | Hey everyone! Thank you so much for watching another episode of the Weaviate Podcast! I am SUPER excited to welcome Matt Biilmann, Co-Founder and CEO of Netlify, as well as Sebastian Witalec and Charles Pierse from Weaviate to discuss Agent Experience! You have probably heard about how you can connect LLMs to external software tools. This supercharges the capabilities of AI systems and what they can do. So what does that mean for you as a software developer?This podcast explores different ideas around designing software user experiences for Agents as well as Humans. How do we write documentation for Agents differently than Humans? How do we design REST or gRPC APIs, or programming languages clients, for Agents differently than Humans? llms.txt, JSON tool definitions, agents to agent, breaking changes, … there were so many interesting topics explored in this podcast! I really hope you find it useful! As always more than happy to discuss these ideas further with you! | — | ||||||
| 2/19/25 | ![]() Optimizing Retrieval Agents with Shirley Wu - Weaviate Podcast #115! | Hey everyone! Thank you so much for watching the 115th episode of the Weaviate Podcast featuring Shirley Wu from Stanford University!We explore the innovative Avatar Optimizer—a novel framework that leverages contrastive reasoning to refine LLM agent prompts for optimal tool usage. Shirley explains how this self-improving system evolves through iterative feedback by contrasting positive and negative examples, enabling agents to handle complex tasks more effectively.We also dive into the STaRK Benchmark, a comprehensive testbed designed to evaluate retrieval systems on semi-structured knowledge bases. The discussion highlights the challenges of unifying textual and relational retrieval, exploring concepts such as multi-vector embeddings, relational graphs, and dynamic data modeling. Learn how these approaches help overcome information loss, enhance precision, and enable scalable, context-aware retrieval in diverse domains—from product recommendations to precision medicine.Whether you’re interested in advanced prompt optimization, multi-agent system design, or the future of human-centered language models, this episode offers a wealth of insights and a forward-looking perspective on integrating sophisticated AI techniques into real-world applications. | — | ||||||
Showing 25 of 140
Pitch Fit is a Pro feature
See how bookable this show is for guests, which brands already advertise, the per-episode ad value, and the best-fit guest and sponsor profile. The numbers are blurred on the free plan.
How readily this show books outside guests like you.
How proven this show is for host-read sponsorships.
For Guests
ProFor Advertisers
ProUpgrade to Pro to unlock guest cadence, sponsor categories, fit scores, and per-episode ad value for this show.
