Activetechnology

Weaviate Podcast

by Weaviate

Is this your podcast?

Join Connor Shorten as he interviews machine learning experts and explores Weaviate use cases from users and customers.

Insights from recent episode analysis

Audience Interest

Podcast Focus

Categories: technology

Publishing Consistency

Frequency: ~3-4 / Week

100+ episodes since 2021

Platform Reach

Insights are generated by CastFox AI using publicly available data, episode content, and proprietary models.

Most discussed topics

Brands & references

Generic platforms filtered out.

Low Confidence

Est. Listeners

N/A

Insufficient chart data. Estimates will improve as the show charts.

Per-Episode Audience
Est. listeners per new episode within ~30 days
N/A
🎙 ~2x weekly·136 episodes·Last published 3w ago
Monthly Reach
Unique listeners across all episodes (30 days)
N/A
Active Followers
Loyal subscribers who consistently listen
N/A

Market Insights

This ShowCategory Avg

No category insights available.

📡

Platform Distribution

Reach across major podcast platforms, updated hourly

Total Followers

—

Total Plays

—

Total Reviews

—

YouTube

Subscribers

—

Views

—

Videos

—

Castbox

Followers

—

Plays

—

Reviews

—

Podcast App

Followers

—

Plays

—

Reviews

—

Podcast Republic

Followers

—

Plays

—

Reviews

—

TuneIn

Followers

—

Plays

—

Reviews

—

* Data sourced directly from platform APIs and aggregated hourly across all major podcast directories.

On the show

From 13 eps

Hosts

Weaviate

8 eps

Bob van Luijt

1 ep

Recent guests

16 across last 13 eps

Zijian Chen

1 ep

Xueguang Ma

1 ep

Shreya Shankar

1 ep

Amélie Chatelain

1 ep

Antoine Chaffin

1 ep

Doug Turnbull

1 ep

Trey Grainger

1 ep

Thomas van Dongen

1 ep

Matthew Russo

1 ep

Xiaoqiang Lin

1 ep

Saurabh Mishra

1 ep

Charles Pierse

1 ep

Lakshya A. Agrawal

1 ep

Dr. Nandan Thakur

1 ep

Başak Eskili

1 ep

Dr. Bradley Allen

1 ep

Recent episodes

Knowledge Engineering with Bradley Allen - Weaviate Podcast #139!

Jun 1, 2026

1h 00m 39s

Booking.com and Weaviate with Başak Eskili - Weaviate Podcast #138!

May 18, 2026

41m 40s

Search Agents with Nandan Thakur - Weaviate Podcast #137!

May 5, 2026

1h 01m 16s

AgentIR with Zijian Chen and Xueguang Ma - Weaviate Podcast #136!

Apr 27, 2026

1h 03m 16s

Data Agents with Shreya Shankar - Weaviate Podcast #135!

Apr 6, 2026

57m 23s

🔗

Social Links & Contact

Official channels & resources

🌐

Official Website

📡

RSS Feed

Episodes

140

monthly

Avg length

59m 10s

41m 40s – 1h 21m 13s

Range

Apr 2023 – Apr 2026

Topics

natural language processing, agentic search +72

Guests

Dr. Bradley Allen +15 · last 13 eps

25 of 25

Date	Episode	Topics	Guests	Brands	Places	Keywords	Sponsor	Length
6/1/26	Knowledge Engineering with Bradley Allen - Weaviate Podcast #139!✨	knowledge engineeringneurosymbolic AI+4	Dr. Bradley Allen	Weaviate	—	AI historyknowledge graphs+4	—	1h 00m 39s
5/18/26	Booking.com and Weaviate with Başak Eskili - Weaviate Podcast #138!✨	vector searchsemantic retrieval+4	Başak Eskili	Booking.comWeaviate+3	—	vector searchsemantic retrieval+5	—	41m 40s
5/5/26	Search Agents with Nandan Thakur - Weaviate Podcast #137!✨	neural retrievalagentic search+4	Dr. Nandan Thakur	OrbitDeepSeek+3	—	search agentsneural retrieval+6	—	1h 01m 16s
4/27/26	AgentIR with Zijian Chen and Xueguang Ma - Weaviate Podcast #136!✨	AI agentsretrieval systems+4	Zijian ChenXueguang Ma	ChatGPTUniversity of Waterloo+2	—	retrieval algorithmsreasoning traces+3	—	1h 03m 16s
4/6/26	Data Agents with Shreya Shankar - Weaviate Podcast #135!✨	data agentsData Agent Benchmark+4	Shreya Shankar	UC Berkeley	—	data agentsbenchmark+5	—	57m 23s
3/23/26	Multi-Vector Search with Amélie Chatelain and Antoine Chaffin - Weaviate Podcast #134!✨	Multi-Vector SearchLate Interaction+5	Amélie ChatelainAntoine Chaffin	ColGrepMaxSim+5	—	Multi-Vector representationsLate Interaction+5	—	1h 21m 13s
3/1/26	AI-Powered Search with Doug Turnbull and Trey Grainger [#133]✨	AI-Powered Searchquery understanding+4	Doug TurnbullTrey Grainger	WeaviateGoogle+1	—	AIsearch experience+4	—	53m 20s
12/8/25	Pyversity with Thomas van Dongen - Weaviate Podcast #132!✨	AI engineeringdiversification+3	Thomas van Dongen	PyversitySpringer Nature	—	Pyversitydiversification strategies+5	—	1h 00m 31s
11/18/25	Semantic Query Engines with Matthew Russo - Weaviate Podcast #131!✨	AIDatabase Systems+4	Matthew Russo	MITWeaviate	—	Semantic Query EnginesAI_WHERE+3	—	1h 02m 25s
11/3/25	REFRAG with Xiaoqiang Lin - Weaviate Podcast #130!✨	RAG-based DecodingLLM inference+3	Xiaoqiang Lin	REFRAGNational University of Singapore+1	—	REFRAGRAG systems+6	—	1h 00m 00s
Want analysis for the episodes below?Free for Pro Submit a request, we'll have your selected episodes analyzed within an hour. Free, at no cost to you, for Pro users.
10/13/25	Weaviate and SAS with Saurabh Mishra and Bob van Luijt - Weaviate Podcast #129!✨	WeaviateSAS+4	Saurabh Mishra	SAS Retrieval Agent ManagerWeaviate+1	—	WeaviateSAS+5	—	43m 55s
9/22/25	Weaviate's Query Agent with Charles Pierse - Weaviate Podcast #128!✨	Weaviate Query Agentproduct design+5	Charles Pierse	Weaviate Query AgentWeaviate+1	—	WeaviateQuery Agent+7	—	1h 01m 32s
8/13/25	GEPA with Lakshya A. Agrawal - Weaviate Podcast #127!✨	GEPALarge Language Models+5	Lakshya A. Agrawal	GEPADSPy+2	—	GEPALarge Language Models+6	—	1h 01m 55s
7/9/25	Agentic Topic Modeling with Maarten Grootendorst - Weaviate Podcast #126!	Maarten Grootendorst is a psychologist turned AI engineer who has created BERTopic and authored "Hands-On Large Language Models" with Jay Alammar. The rise of LLMs and Agents are transforming many areas of software! This podcast dives deep into their impact on Topic Modeling! Maarten designed BERTopic from the start with modularity in mind -- letting you ablate embedding models, dimensionality reduction, clustering algorithms, and more. This early insight to prioritize modularity makes BERTopic incredibly well structured to become more "Agentic". An "Agentic" Topic Modeling algorithm can use LLMs to generate topics or topic descriptions, as well as contrast them with other topics. It can decide which topics to subdivide, and it can integrate human feedback and evaluate topics in novel ways... I hope you find the podcast interesting!						—
7/2/25	Sufficient Context with Hailey Joren - Weaviate Podcast #125!	Hailey Joren is a Ph.D. student at UCSD! Hailey and collaborators at Duke University and Google have recently published Sufficient Context: A New Lens on Retrieval Augmented Generation Systems in ICLR 2025! There are so many interesting nuggets to this work! Firstly, it really helped me understand the difference between relevant search results and sufficient context for answering the question. Armed with this lens of looking at retrieved context, Hailey and collaborators make all sorts of interesting observations about the current state of Hallucination. RAG unfortunately makes the models far less likely to hallucinate, and the existing RAG benchmarks unfortunately do not emphasize retrieval adaptation well enough -- indicated by LLMs outputting correct answers despite insufficient context 35-62% of the time! However, reason for optimism! Hailey and team develop an autorater that can detect insufficient context 93% of the time! There are all sorts of interesting ideas around this paper! I really hope you find the podcast useful!						—
6/25/25	RAG Benchmarks with Nandan Thakur - Weaviate Podcast #124!	Nandan Thakur is a Ph.D. student at the University of Waterloo! Nandan has worked on many of the most impactful works in Retrieval-Augmented Generation (RAG) and Information Retrieval. His work ranges from benchmarks such as BEIR, MIRACLE, TREC, and FreshStack, to improving the training of embedding models and re-rankings, and more!						—
5/28/25	MUVERA with Rajesh Jayaram and Roberto Esposito - Weaviate Podcast #123!	Multi-vector retrieval offers richer, more nuanced search, but often comes with a significant cost in storage and computational overhead. How can we harness the power of multi-vector representations without breaking the bank? Rajesh Jayaram, the first author of the groundbreaking MUVERA algorithm from Google, and Roberto Esposito from Weaviate, who spearheaded its implementation, reveal how MUVERA tackles this critical challenge.Dive deep into MUVERA, a novel compression technique specifically designed for multi-vector retrieval. Rajesh and Roberto explain how it leverages contextualized token embeddings and innovative fixed dimensional encodings to dramatically reduce storage requirements while maintaining high retrieval accuracy. Discover the intricacies of quantization within MUVERA, the interpretability benefits of this approach, and how LSH clustering can play a role in topic modeling with these compressed representations.This conversation explores the core mechanics of efficient multi-vector retrieval, the challenges of benchmarking these advanced systems, and the evolving landscape of vector database schemas designed to handle such complex data. Rajesh and Roberto also share their insights on the future directions in artificial intelligence where efficient, high-dimensional data representation is paramount.Whether you're an AI researcher grappling with the scalability of vector search, an engineer building advanced retrieval systems, or fascinated by the cutting edge of information retrieval and AI frameworks, this episode delivers unparalleled insights directly from the source. You'll gain a fundamental understanding of MUVERA, practical considerations for its application in making multi-vector retrieval feasible, and a clear view of future directions in AI.						—
5/15/25	Patronus AI with Anand Kannappan - Weaviate Podcast #122!	AI agents are getting more complex and harder to debug. How do you know what's happening when your agent makes 20+ function calls? What if you have a Multi-Agent System orchestrating several Agents? Anand Kannappan, co-founder of Patronus AI, reveals how their groundbreaking tool Percival transforms agent debugging and evaluation. Percival can instantly analyze complex agent traces, it pinpoints failures across 60 different modes, and it automatically suggests prompt fixes to improve performance. Anand unpacks several of these common failure modes. This includes the critical challenges of "context explosion" where agents process millions of tokens. He also explains domain adaptation for specific use cases, and the complex challenge of multi-agent orchestration. The paradigm of AI Evals is shifting from static evaluation to dynamic oversight! Also learn how Percival's memory architecture leverages both episodic and semantic knowledge with Weaviate!This conversation explores powerful concepts like process vs. outcome rewards and LLM-as-judge approaches. Anand shares his vision for "agentic supervision" where equally capable AI systems provide oversight for complex agent workflows. Whether you're building AI agents, evaluating LLM systems, or interested in how debugging autonomous systems will evolve, this episode delivers concrete techniques. You'll gain philosophical insights on evaluation and a roadmap for how evaluation must transform to keep pace with increasingly autonomous AI systems.						—
5/12/25	Haize Labs with Leonard Tang - Weaviate Podcast #121!	How do you ensure your AI systems actually do what you expect them to do? Leonard Tang takes us deep into the revolutionary world of AI evaluation with concrete techniques you can apply today. Learn how Haize Labs is transforming AI testing through "scaling judge-time compute" - stacking weaker models to effectively evaluate stronger ones. Leonard unpacks the game-changing Verdict library that outperforms frontier models by 10-20% while dramatically reducing costs. Discover practical insights on creating contrastive evaluation sets that extract maximum signal from human feedback, implementing debate-based judging systems, and building custom reward models that align with enterprise needs. The conversation reveals powerful nuggets like using randomized agent debates to achieve consensus and lightweight guardrail models that run alongside inference. Whether you're developing AI applications or simply fascinated by how we'll ensure increasingly powerful AI systems perform as expected, this episode delivers immediate value with techniques you can implement right away, philosophical perspectives on AI safety, and a glimpse into the future of evaluation that will fundamentally shape how AI evolves.						—
5/7/25	Box AI with Ben Kus and Bob van Luijt	Ben walks us through Box's three-layer infrastructure puzzle: First, the mind-boggling base infrastructure (think millions of interactions per second and trillions of files). Second, their unique multi-tenant security challenge - unlike most SaaS platforms, Box users share content across company boundaries, making traditional tenant isolation impossible. And third, ensuring AI respects all these complex permissions while still delivering value. The podcast then dives further into how vector embeddings can balloon file sizes - a few hundred bytes of text can require 4-6KB of vector data storage! We also dig into why RAG remains essential despite growing context windows, and how Box is developing AI agents that transform painful enterprise processes like RFP responses.						—
4/9/25	Structured Outputs with Will Kurt and Cameron Pfiffer - Weaviate Podcast #119!	Hey everyone! Thanks so much for watching another episode of the Weaviate Podcast! Dive into the fascinating world of structured outputs with Will Kurt and Cameron Pfeiffer, the brilliant minds behind Outlines, the revolutionary open-source library from .txt.ai that's changing how we interact with LLMs. In this episode, we explore how constrained decoding enables predictable, reliable outputs from language models—unlocking everything from perfect JSON generation to guided reasoning processes.Will and Cameron share their journey to founding .txt.ai, explain the technical magic behind Outlines (hint: it involves finite state machines!), and debunk misconceptions around structured generation performance. You'll discover practical applications like knowledge graph construction, metadata extraction, and report generation that simply weren't possible before this technology.Whether you're building AI systems or curious about where the field is heading, you'll gain valuable insights on how structured outputs integrate with inference engines like vLLM, why multi-task inference outperforms single-task approaches, and how this technology enables scalable agent systems that could transform software architecture forever. Join us for this mind-expanding conversation about one of AI's most important but under appreciated innovations—and discover why the future might belong to systems that combine freedom with structure.						—
3/25/25	Synthetic Data with David Berenstein and Ben Burtenshaw - Weaviate Podcast #118!	Synthetic Data: The Building Bocks of AI's Future! Hey everyone! I am SUPER EXCITED to publish the 118th episode of the Weaviate Podcast featuring David Berenstein and Ben Burtenshaw from HuggingFace! This podcast explores the intricacies of synthetic data generation, detailing methodologies such as data augmentation, distillation, and instruction refinement. The conversation delves into persona-driven synthetic data, highlighting applications like Persona Hub, and discusses algorithms to enhance diversity, complexity, and quality of generated data. Additionally, they cover integration with Hugging Face’s ecosystem, including Argilla for annotation, AutoTrain for fine-tuning, and advanced data exploration tools like the Data Studio and SQL console. The podcast also touches upon the potential for synthetic image data generation and the exciting future of AI education and accessibility.						—
3/3/25	Letta AI with Sarah Wooders - Weaviate Podcast #117!	Hey everyone! Thank you so much for watching the 117th episode of the Weaviate podcast! In this episode, we dive deep into the cutting edge of AI agent development with Sarah Wooders, co-founder and CTO of Letta AI. Emerging from Berkeley's Sky Computing Lab, Sarah and her team have pioneered a revolutionary approach to stateful agents - AI systems that genuinely remember both you and themselves across extended conversations. The conversation explores how the groundbreaking MemGPT project evolved into Letta's comprehensive Agent Development Environment (ADE), which empowers developers to build truly persistent AI experiences. Sarah shares powerful insights on context management, memory prioritization, and the critical role of databases in agent architecture. Whether you're building AI systems or simply curious about where conversational AI is heading, this episode illuminates how the future of agents depends not just on their reasoning capabilities, but on their ability to maintain coherent identity and memory over time.						—
2/27/25	Agent Experience with Matt Biilmann, Sebastian Witalec, and Charles Pierse - Weaviate Podcast #116!	Hey everyone! Thank you so much for watching another episode of the Weaviate Podcast! I am SUPER excited to welcome Matt Biilmann, Co-Founder and CEO of Netlify, as well as Sebastian Witalec and Charles Pierse from Weaviate to discuss Agent Experience! You have probably heard about how you can connect LLMs to external software tools. This supercharges the capabilities of AI systems and what they can do. So what does that mean for you as a software developer?This podcast explores different ideas around designing software user experiences for Agents as well as Humans. How do we write documentation for Agents differently than Humans? How do we design REST or gRPC APIs, or programming languages clients, for Agents differently than Humans? llms.txt, JSON tool definitions, agents to agent, breaking changes, … there were so many interesting topics explored in this podcast! I really hope you find it useful! As always more than happy to discuss these ideas further with you!						—
2/19/25	Optimizing Retrieval Agents with Shirley Wu - Weaviate Podcast #115!	Hey everyone! Thank you so much for watching the 115th episode of the Weaviate Podcast featuring Shirley Wu from Stanford University!We explore the innovative Avatar Optimizer—a novel framework that leverages contrastive reasoning to refine LLM agent prompts for optimal tool usage. Shirley explains how this self-improving system evolves through iterative feedback by contrasting positive and negative examples, enabling agents to handle complex tasks more effectively.We also dive into the STaRK Benchmark, a comprehensive testbed designed to evaluate retrieval systems on semi-structured knowledge bases. The discussion highlights the challenges of unifying textual and relational retrieval, exploring concepts such as multi-vector embeddings, relational graphs, and dynamic data modeling. Learn how these approaches help overcome information loss, enhance precision, and enable scalable, context-aware retrieval in diverse domains—from product recommendations to precision medicine.Whether you’re interested in advanced prompt optimization, multi-agent system design, or the future of human-centered language models, this episode offers a wealth of insights and a forward-looking perspective on integrating sophisticated AI techniques into real-world applications.						—

Showing 25 of 140

Explore More on CastFox

Podcast Charts Browse Categories Best Podcasts PodcastGPT Search Podcasts

Weaviate Podcast

Insights from recent episode analysis

Audience Interest

Podcast Focus

Publishing Consistency

Platform Reach

Most discussed topics

Brands & references

Market Insights

Platform Distribution

On the show

Hosts

Recent guests

Recent episodes

Social Links & Contact

Pitch Fit is a Pro feature

For Guests

For Advertisers

Explore More on CastFox