TWIML AI Podcast

The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764

Mar 26, 2026 · 1h 3m

Today, we're joined by Stefano Ermon, associate professor at Stanford University and CEO of Inception Labs, to discuss diffusion language models. We dig into how diffusion approaches (traditionally used for images) are being adapted for text and code generation, the technical challenges of applying continuous methods to discrete token spaces, and how diffusion models compare to traditional autoregressive LLMs. Stefano introduces Mercury 2, a commercial-scale diffusion LLM that can generate multiple tokens simultaneously and achieve inference speeds 5-10x faster than small frontier …

More episodes

Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi …

Mar 10, 2026 · 1h 16m

AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka …

Feb 26, 2026 · 1h 18m

The Evolution of Reasoning in Small Language Models with Yejin Choi - …

Intelligent Robots in 2026: Are We There Yet? with Nikita Rudin - …

Rethinking Pre-Training for Agentic AI with Aakanksha Chowdhery - #759

Why Vision Language Models Ignore What They See with Munawar Hayat - …

Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757

Proactive Agents for the Web with Devi Parikh - #756

AI Orchestration for Smart Cities and the Enterprise with Robin Braun and …

Building an AI Mathematician with Carina Hong - #754