TWIML AI Podcast
The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764
Mar 26, 2026
· 1h 3m
Today, we're joined by Stefano Ermon, associate professor at Stanford University and CEO of Inception Labs to discuss diffusion language models. We dig into how diffusion approaches—traditionally used for images—are being adapted for text and code generation, the technical challenges of applying continuous methods to discrete token spaces, and how diffusion models compare to traditional autoregressive LLMs. Stefano introduces Mercury 2, a commercial-scale diffusion LLM that can generate multiple tokens simultaneously and achieve inference speeds 5-10x faster than small frontier …
이 에피소드는 아직 녹음되지 않았습니다
STT.ai을 사용하여 AI로 이 에피소드를 기록합니다. 발음기 감지, 타임스탬프, 다양한 형식으로 내보내기를 통해 정확한 텍스트를 얻으십시오.
스피커 감지
단어 수준 시간 스탬프
SRT, TXT, JSON으로 내보내기