Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750
Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI, to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques for overcoming them, including windowed attention, grouped query attention, and latent space attention. We explore the idea of weight-state balance and the weight-state FLOP ratio as a way of reasoning about the optimality of compute architectures, and we dig into the Power Retention architecture, which blends the parallelization of attention …
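For listeners who want a concrete picture of one of the techniques mentioned above, here is a minimal sketch of windowed (sliding-window) attention in PyTorch. This is an illustrative example only, not code from the episode or from Manifest AI; the function name, shapes, and window size are hypothetical choices for the sketch.

```python
import torch
import torch.nn.functional as F

def windowed_attention(q, k, v, window: int):
    """Single-head attention where each query attends only to the
    previous `window` positions (inclusive), capping per-token cost
    on long contexts. Shapes: q, k, v are (seq_len, head_dim)."""
    seq_len, head_dim = q.shape
    scores = q @ k.T / head_dim ** 0.5  # (seq_len, seq_len) similarity scores
    pos = torch.arange(seq_len)
    # Causal mask restricted to a sliding window of size `window`.
    allowed = (pos[:, None] >= pos[None, :]) & (pos[:, None] - pos[None, :] < window)
    scores = scores.masked_fill(~allowed, float("-inf"))
    return F.softmax(scores, dim=-1) @ v

# Example: 8-token sequence, 4-dim head, each token sees at most 3 tokens back.
q = k = v = torch.randn(8, 4)
out = windowed_attention(q, k, v, window=3)
print(out.shape)  # torch.Size([8, 4])
```

The point of the window is that attention's quadratic cost in sequence length drops to roughly linear (seq_len × window), which is one of the context-scaling trade-offs discussed in the episode.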