TWIML AI Podcast
TWIML AI Podcast

Dataflow Computing for AI Inference with Kunle Olukotun - #751

Oct 14, 2025 · 57m

In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at Sambanova Systems, to discuss reconfigurable dataflow architectures for AI inference. Kunle explains the core idea of building computers that are dynamically configured to match the dataflow graph of an AI model, moving beyond the traditional instruction-fetch paradigm of CPUs and GPUs. We explore how this architecture is well-suited for LLM inference, reducing memory bandwidth bottlenecks and …

Ta epizoda še ni bila prepisana.

Uporabite STT.ai za transkripcijo te epizode z AI. Dobite natančno besedilo z odkrivanjem zvočnika, časovne oznake in izvoz v več formatih.

Odkrivanje zvočnika Časovne oznake na ravni besede Izvoz kot SRT, TXT, JSON

Več epizod