TWIML AI Podcast

Why Vision Language Models Ignore What They See with Munawar Hayat - #758

Dec 09, 2025 · 57m

In this episode, we’re joined by Munawar Hayat, researcher at Qualcomm AI Research, to discuss a series of papers presented at NeurIPS 2025 focusing on multimodal and generative AI. We dive into the persistent challenge of object hallucination in Vision-Language Models (VLMs), why models often discard visual information in favor of pre-trained language priors, and how his team used attention-guided alignment to enforce better visual grounding. We also explore a novel approach to generalized contrastive learning designed to solve complex, …

Aquest episodi encara no ha estat transcrit

Use STT.ai to transcribe this episode with AI. Get accurate text with speaker detection, timestamps, and export in multiple formats.

Trancricte Aquest episodi

Detecció del ponent Marca horària de nivell de paraula Exporta com SRT, TXT, JSON

Més capítols

The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764

Mar 26, 2026 · 1h 3m

Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi …

Mar 10, 2026 · 1h 16m

AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka …

Feb 26, 2026 · 1h 18m

Why Vision Language Models Ignore What They See with Munawar Hayat - #758

Aquest episodi encara no ha estat transcrit

Més capítols

The Race to Production-Grade Diffusion LLMs with Stefano Ermon - #764

Agent Swarms and Knowledge Graphs for Autonomous Software Development with Siddhant Pardeshi …

AI Trends 2026: OpenClaw Agents, Reasoning LLMs, and More with Sebastian Raschka …

The Evolution of Reasoning in Small Language Models with Yejin Choi - …

Intelligent Robots in 2026: Are We There Yet? with Nikita Rudin - …

Rethinking Pre-Training for Agentic AI with Aakanksha Chowdhery - #759

Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757

Proactive Agents for the Web with Devi Parikh - #756

AI Orchestration for Smart Cities and the Enterprise with Robin Braun and …

Building an AI Mathematician with Carina Hong - #754