Practical AI

Technical advances in document understanding

Dec 02, 2025 · 49m

<p>Chris and Daniel unpack how AI-driven document processing has rapidly evolved well beyond traditional OCR with many technical advances that fly under the radar. They explore the progression from document structure models to language-vision models, all the way to the newest innovations like Deepseek-OCR. The discussion highlights the pros and cons of these various approaches focusing on practical implementation and usage.</p><p>Featuring:</p><ul><li>Chris Benson – <a href="https://chrisbenson.com/">Website</a>, <a href="https://www.linkedin.com/in/chrisbenson">LinkedIn</a>, <a href="https://bsky.app/profile/chrisbenson.bsky.social">Bluesky</a>, <a href="https://github.com/chrisbenson">GitHub</a>, <a href="https://x.com/chrisbenson">X</a></li><li>Daniel Whitenack – <a href="https://www.datadan.io/">Website</a>, <a href="https://github.com/dwhitena">GitHub</a>, <a …

Tập này chưa được chuyển thể.

Dùng STT.ai để phiên âm tập phim này với AI. Lấy văn bản chính xác với phát hiện người nói, dấu thời gian, và xuất vào nhiều định dạng.

Bản dịch tập này

Kiểm tra loa Thời gian cấp từ Xuất dạng SRT, TXT, JSON

Nhiều tập hơn

Technical advances in document understanding

Tập này chưa được chuyển thể.

Nhiều tập hơn

Agentic Coding and the Economics of Open Source

AI at the Edge is a different operating environment

Humility in the Age of Agentic Coding

AI policy and the battle for computing power

Cognitive Synthesis and Neural Athletes

AI incidents, audits, and the limits of benchmarks

Inside an AI-Run Company

How is AI shaping democracy?

Controlling AI Models from the Inside

2025 was the year of agents, what's coming in 2026?