Practical AI
Practical AI

Technical advances in document understanding

Dec 02, 2025 · 49m

<p>Chris and Daniel unpack how AI-driven document processing has rapidly evolved well beyond traditional OCR with many technical advances that fly under the radar. They explore the progression from document structure models to language-vision models, all the way to the newest innovations like Deepseek-OCR. The discussion highlights the pros and cons of these various approaches focusing on practical implementation and usage.</p><p>Featuring:</p><ul><li>Chris Benson – <a href="https://chrisbenson.com/">Website</a>, <a href="https://www.linkedin.com/in/chrisbenson">LinkedIn</a>, <a href="https://bsky.app/profile/chrisbenson.bsky.social">Bluesky</a>, <a href="https://github.com/chrisbenson">GitHub</a>, <a href="https://x.com/chrisbenson">X</a></li><li>Daniel Whitenack – <a href="https://www.datadan.io/">Website</a>, <a href="https://github.com/dwhitena">GitHub</a>, <a …

이 에피소드는 아직 녹음되지 않았습니다

STT.ai을 사용하여 AI로 이 에피소드를 기록합니다. 발음기 감지, 타임스탬프, 다양한 형식으로 내보내기를 통해 정확한 텍스트를 얻으십시오.

스피커 감지 단어 수준 시간 스탬프 SRT, TXT, JSON으로 내보내기

더 많은 에피소드