如何运作
右点击
右键点击任何网页上的任何音频或视频元素。 从上下文菜单中选择“ 使用 STT.ai 进行记录 ” 。
AI 树丛书
STT.ai 以预先装入的媒体 URL 打开。 从 10+ AI 模型和 100+ 语言中选择 。
获得轨迹
以时间戳和语音探测方式获取您的笔录。 导出为 TXT、 SRT、 VTT、 DOCX 或 PDF 。
特征特征
YouTube 整合
在每个YouTube视频的 Share 按钮旁边添加一个“ 设置” 按钮 。
媒体探测
自动检测任何网页上的音频和视频元素,并添加浮动定线按钮。
链接传输
右键点击媒体文件(MP3、MP4、WAV等)的任何链接,直接进行抄录。
快速弹出
将任何 URL 粘贴到弹出中, 以便立即将其剪贴。 最近的抄录被保存为快速存取 。
100+语文
以100+语言中任何一种语言记录音频,由STT.ai人支持。
隐私第一
没有收集浏览数据。 扩展仅当您单击它时才激活。 开源 。
安装扩展名
STT.ai 铬扩展可在 Chrome Web Store 上查阅。 安装它几秒钟后, 开始在网上抄录任何音频或视频 。
手动安装( 开发者模式)
跳转到chrome://extensions, 启用开发者模式, 单击“ load undroaded”, 并选择扩展文件夹 。
支助平台
扩展功能与任何具有音频或视频内容的网页一起工作。
YouTube
Vimeo
SoundCloud
Spotify
Twitch
TikTok
Dailymotion
Facebook
Instagram
Twitter / X
Reddit
Rumble
Google Drive
Dropbox
All supported platforms →
常见问题
chrome extension runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.
Yes — every visitor gets 600 free minutes/month on STT.ai, usable for chrome extension the same as any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.
chrome extension runs on the same AI models as the rest of STT.ai — our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.
chrome extension can run on any of STT.ai's 10+ models — STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported langs), Whisper Turbo (fast), Moonshine (lightweight), and more.
Yes. Every transcript exports as SRT or VTT — works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.
Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.
Most chrome extension jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.
chrome extension accepts 20+ formats — MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, and more. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.
Yes. Audio files submitted to chrome extension are processed and deleted by default. Pro plans add client-side encryption — even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.
Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor — all usable for chrome extension workflows. Free API tier includes 100 minutes/month.
Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.
Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.
STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.