免费在线实时转录

使用AI驱动的转录进行实时转录。对着麦克风说话,实时看到文字。支持100多种语言、10多种模型,准确率超过98%。

使用公开的音频和视频工作。 DRM 保护的内容不支持 。

增强的升级
Private transcript
与笔录聊天
以 Pro 解锁 →
在此拖放文件或单击以浏览文件
MP3、WAV、M4A、FLAC、MP4、MKV、MOV、WebM-至多2GB
增强的升级
Private transcript
与笔录聊天
以 Pro 解锁 →
增强的升级
录音: 0:00
实时 伏( 即时)
增强 耳语( 准确)
公共链接:24小时,仅文本 · 签名签名 7d+音频 · 职业 用于私人链接的私人链接

文本的实时演讲。 AI 自动校正, 使用较长的演讲, 准确性会提高 。

先测试一下麦克风
❤️ 爱你的STT. AI 告诉你的朋友!
你用的是免费的抄本

免费报名每月获得600分钟,或升级无限制的抄本。

每天10分钟免费 600分钟免费,有注册 无信用卡 已加密
免费签名 →

1. 点击录音

点击麦克风按钮开始说话。您的话语即时显示。

2. AI实时转录

Vosk提供即时文字。Whisper在您说话时自动校正以提高准确性。

3. 增强与分享

使用完整AI转录增强。下载、分享或保存到您的账户。

也可转录预录文件

实时转录使用场景

准备好体验实时转录了吗?

免费开始 →

常见问题

Live transcription converts speech to text in real time as you talk, instead of after a recording finishes. STT.ai streams the words to your screen within a second or two of being spoken.

Click the microphone, allow mic access when your browser prompts you, and start speaking — captions appear live. To caption a meeting or video playing on your computer, share system audio instead of the mic.

Typically one to two seconds between speech and text. Latency depends on your network and current GPU load; a stable connection keeps captions flowing smoothly without large gaps.

It works in current Chrome, Edge, Firefox, and Safari on desktop and mobile, using the standard microphone and WebSocket APIs. No plugin or download is required; just grant microphone permission.

Yes. STT.ai includes 600 free minutes per month of live transcription. Paid plans starting at $5/month add longer sessions, private transcripts, and priority streaming.

Live transcription reaches 90-95% on clear speech — slightly below batch transcription because the model commits to words in real time rather than reviewing the whole recording. A good microphone and a quiet room make the biggest difference.

Yes. Point live transcription at the event audio (mic or system audio) and display the captions on screen for accessibility. You can also save the full transcript when the session ends.

Yes. 100+ languages are supported. Set the language before you start for the most reliable real-time results, since auto-detection needs a moment of audio to lock onto the language.

Yes. When you stop, the live session is saved as a full transcript you can edit, rename speakers in, and export to TXT, DOCX, PDF, SRT, or VTT.

Yes. Speaker diarization labels voices during the session, and you can rename them to real names in the saved transcript afterwards.

Yes. Streamed audio is processed in real time and not retained beyond producing the transcript, which is deleted by default. Pro plans add client-side encryption for the saved transcript.

Lag and dropped words usually come from an unstable network or talking far from the mic. A wired or strong Wi-Fi connection and a closer microphone keep real-time captions accurate and on time.