Report Bug / Feature Request

Free Live Transcription Online

Convert live transcription with AI-powered transcription. Speak into your microphone and see your words appear as text in real-time. 100+ languages, 10+ models, 98%+ accuracy.

Works with publicly available audio & video. DRM-protected content is not supported.

Upgrade for Enhanced

Private transcript

Chat with transcript

Unlock with Pro →

Drop file here or click to browse

MP3, WAV, M4A, FLAC, MP4, MKV, MOV, WebM — up to 2GB

Batch upload multiple files with Pro

Upgrade for Enhanced

Private transcript

Chat with transcript

Unlock with Pro →

Upgrade for Enhanced

Real-time speech to text. AI auto-corrects as you speak — accuracy improves with longer speech.

Test your microphone first

10 free min/day 600 min free with signup No credit card Encrypted

1. Click Record

Click the mic button and start speaking. Your words appear instantly.

2. AI Transcribes Live

Vosk provides instant words. Whisper auto-corrects for accuracy as you speak.

3. Enhance & Share

Enhance with full AI transcription. Download, share, or save to your account.

Also Transcribe Pre-Recorded Files

MP3 WAV M4A FLAC OGG MP4 MKV MOV WebM AVI

Live Transcription Models

Choose the AI model that fits your needs — or let us pick the best one.

Live Transcription in 100+ Languages

English Spanish French German Japanese Arabic Hindi Portuguese Russian Korean All languages →

Live Transcription Use Cases

Ready to try live transcription?

Start Free →

Frequently Asked Questions

Live transcription converts speech to text in real time as you talk, instead of after a recording finishes. STT.ai streams the words to your screen within a second or two of being spoken.

Click the microphone, allow mic access when your browser prompts you, and start speaking — captions appear live. To caption a meeting or video playing on your computer, share system audio instead of the mic.

Typically one to two seconds between speech and text. Latency depends on your network and current GPU load; a stable connection keeps captions flowing smoothly without large gaps.

It works in current Chrome, Edge, Firefox, and Safari on desktop and mobile, using the standard microphone and WebSocket APIs. No plugin or download is required; just grant microphone permission.

Yes. STT.ai includes 600 free minutes per month of live transcription. Paid plans starting at $5/month add longer sessions, private transcripts, and priority streaming.

Live transcription reaches 90-95% on clear speech — slightly below batch transcription because the model commits to words in real time rather than reviewing the whole recording. A good microphone and a quiet room make the biggest difference.

Yes. Point live transcription at the event audio (mic or system audio) and display the captions on screen for accessibility. You can also save the full transcript when the session ends.

Yes. 100+ languages are supported. Set the language before you start for the most reliable real-time results, since auto-detection needs a moment of audio to lock onto the language.

Yes. When you stop, the live session is saved as a full transcript you can edit, rename speakers in, and export to TXT, DOCX, PDF, SRT, or VTT.

Yes. Speaker diarization labels voices during the session, and you can rename them to real names in the saved transcript afterwards.

Yes. Streamed audio is processed in real time and not retained beyond producing the transcript, which is deleted by default. Pro plans add client-side encryption for the saved transcript.

Lag and dropped words usually come from an unstable network or talking far from the mic. A wired or strong Wi-Fi connection and a closer microphone keep real-time captions accurate and on time.

Free Live Transcription Online

1. Click Record

2. AI Transcribes Live

3. Enhance & Share

Also Transcribe Pre-Recorded Files

Live Transcription Models

Live Transcription in 100+ Languages

Live Transcription Use Cases

Ready to try live transcription?

Frequently Asked Questions

What is live transcription?

How do I start live transcription?

How much delay is there in real-time transcription?

Which browsers support live transcription?

Is live transcription free?

How accurate is live transcription?

Can I caption a live event or webinar?

Does live transcription support multiple languages?

Can I save and edit a live transcript afterwards?

Does live transcription detect different speakers?

Is live transcription private?

Why do my live captions lag or drop words?