Export Formats
Download your transcripts in any format you need. STT.ai supports six export formats, each optimized for different workflows.
Real-time speech to text. AI auto-corrects as you speak — accuracy improves with longer speech.
Test your microphone firstSign up for free to get 600 minutes/month, or upgrade for unlimited transcriptions.
Supported Export Formats
After transcribing your audio or video, you can download the transcript in any of the following formats. All formats include the full transcript text, and timed formats include word-level or segment-level timestamps.
TXT (Plain Text)
.txtSimple plain text transcript without formatting. Best for copying into documents, emails, or other applications. Includes speaker labels when speaker detection is enabled.
SRT (SubRip Subtitle)
.srtThe most widely supported subtitle format. Includes sequential numbering, timestamps, and text. Compatible with YouTube, Vimeo, VLC, Premiere Pro, Final Cut, and virtually every video player and editor.
VTT (WebVTT)
.vttWeb Video Text Tracks format, the standard for HTML5 video captions. Supports styling, positioning, and metadata. Used by web browsers, streaming platforms, and modern video players.
DOCX (Word Document)
.docxFormatted Word document with proper headings, timestamps, and speaker labels. Ideal for meeting minutes, reports, and documents that need further editing in Microsoft Word or Google Docs.
JSON (Structured Data)
.jsonMachine-readable structured format with word-level timestamps, confidence scores, speaker IDs, and segment data. Perfect for developers building on top of STT.ai or feeding data into other systems.
PDF (Portable Document)
.pdfProfessional formatted PDF with timestamps, speaker labels, and STT.ai branding. Ideal for sharing with clients, archiving records, or printing. Layout is optimized for readability.
Format Comparison
| Feature | TXT | SRT | VTT | DOCX | JSON | |
|---|---|---|---|---|---|---|
| Plain text | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Timestamps | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Speaker labels | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Word-level timing | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ |
| Confidence scores | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ |
| Video player compatible | ✗ | ✓ | ✓ | ✗ | ✗ | ✗ |
| Editable | ✓ | ✓ | ✓ | ✓ | ✓ | ✗ |
| Machine-readable | ✗ | ✗ | ✗ | ✗ | ✓ | ✗ |
Which Format Should You Choose?
Use SRT for maximum compatibility or VTT for web-based video players. SRT works with YouTube, Vimeo, Premiere Pro, Final Cut, and DaVinci Resolve.
Use DOCX for editable documents or PDF for sharing and archiving. Both include formatted timestamps and speaker labels.
Use JSON for the richest data including word-level timestamps, confidence scores, and speaker IDs. Ideal for building custom applications.
Use TXT for a simple plain text transcript you can paste anywhere -- emails, notes, chat, or any text field.
Batch Export
Need to export multiple transcripts at once? STT.ai supports batch export from your transcript library. Select multiple transcripts, choose your format, and download them all in a single ZIP file. Available on all paid plans.
API Export
Developers can retrieve transcripts in any format via the STT.ai API. Simply specify the desired format in your API request and receive the formatted output directly. The JSON format includes the most detailed data including word-level timestamps and confidence scores.
Transcribe and export in any format
Upload audio or video. Choose your export format. Download instantly.
Start Transcribing Free