Export Formats

Download your transcripts in any format you need. STT.ai supports six export formats, each optimized for different workflows.

Funziona con contenuti audio e video disponibili pubblicamente. Il contenuto protetto da DRM non è supportato.

Upgrade for Enhanced
Private transcript
Chat avec transcription
Unlock with Pro →
Drop file here or click to browse
MP3, WAV, M4A, FLAC, MP4, MKV, MOV, WebM — up to 2GB
Upgrade for Enhanced
Private transcript
Chat avec transcription
Unlock with Pro →
Upgrade for Enhanced
Recording: 0:00
Real-time Vosk (instant)
Enhanced Whisper (accurate)
Public links: 24h, text only · Sign up for 7d + audio · Pro for private links

Real-time speech to text. AI auto-corrects as you speak — accuracy improves with longer speech.

Test your microphone first
❤️ Love STT.ai? Tell your friends!
You've used your free transcriptions

Sign up for free to get 600 minutes/month, or upgrade for unlimited transcriptions.

10 free min/day 600 min free with signup No credit card Encrypted
Sign up free →

Supported Export Formats

After transcribing your audio or video, you can download the transcript in any of the following formats. All formats include the full transcript text, and timed formats include word-level or segment-level timestamps.

TXT (Plain Text)

.txt

Simple plain text transcript without formatting. Best for copying into documents, emails, or other applications. Includes speaker labels when speaker detection is enabled.

Free plan

SRT (SubRip Subtitle)

.srt

The most widely supported subtitle format. Includes sequential numbering, timestamps, and text. Compatible with YouTube, Vimeo, VLC, Premiere Pro, Final Cut, and virtually every video player and editor.

Free plan

VTT (WebVTT)

.vtt

Web Video Text Tracks format, the standard for HTML5 video captions. Supports styling, positioning, and metadata. Used by web browsers, streaming platforms, and modern video players.

Basic plan+

DOCX (Word Document)

.docx

Formatted Word document with proper headings, timestamps, and speaker labels. Ideal for meeting minutes, reports, and documents that need further editing in Microsoft Word or Google Docs.

Basic plan+

JSON (Structured Data)

.json

Machine-readable structured format with word-level timestamps, confidence scores, speaker IDs, and segment data. Perfect for developers building on top of STT.ai or feeding data into other systems.

Basic plan+

PDF (Portable Document)

.pdf

Professional formatted PDF with timestamps, speaker labels, and STT.ai branding. Ideal for sharing with clients, archiving records, or printing. Layout is optimized for readability.

Basic plan+

Format Comparison

Feature TXT SRT VTT DOCX JSON PDF
Plain text
Timestamps
Speaker labels
Word-level timing
Confidence scores
Video player compatible
Editable
Machine-readable

Which Format Should You Choose?

For subtitles and captions

Use SRT for maximum compatibility or VTT for web-based video players. SRT works with YouTube, Vimeo, Premiere Pro, Final Cut, and DaVinci Resolve.

For documents and reports

Use DOCX for editable documents or PDF for sharing and archiving. Both include formatted timestamps and speaker labels.

For developers and integrations

Use JSON for the richest data including word-level timestamps, confidence scores, and speaker IDs. Ideal for building custom applications.

For quick copy-paste

Use TXT for a simple plain text transcript you can paste anywhere -- emails, notes, chat, or any text field.

Batch Export

Need to export multiple transcripts at once? STT.ai supports batch export from your transcript library. Select multiple transcripts, choose your format, and download them all in a single ZIP file. Available on all paid plans.

API Export

Developers can retrieve transcripts in any format via the STT.ai API. Simply specify the desired format in your API request and receive the formatted output directly. The JSON format includes the most detailed data including word-level timestamps and confidence scores.

Transcribe and export in any format

Upload audio or video. Choose your export format. Download instantly.

Start Transcribing Free

Frequently Asked Questions

Upload your audio or video file to STT.ai. Select your preferred AI model and options, then click Transcribe. Your transcript will be ready in minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

STT.ai offre 600 minutes gratuites par mois pour tous les utilisateurs. Aucune inscription n'est requise pour votre première transcription.Les plans payés avec plus de minutes et de fonctionnalités commencent à $5/mois.

La précision dépend du modèle d’IA que vous choisissez et de la qualité audio. Nos meilleurs modèles atteignent un taux d’erreur de mots de 5-7% sur les benchmarks, ce qui signifie une précision de 93-95% +.

STT.ai offre 10+ modèles incluant Whisper Large V3, NVIDIA Canary, et plus.Vous pouvez comparer les résultats de différents modèles sur le même fichier.

Ye. Après la transcription, exportez votre transcription comme fichiers de sous-titres SRT ou VTT.Ces fonctionnent avec YouTube, Vimeo, et toutes les principales plateformes de vidéo.

Oye. STT.ai identifye et étiquette automatiquement les différents locuteurs en utilisant la diarisation des locuteurs AI. Fonctionne sur tous les modèles et langues.

La plupart des fichiers sont transcrits en moins de 5 minutes.Un fichier audio d'une heure prend généralement 2-3 minutes avec nos modèles les plus rapides.

STT.ai supports 20+ audio and video formats including MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, and AVI.Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Fichiers audio sont traités et supprimés après transcription. Vos données ne sont jamais utilisées pour formation. Le chiffrement côté client est gratuit sur tous les plans — il chiffre les transcriptions stockées avec une clé que vous seul avez. Pendant le traitement, le serveur gère votre audio en texte clair. Learn about our security.

STT.ai offre une API REST avec des SDK Python et Node.js. Le niveau gratuit comprend 100 minutes/mois.

Yes. STT.ai includes a built-in transcript editor where you can correct errors, rename speakers, and adjust timestamps.

Ekspore na DOCX o PDF ko email. Pro plans otima password-protected and permanent links.

STT.ai supporte 1,300+ platformes incluant YouTube, Vimeo, TikTok, SoundCloud, et plus. La transcription URL fonctionne seulement avec des audio et des vidéos disponibles publiquement. Le contenu protégé par DRM (comme les épisodes premium de Spotify, Netflix, Disney+, etc.) ne peut pas être transcrit. Pour le contenu DRM, téléchargez le fichier séparément et chargez-le directement.