Export Formats

Download your transcripts in any format you need. STT.ai supports six export formats, each optimized for different workflows.

Available Export Formats

After transcribing your audio or video, you can download the transcript in any of the following formats. All formats include the full transcript text, and timed formats include word-level or segment-level timestamps.

TXT (Plain Text)

.txt

Simple plain text transcript without formatting. Best for copying into documents, emails, or other applications. Includes speaker labels when speaker detection is enabled.

Free plan

SRT (SubRip Subtitles)

.srt

The most widely supported subtitle format. Includes sequential numbering, timestamps, and text. Compatible with YouTube, Vimeo, VLC, Premiere Pro, Final Cut, and virtually every video player and editor.

Free plan
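
To make the SRT structure concrete, here is a minimal sketch of how sequential numbering, timestamps, and text fit together in a file. The `(start, end, text)` segment tuples and both function names are assumptions for illustration; they do not describe how STT.ai generates its files.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[tuple[float, float, str]]) -> str:
    """Build SRT text from hypothetical (start_seconds, end_seconds, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        # Each cue: sequence number, time range, then the caption text.
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)


print(to_srt([(0.0, 2.5, "Hello"), (2.5, 4.0, "World")]))
```

Note that SRT uses a comma before the milliseconds, which is one of the few differences from WebVTT.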

VTT (WebVTT)

.vtt

Web Video Text Tracks, the standard for HTML5 video captions. Supports styling, positioning, and metadata cues. Used by web video players, streaming platforms, and custom video applications.

Basic plan+
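
If you only have an SRT file and need WebVTT, the two formats are close enough that a basic conversion is short. The sketch below is an illustration under simple assumptions (well-formed SRT input, no styling or positioning); `srt_to_vtt` is a hypothetical helper, not part of STT.ai.

```python
import re

# Matches an SRT timestamp such as 00:01:02,345 and captures both parts.
_SRT_TIME = re.compile(r"(\d{2}:\d{2}:\d{2}),(\d{3})")


def srt_to_vtt(srt_text: str) -> str:
    """Convert basic SRT text to WebVTT: add the header, fix the timestamps."""
    lines = []
    for line in srt_text.splitlines():
        if "-->" in line:
            # WebVTT uses a period before the milliseconds, SRT a comma.
            line = _SRT_TIME.sub(r"\1.\2", line)
        lines.append(line)
    # A WebVTT file must start with the "WEBVTT" header line.
    return "WEBVTT\n\n" + "\n".join(lines).strip() + "\n"


print(srt_to_vtt("1\n00:00:00,000 --> 00:00:02,500\nHello, world\n"))
```

The numeric cue identifiers from SRT are kept, since WebVTT allows an optional identifier line before each cue.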

DOCX (Word Document)

.docx

Formatted Word document with proper headings, timestamps, and speaker labels. Ideal for meeting notes, reports, and transcripts that need further editing in Microsoft Word or Google Docs.

Basic plan+

JSON (Structured Data)

.json

Machine-readable structured format with word-level timestamps, confidence scores, speaker IDs, and segment data. Perfect for developers building on top of STT.ai or feeding data into other systems.

Basic plan+
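
As an illustration of what word-level data makes possible, the sketch below flags words whose confidence falls under a threshold so they can be reviewed by hand. The field names (`segments`, `speaker`, `words`, `word`, `start`, `end`, `confidence`) are assumptions made for this example; check a real export for the actual schema before relying on them.

```python
import json

# Hypothetical sample; the real STT.ai JSON layout may differ.
SAMPLE = """
{
  "segments": [
    {
      "speaker": "Speaker 1",
      "words": [
        {"word": "hello", "start": 0.0, "end": 0.4, "confidence": 0.98},
        {"word": "wurld", "start": 0.5, "end": 0.9, "confidence": 0.41}
      ]
    }
  ]
}
"""


def low_confidence_words(transcript_json: str, threshold: float = 0.8):
    """Return (speaker, word, start_seconds) for words below the threshold."""
    data = json.loads(transcript_json)
    flagged = []
    for segment in data["segments"]:
        for word in segment["words"]:
            if word["confidence"] < threshold:
                flagged.append((segment["speaker"], word["word"], word["start"]))
    return flagged


print(low_confidence_words(SAMPLE))
```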

PDF (Formatted Document)

.pdf

Professional formatted PDF with timestamps, speaker labels, and STT.ai branding. Ideal for sharing with clients, archiving records, or printing. Layout is optimized for readability.

Basic plan+

Format Comparison

Feature	TXT	SRT	VTT	DOCX	JSON	PDF
Plain text	✓	✓	✓	✓	✓	✓
Timestamps	✗	✓	✓	✓	✓	✓
Speaker labels	✓	✓	✓	✓	✓	✓
Word-level timing	✗	✗	✗	✗	✓	✗
Confidence scores	✗	✗	✗	✗	✓	✗
Video player compatible	✗	✓	✓	✗	✗	✗
Editable	✓	✓	✓	✓	✓	✗
Machine-readable	✗	✗	✗	✗	✓	✗
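
The comparison table can also be treated as a lookup. The sketch below transcribes the table into a dictionary and filters formats by required features; the feature keys and the `formats_with` helper are names invented for this example.

```python
# Feature sets copied from the comparison table above.
FORMAT_FEATURES = {
    "TXT": {"plain_text", "speaker_labels", "editable"},
    "SRT": {"plain_text", "timestamps", "speaker_labels", "video_player", "editable"},
    "VTT": {"plain_text", "timestamps", "speaker_labels", "video_player", "editable"},
    "DOCX": {"plain_text", "timestamps", "speaker_labels", "editable"},
    "JSON": {
        "plain_text", "timestamps", "speaker_labels", "word_timing",
        "confidence", "editable", "machine_readable",
    },
    "PDF": {"plain_text", "timestamps", "speaker_labels"},
}


def formats_with(*features: str) -> list[str]:
    """List the formats that support every requested feature."""
    wanted = set(features)
    return [name for name, have in FORMAT_FEATURES.items() if wanted <= have]


print(formats_with("timestamps", "editable"))
print(formats_with("video_player"))
```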

Which Format Should You Choose?

For subtitles and captions

Use SRT for maximum compatibility or VTT for web-based video players. SRT works with YouTube, Vimeo, Premiere Pro, Final Cut, and DaVinci Resolve.

For documents and reports

Use DOCX for editable documents or PDF for sharing and archiving. Both include formatted timestamps and speaker labels.

For developers and integrations

Use JSON for the richest data including word-level timestamps, confidence scores, and speaker IDs. Ideal for building custom applications.

For quick copy and paste

Use TXT for a simple plain text transcript you can paste anywhere: emails, notes, chat, or any text field.

Batch Export

Need to export multiple transcripts at once? STT.ai supports batch export from your transcript library. Select multiple transcripts, choose your format, and download them all in a single ZIP file. Available on all paid plans.
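
Once a batch ZIP is downloaded, a quick check of its contents can confirm that every transcript arrived in the expected format. This is a minimal sketch using Python's standard `zipfile` module; the file names in the usage example are made up, and the archive layout STT.ai produces may differ.

```python
import io
import zipfile
from collections import Counter


def count_by_extension(zip_bytes: bytes) -> Counter:
    """Count the files in a ZIP archive by lowercase file extension."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        # Skip directory entries, which end with a slash.
        names = [name for name in archive.namelist() if not name.endswith("/")]
    return Counter(
        name.rsplit(".", 1)[-1].lower() if "." in name else "" for name in names
    )


# Usage: build a small archive in memory, then inspect it.
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as archive:
    archive.writestr("meeting.srt", "1\n00:00:00,000 --> 00:00:01,000\nHi\n")
    archive.writestr("call.srt", "1\n00:00:00,000 --> 00:00:01,000\nHello\n")
    archive.writestr("notes.txt", "Plain text transcript")

print(count_by_extension(buffer.getvalue()))
```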

API Access

Developers can retrieve transcripts in any format via the STT.ai API. Simply specify the desired format in your API request and receive the formatted output directly. The JSON format includes the most detailed data including word-level timestamps and confidence scores.
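
The sketch below shows the general shape of such a request: a transcript ID, a format parameter, and an API key. The base URL, path, `format` query parameter, and bearer-token header are all placeholders assumed for illustration, not the documented STT.ai API; consult the official API reference for the real endpoint and parameter names. The code only builds the request and does not send it.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder base URL: NOT the real STT.ai endpoint.
API_BASE = "https://api.example.com/v1"
ALLOWED_FORMATS = {"txt", "srt", "vtt", "docx", "json", "pdf"}


def build_export_request(transcript_id: str, fmt: str, api_key: str) -> Request:
    """Build (but do not send) a request for a transcript in a given format."""
    fmt = fmt.lower()
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    # Hypothetical path and query parameter name.
    url = f"{API_BASE}/transcripts/{transcript_id}?{urlencode({'format': fmt})}"
    return Request(url, headers={"Authorization": f"Bearer {api_key}"})


request = build_export_request("abc123", "SRT", "my-api-key")
print(request.full_url)
```

Validating the format name before building the URL catches typos locally instead of spending an API call on a request that would be rejected.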

Transcribe and export for free

Upload audio or video. Choose your export format. Download instantly.

Frequently Asked Questions

How do export formats work on STT.ai?

Everything runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. You can then export it as TXT, SRT, VTT, DOCX, JSON, or PDF.

Are export formats free?

Yes. Every visitor gets 600 free minutes/month on STT.ai, and exports draw on that allowance the same as any other workflow. TXT and SRT are available on the Free plan; VTT, DOCX, JSON, and PDF require the Basic plan or higher. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

How accurate are exported transcripts?

Exports use the same AI models as the rest of STT.ai. Our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

Which AI models can I use?

Any of STT.ai's 10+ models: STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported languages), Whisper Turbo (fast), Moonshine (lightweight), and more.

Can I export subtitles?

Yes. Every transcript exports as SRT or VTT, which work with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Do exports identify different speakers?

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

How long does it take?

Most jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on the chosen model and current GPU load.

What input formats are supported?

STT.ai accepts 20+ input formats, including MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, and AVI. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Is my audio private?

Yes. Submitted audio files are processed and deleted by default. Pro plans add client-side encryption: even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Is there an API for exports?

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor, all usable for export workflows. The free API tier includes 100 minutes/month.

Can I edit a transcript afterwards?

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

How do I share an exported transcript?

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links, which are useful for client work.

Which platforms does URL transcription work with?

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly available content only; DRM-protected sources can't be transcribed.