Export Formats

Download your transcripts in whatever format you need. STT.ai supports six export formats, each optimized for a different workflow.

Supported Export Formats

After you transcribe your audio or video, you can download the transcript in any of the formats listed below. Every format includes the full transcript text, and the timed formats also include word-level or segment-level timestamps.

TXT (Plain Text)

.txt

Plain text transcript with no formatting. Ideal for copying into documents, emails, or other applications. Includes speaker labels when speaker diarization is enabled.

Free plan

SRT (SubRip Subtitle)

.srt

A widely supported subtitle format. Each cue includes a sequence number, timestamps, and text. Compatible with YouTube, Vimeo, VLC, Premiere Pro, Final Cut, and most video players and editors.

Free plan
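
To illustrate the cue layout described above (sequence number, time range, text), here is a minimal Python sketch that renders timed segments as SRT. The `Segment` class and function names are hypothetical and are not part of STT.ai; only the SRT cue layout itself is standard.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Illustrative container for one timed chunk of transcript (hypothetical)."""
    start: float  # seconds from the start of the media
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as HH:MM:SS,mmm (SRT uses a comma before milliseconds)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as SRT cues: index, time range, text, then a blank line."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}"
        cues.append(f"{index}\n{time_range}\n{seg.text}\n")
    return "\n".join(cues)
```

Calling `to_srt([Segment(0.0, 2.5, "Hello")])` yields a single cue numbered 1 with the range `00:00:00,000 --> 00:00:02,500`.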

VTT (WebVTT)

.vtt

Web Video Text Tracks format, the standard for HTML5 video captions. Used by web browsers, streaming platforms, and modern video players.

Basic plan+
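
VTT and SRT are structurally close: a WebVTT file starts with a `WEBVTT` header line and writes milliseconds after a period rather than a comma. The sketch below converts SRT text to WebVTT on that basis. It is a simplified illustration with a hypothetical function name, not STT.ai's own converter, and it ignores WebVTT extras such as styling and cue settings.

```python
import re

# Matches an SRT timestamp such as 00:01:02,345 and captures both halves.
_SRT_TIME = re.compile(r"(\d{2}:\d{2}:\d{2}),(\d{3})")


def srt_to_vtt(srt_text: str) -> str:
    """Convert SRT text to WebVTT: add the header, use '.' before milliseconds."""
    lines = []
    for line in srt_text.strip().splitlines():
        if "-->" in line:
            # Only timing lines are rewritten; cue text is left untouched.
            line = _SRT_TIME.sub(r"\1.\2", line)
        lines.append(line)
    return "WEBVTT\n\n" + "\n".join(lines) + "\n"
```

The numeric SRT sequence numbers are kept as they are, since WebVTT accepts them as optional cue identifiers.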

DOCX (Word Document)

.docx

Formatted Word document with proper headings, timestamps, and speaker labels. Well suited to meeting minutes, reports, and any document that needs further editing in Microsoft Word or Google Docs.

Basic plan+

JSON (Structured Data)

.json

Machine-readable structured format with word-level timestamps, confidence scores, speaker IDs, and segment metadata. Perfect for developers building on top of STT.ai or feeding data into other systems.

Basic plan+
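
As an illustration of what the word-level data makes possible, the sketch below flags words whose confidence falls under a threshold so they can be reviewed by hand. The field names (`segments`, `words`, `speaker`, `confidence`, and so on) and the sample payload are assumptions made for this example; check a real STT.ai JSON export for the actual schema.

```python
import json

# Hypothetical payload: the real export schema may use different keys.
SAMPLE = """
{
  "segments": [
    {
      "speaker": "Speaker 1",
      "words": [
        {"word": "Hello", "start": 0.0, "end": 0.4, "confidence": 0.99},
        {"word": "wrld", "start": 0.5, "end": 0.9, "confidence": 0.42}
      ]
    }
  ]
}
"""


def low_confidence_words(raw_json: str, threshold: float = 0.6) -> list[tuple[str, str, float]]:
    """Return (speaker, word, start_time) for every word scored below the threshold."""
    data = json.loads(raw_json)
    flagged = []
    for segment in data["segments"]:
        for word in segment["words"]:
            if word["confidence"] < threshold:
                flagged.append((segment["speaker"], word["word"], word["start"]))
    return flagged
```

With the sample above, only the misrecognized word "wrld" is flagged, along with its speaker and start time.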

PDF (Portable Document)

.pdf

Professionally formatted PDF with timestamps, speaker labels, and STT.ai branding. Ideal for sharing with clients, archiving, or printing. The layout is optimized for readability.

Basic plan+

Format Comparison

Feature	TXT	SRT	VTT	DOCX	JSON	PDF
Plain text	✓	✓	✓	✓	✓	✓
Timestamps	✗	✓	✓	✓	✓	✓
Speaker labels	✓	✓	✓	✓	✓	✓
Word-level timing	✗	✗	✗	✗	✓	✗
Confidence scores	✗	✗	✗	✗	✓	✗
Video player compatible	✗	✓	✓	✗	✗	✗
Editable	✓	✓	✓	✓	✓	✗
Machine-readable	✗	✗	✗	✗	✓	✗
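
The comparison table can also be treated as data. The sketch below copies its rows into a small lookup and returns the formats that support every feature you ask for. The key names and the `formats_with` helper are made up for this example.

```python
FORMATS = ("TXT", "SRT", "VTT", "DOCX", "JSON", "PDF")

# Rows copied from the comparison table; True corresponds to a check mark.
# The "Plain text" row is omitted because every format includes it.
FEATURES = {
    "timestamps":              (False, True,  True,  True,  True,  True),
    "speaker_labels":          (True,  True,  True,  True,  True,  True),
    "word_level_timing":       (False, False, False, False, True,  False),
    "confidence_scores":       (False, False, False, False, True,  False),
    "video_player_compatible": (False, True,  True,  False, False, False),
    "editable":                (True,  True,  True,  True,  True,  False),
    "machine_readable":        (False, False, False, False, True,  False),
}


def formats_with(*required: str) -> list[str]:
    """Return every format that supports all of the required features."""
    return [
        name
        for column, name in enumerate(FORMATS)
        if all(FEATURES[feature][column] for feature in required)
    ]
```

For example, `formats_with("timestamps", "video_player_compatible")` narrows the choice to SRT and VTT, and `formats_with("word_level_timing")` leaves only JSON.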

Which format should you choose?

For subtitles and captions

Use SRT for maximum compatibility or VTT for web-based video players. SRT works with YouTube, Vimeo, Premiere Pro, Final Cut, and DaVinci Resolve.

For documents and reports

Use DOCX for editable documents or PDF for sharing and archiving. Both include formatted timestamps and speaker labels.

For developers and integrations

Use JSON for the richest data including word-level timestamps, confidence scores, and speaker IDs. Ideal for building custom applications.

For quick copy-paste

Use TXT for a simple plain text transcript you can paste anywhere -- emails, notes, chat, or any text field.

Batch Export

Need to export multiple transcripts at once? STT.ai supports batch export from your transcript library. Select multiple transcripts, choose your format, and download them all in a single ZIP file. Available on all paid plans.
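
Once a batch ZIP has been downloaded, a few lines of standard-library Python can show what it contains. The sketch below groups archive members by file extension. The function name and the example file names are placeholders, and nothing is assumed about how STT.ai names the files inside the archive.

```python
import zipfile
from collections import defaultdict
from pathlib import PurePosixPath


def group_by_format(zip_path: str) -> dict[str, list[str]]:
    """Map each file extension in the archive to the member names that use it."""
    groups: dict[str, list[str]] = defaultdict(list)
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue  # skip directory entries
            suffix = PurePosixPath(name).suffix.lower().lstrip(".")
            groups[suffix].append(name)
    return dict(groups)
```

From there, `zipfile.ZipFile.extractall` unpacks the transcripts into a folder of your choice.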

API Access

Developers can retrieve transcripts in any format via the STT.ai API. Simply specify the desired format in your API request and receive the formatted output directly. The JSON format includes the most detailed data including word-level timestamps and confidence scores.
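
The general shape of such a request can be sketched as follows. Everything specific here is a placeholder rather than the documented STT.ai API: the base URL, the `/transcripts/{id}` path, the `format` query parameter, and the bearer-token header are all assumptions, so take the real values from the STT.ai API reference. The function only builds the request object and does not send it.

```python
import urllib.parse
import urllib.request

# Placeholder host: replace with the base URL from the STT.ai API documentation.
BASE_URL = "https://api.example.invalid"

# The six export formats listed on this page.
ALLOWED_FORMATS = {"txt", "srt", "vtt", "docx", "json", "pdf"}


def build_export_request(transcript_id: str, fmt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for a transcript in the given format."""
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    # Hypothetical path and query parameter; the real API may differ.
    safe_id = urllib.parse.quote(transcript_id, safe="")
    query = urllib.parse.urlencode({"format": fmt})
    url = f"{BASE_URL}/transcripts/{safe_id}?{query}"
    # Hypothetical auth scheme; check the API docs for the real header.
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {api_key}"})
```

Validating the format before building the URL keeps typos such as `"mp3"` from turning into a failed network call.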

Transcribe and export in any format

Upload your audio or video. Choose your export format. Click export.

Start transcribing for free

Frequently Asked Questions

How do export formats work on STT.ai?

Everything runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and usually returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Are export formats free?

Yes. Every visitor gets 600 free minutes/month on STT.ai, and they can be used for exporting just like any other workflow. TXT and SRT export are included in the Free plan; VTT, DOCX, JSON, and PDF require the Basic plan or higher. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

How accurate are exported transcripts?

Exports use the same AI models as the rest of STT.ai. Our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

What AI models can I use?

Transcripts can come from any of STT.ai's 10+ models: STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported languages), Whisper Turbo (fast), Moonshine (lightweight), and more.

Can I get subtitles from my transcript?

Yes. Every transcript exports as SRT or VTT (SRT on the Free plan, VTT on Basic and above), which work with YouTube, Vimeo, TikTok, VLC, and most major video players. The burn-subtitles tool overlays them onto video as hardsubs.

Do exports distinguish different speakers?

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

How long does it take?

Most jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on the chosen model and current GPU load.

What input formats are supported?

STT.ai accepts 20+ input formats, including MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, and AVI. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Is my audio private?

Yes. Submitted audio files are processed and deleted by default. Pro plans add client-side encryption: even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Is there an API for export formats?

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor, all usable for export workflows. The free API tier includes 100 minutes/month.

Can I edit a transcript after it is created?

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

How do I share an exported transcript?

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links, which are useful for client work.

Which platforms can STT.ai transcribe from?

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly available content only; DRM-protected sources can't be transcribed.