Audio to Text Online

Convert audio to text with AI-powered transcription. Upload audio files, record from your microphone, or paste a URL. 100+ languages, 10+ models, 98%+ accuracy.

Works with publicly available audio and video. DRM-protected content is not supported.

Drop a file here or click to browse

MP3, WAV, M4A, FLAC, MP4, MKV, MOV, WebM, up to 2GB

Upload multiple files with Pro

Real-time speech to text. The AI corrects the transcript automatically as you speak, and accuracy improves with longer speech.

Test your microphone first

10 free minutes/day. 600 free minutes with signup. No credit card. Encrypted.

Sign up free →

1. Upload Audio

Upload MP3, WAV, M4A, FLAC, OGG, or any other audio format, up to 2GB.

2. AI Processes the Audio

The AI extracts speech from your audio, with speaker detection and timestamps.

3. Get Your Transcript

View, edit, download, or share. Export as TXT, SRT, VTT, DOCX, or PDF.
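To make the SRT export in step 3 concrete, here is a minimal Python sketch of how timestamped, speaker-labelled segments can be written out as SRT subtitles. The segment layout (`start`, `end`, `speaker`, `text`) and both function names are assumptions made for illustration; the page does not describe STT.ai's internal transcript format.

```python
from typing import Iterable, Mapping


def srt_timestamp(seconds: float) -> str:
    """Format a time offset in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Mapping]) -> str:
    """Render segments as SRT cues: index, time range, then the text line.

    Each segment is a hypothetical dict with 'start' and 'end' in seconds,
    'text', and an optional 'speaker' label.
    """
    blocks = []
    for index, seg in enumerate(segments, start=1):
        label = f"{seg['speaker']}: " if seg.get("speaker") else ""
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{label}{seg['text']}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.25, "speaker": "Speaker 1", "text": "Hello."},
        {"start": 2.25, "end": 4.0, "speaker": "Speaker 2", "text": "Hi there."},
    ]
    print(segments_to_srt(demo))
```

VTT output would differ mainly in its `WEBVTT` header line and in using a period instead of a comma before the milliseconds.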

Supported Audio Formats

MP3 WAV M4A FLAC OGG MP4 MKV MOV WebM AVI

Audio-to-Text Models

Choose the AI model that fits your needs, or let us pick the best one.

Transcribe Audio in 100+ Languages

English Spanish French German Japanese Arabic Hindi Portuguese Russian Korean All languages →

Audio to Text

When should you convert audio to text?

Start Free →

Frequently Asked Questions

How do I convert audio to text?

Upload your audio file or paste a URL, pick an AI model, and click Transcribe. STT.ai returns editable text with timestamps and speaker labels, and most files finish in under five minutes.

What audio formats can I convert to text?

MP3, WAV, M4A, FLAC, OGG, AAC, AMR, and 10+ more are all supported. You don't need to convert between formats first: upload whatever your recorder or app produces.

Does the audio format affect accuracy?

A little. Lossless formats like WAV and FLAC carry bit-perfect audio, so accuracy is bounded only by the model and speaker clarity. Lossy formats (MP3, M4A) at 128 kbps or higher give effectively identical results; very low bitrates under 64 kbps can cost a few percentage points of accuracy.

Is audio-to-text conversion free?

Yes. STT.ai includes 600 free minutes per month with no signup for your first file. Paid plans starting at $5/month add longer files, private transcripts, and priority processing.

How accurate is audio to text?

On clean audio our best models reach 95-97% accuracy (3-5% Word Error Rate). Background noise, overlapping speakers, and strong accents are the main factors that lower accuracy.

Can I convert long audio files like podcasts to text?

Yes. Free users can transcribe up to one hour per file; paid plans extend that to 8+ hours, which covers full-length podcasts, interviews, and audiobooks in a single pass.

Does it detect different speakers in the audio?

Yes. Speaker diarization labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the editor. It works on every supported audio format and model.

What output formats can I export the text in?

Export to TXT, DOCX, PDF, JSON, or SRT/VTT subtitles. JSON keeps machine-readable timestamps and speaker labels; DOCX and PDF are best for sharing and archiving.

Can I convert audio to text in other languages?

Yes. 100+ languages with auto-detection, plus the option to set the language manually. Mixed-language audio is handled by switching mid-file, and you can translate the result afterwards.

Is my audio kept private?

Yes. Audio is processed and deleted by default, and Pro plans add client-side encryption so transcripts are unreadable without your key. Nothing is used for training without explicit opt-in.

Can I convert audio to text from a URL?

Yes. Paste a link from any of 1,300+ supported platforms (podcast hosts, SoundCloud, YouTube, and more) and STT.ai fetches the audio directly. DRM-protected sources can't be transcribed.

Is there an API to convert audio to text?

Yes. The REST API accepts audio files directly, with Python and Node.js SDKs and a free tier of 100 minutes/month. Per-second billing applies beyond the free tier.
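The accuracy answer above pairs 95-97% accuracy with a 3-5% Word Error Rate (WER), so it helps to see how WER is usually computed: the word-level edit distance between a reference transcript and the machine output, divided by the number of reference words. The sketch below is a generic Python version. The function name and the normalization (lowercasing and whitespace splitting) are assumptions made here, and the page does not say how STT.ai scores its own models.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference words.

    Normalization is an assumption of this sketch: both strings are
    lowercased and split on whitespace, and punctuation is left untouched.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # previous[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            current.append(min(
                previous[j] + 1,         # deletion: reference word missing
                current[j - 1] + 1,      # insertion: extra hypothesis word
                previous[j - 1] + cost,  # match or substitution
            ))
        previous = current
    return previous[-1] / len(ref)


if __name__ == "__main__":
    wer = word_error_rate("the quick brown fox", "the quick fox")
    print(f"WER: {wer:.2%}, accuracy: {1 - wer:.2%}")
```

Because insertions count as errors, WER can exceed 100% on very poor output, so reading accuracy as 1 - WER is only meaningful when the error rate is low.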