Segnala bug / richiesta di funzionalità

Audio in testo online gratis

Converti audio in testo con trascrizione IA. Carica file audio, registra dal microfono o incolla un URL. Oltre 100 lingue, 10+ modelli, 98%+ di precisione.

Funziona con audio e video pubblicamente disponibili. I contenuti protetti da DRM non sono supportati.

Aggiornamento per Enhanced

Private transcript

Parlare con la trascrizione

Sblocca con Pro →

Rilascia il file qui o fai clic per navigare

MP3, WAV, M4A, FLAC, MP4, MKV, MOV, WebM fino a 2GB

Caricamento batch di file multipli con Pro

Aggiornamento per Enhanced

Private transcript

Parlare con la trascrizione

Sblocca con Pro →

Aggiornamento per Enhanced

Discorso in tempo reale al testo. AI auto-corregge mentre si parla di precisione di galattosio migliora con il discorso più lungo.

Prova prima il microfono

10 minuti/giorno gratuiti 600 min gratis con iscrizione Nessuna carta di credito Cifrato

Iscriviti gratis →

1. Upload Audio

Upload MP3, WAV, M4A, FLAC, OGG, or any audio format. Up to 2GB.

2. AI Processes Audio

AI extracts speech from your audio with speaker detection and timestamps.

3. Get Your Transcript

View, edit, download, or share. Export as TXT, SRT, VTT, DOCX, or PDF.

Supported Audio Formats

MP3 WAV M4A FLAC OGG MP4 MKV MOV WebM AVI

Audio to Text Models

Scegli il modello IA adatto alle tue esigenze — o lascia che scegliamo il migliore.

STT.ai Enhanced

3.2% WER · 100 langs

Whisper Large V3

4.2% WER · 99 langs

5.1% WER · 99 langs

3.5% WER · 4 langs

NVIDIA Parakeet

3.0% WER · 1 langs

Transcribe Audio in 100+ Languages

English Spanish French German Japanese Arabic Hindi Portuguese Russian Korean Tutte le lingue →

Audio to Text Use Cases

Ready to convert audio to text?

Inizia gratis →

Domande frequenti

Upload your audio file or paste a URL, pick an AI model, and click Transcribe. STT.ai returns editable text with timestamps and speaker labels — most files finish in under five minutes.

MP3, WAV, M4A, FLAC, OGG, AAC, AMR, and 10+ more are all supported. You don't need to convert between formats first — upload whatever your recorder or app produces.

A little. Lossless formats like WAV and FLAC carry bit-perfect audio, so accuracy is bounded only by the model and speaker clarity. Lossy formats (MP3, M4A) at 128 kbps or higher are effectively identical; very low bitrates under 64 kbps can cost a few points.

Yes. STT.ai includes 600 free minutes per month with no signup for your first file. Paid plans starting at $5/month add longer files, private transcripts, and priority processing.

On clean audio our best models reach 95-97% accuracy (3-5% Word Error Rate). Background noise, overlapping speakers, and strong accents are the main factors that lower accuracy.

Yes. Free users can transcribe up to one hour per file; paid plans extend that to 8+ hours, which covers full-length podcasts, interviews, and audiobooks in a single pass.

Yes. Speaker diarization labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the editor — works on every supported audio format and model.

Export to TXT, DOCX, PDF, JSON, or SRT/VTT subtitles. JSON keeps machine-readable timestamps and speaker labels; DOCX and PDF are best for sharing and archiving.

Yes. 100+ languages with auto-detection, plus the option to set the language manually. Mixed-language audio is handled by switching mid-file, and you can translate the result afterwards.

Yes. Audio is processed and deleted by default, and Pro plans add client-side encryption so transcripts are unreadable without your key. Nothing is used for training without explicit opt-in.

Yes. Paste a link from any of 1,300+ supported platforms — podcast hosts, SoundCloud, YouTube, and more — and STT.ai fetches the audio directly. DRM-protected sources can't be transcribed.

Yes. The REST API accepts audio files directly, with Python and Node.js SDKs and a free tier of 100 minutes/month. Per-second billing applies beyond the free tier.