Sleep en laat een audio-of videobestand (MP3, WAV, MP4, en 20+ formaten). Record van uw microfoon in real-time. Of plak een link van YouTube, Vimeo, TikTok, en 1.300+ platforms.

2. AI Transcribeert met uw keuze van model

Kies uit 10+ AI modellen waaronder Whisper, NVIDIA Canary (#1 nauwkeurigheid), en Moonshine. Auto-detect taal van 100+ opties. Speaker diarization identificeert wie wat zei.

3. Exporteren, delen of integreren

Downloaden als TXT, SRT, VTT, DOCX, JSON, of PDF. Delen via link. Gebruik onze API om transcriptie te integreren in uw app. Perfect voor ondertitels, meeting notes, podcasts, en nog veel meer.

Popular Use Cases

Alle use cases →

Vergaderingen

Meeting notes & actie-items

Podcasts

Transcripts & tonen notities

Klassenotities & studiegidsen

Juridisch

Deposito's en gerechtshof

Alles wat je nodig hebt voor audio en video

70+ gratis gereedschap aangedreven door AI

Toespraak naar tekst

Audio- en videobestanden overschrijven

Live-Transcription

Real-time microfoon transcriptie

YouTube Transcripts

Onderschriften uit een video halen

Ondertiteleditor

SRT- & VTT-bestanden online bewerken

Noise Remover

Achtergrondgeluid uit audio verwijderen

Audioconverter

MP3, WAV, FLAC, OGG, AAC & meer

Vocal Remover

Zang isoleren of verwijderen

Audio-trimmer

Audiobestanden knippen en trimmen

Bijschriftomvormer

SRT, VTT, SSA, SBV-formaten

Vergaderingsnotulen

Actie-items & samenvattingen uitpakken

Tekst naar spraak

Tekst omzetten naar natuurlijke spraak

Ondertiteling vertaler

Vertalen Nederlands ondertiteling: 100+ languages

Bekijk alle 70+ gereedschappen →

100+

Ondersteunde talen

70+

Vrije hulpprogramma's

1,300+

Ondersteunde platforms

Formaten exporteren

Ontwikkelaar-eerste API

Integreer spraak-naar-tekst in uw app in minuten. RESTful API met real-time WebSocket streaming.

REST + WebSocket — Bestand uploaden en real-time streamen

Meerdere modellen — Whisper, Canary, Enhanced & meer

Diaratie van de luidspreker — Auto-detect wie zei wat

Flexibele output — JSON, TXT, SRT, VTT met woordtijdstempels

API Docs Speeltuin

import requests

response = requests.post(
    "https://api.stt.ai/v1/transcribe",
    headers={"Authorization": f"Bearer {API_KEY}"},
    files={"file": open("meeting.mp3", "rb")},
    data={
        "model": "large-v3-turbo",
        "language": "auto",
        "diarize": "true",
        "response_format": "json",
    },
)

result = response.json()
for seg in result["segments"]:
    print(f"{seg['speaker']}: {seg['text']}")

import fs from "fs";

const form = new FormData();
form.append("file", fs.createReadStream("meeting.mp3"));
form.append("model", "large-v3-turbo");
form.append("language", "auto");
form.append("diarize", "true");

const res = await fetch("https://api.stt.ai/v1/transcribe", {
  method: "POST",
  headers: { Authorization: `Bearer ${API_KEY}` },
  body: form,
});

const { segments } = await res.json();
segments.forEach(s =>
  console.log(`${s.speaker}: ${s.text}`)
);

Overschakelen van een andere toespraak naar een sms-dienst?

STT.ai vs Otter.ai STT.ai vs TurboScribe STT.ai vs Fireflies STT.ai vs Rev Alles vergelijken →

Eenvoudige, transparante prijzen

Begin vrij, schuin naarmate je groeit.

Vrij

$0/munit description in lists

600 min/maand

5 talen
TXT & SRT-export
API-toegang

Starter

$9/munit description in lists

3.000 min/maand

100+ talen
Alle AI modellen
Alle exportformaten

MEER POPULAIR

Pro

$19/munit description in lists

7.500 min/maand

Privé-transcripties
Onbeperkte teamzetels
Prioritaire verwerking

Zaken

$39/munit description in lists

20.000 min/maand

Alles in Pro
50K min opslag
Onbeperkt AI-chat

Bekijk alle plannen & prijzen →

Ondersteunde talen

Alle 100+ talen →

English Spanish French German Japanese Chinese Arabic Hindi Portuguese Russian Korean Italian Turkish Dutch Polish +85 meer

Klaar om te transcriberen?

Upload uw eerste bestand gratis. Geen creditcard, geen aanmelding. 600 minuten per maand op het gratis plan.

Transcripting starten

Veelgestelde vragen

speech to text runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes — every visitor gets 600 free minutes/month on STT.ai, usable for speech to text the same as any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

speech to text runs on the same AI models as the rest of STT.ai — our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

speech to text can run on any of STT.ai's 10+ models — STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported langs), Whisper Turbo (fast), Moonshine (lightweight), and more.

Yes. Every transcript exports as SRT or VTT — works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

Most speech to text jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.

speech to text accepts 20+ formats — MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, and more. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files submitted to speech to text are processed and deleted by default. Pro plans add client-side encryption — even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor — all usable for speech to text workflows. Free API tier includes 100 minutes/month.

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.

Gratis AI Toespraak naar tekst

Toespraak naar tekstmodellen

Hoe werkt STT.ai?

1. Uploaden, opnemen of plakken URL

2. AI Transcribeert met uw keuze van model

3. Exporteren, delen of integreren

Popular Use Cases

Alles wat je nodig hebt voor audio en video

Ontwikkelaar-eerste API

Eenvoudige, transparante prijzen

Ondersteunde talen

Klaar om te transcriberen?

Veelgestelde vragen

How does speech to text work on STT.ai?

Is speech to text free?

How accurate is speech to text?

What AI models can I use for speech to text?

Can I get subtitles from speech to text?

Does speech to text detect different speakers?

How long does speech to text take?

What input formats does speech to text support?

Is my audio private when I use speech to text?

Is there a speech to text API?

Can I edit a speech to text transcript after?

How do I share what speech to text produces?

What other platforms work beyond speech to text?