Drag and drop any audio or video file (MP3, WAV, MP4, and 20+ formats). Record from your microphone in real time. Or paste a link from YouTube, Vimeo, TikTok, and 1,300+ platforms.

2. AI Transcribes with Your Choice of Model

Choose from 10+ AI models, including Whisper, NVIDIA Canary (#1 accuracy), and Moonshine. Auto-detect the language from 100+ options. Speaker diarization identifies who said what.

3. Export, Share, or Integrate

Download as TXT, SRT, VTT, DOCX, JSON, or PDF. Share via link. Use our API to integrate transcription into your app. Perfect for subtitles, meeting notes, podcasts, and more.

Popular Use Cases

All use cases →

Class notes & study guides

Legal

Court proceedings

All You Need for Audio & Video

70+ free tools powered by AI

Speech to Text

Transcribe audio and video files

Live Transcription

Realtime microphone transcription

YouTube Video

Extract subtitles from any video

Subtitle Editor

Edit SRT & VTT files online

Noise Remover

Remove background noise from audio

Audio Converter

MP3, WAV, FLAC, OGG, AAC, and more

Vocal Remover

Isolate vocals or remove them

Audio Trimmer

Cut and trim audio files

Subtitle Converter

SRT, VTT, SSA, SBV formats

Meetings

Extract & summarize

Text to Speech

Convert text to natural-sounding speech

Subtitle Translator

Translate subtitles into 100+ languages

See all 70+ tools →

100+

Supported languages

70+

Free Tools

1,300+

Supported platforms

Export formats

Developer-First API

Integrate speech-to-text into your app in minutes. RESTful API with real-time WebSocket streaming.

REST + WebSocket: File upload and real-time streaming

Multiple models: Whisper, Canary, Enhanced & more

Speaker diarization: Auto-detect who said what

Flexible output: JSON, TXT, SRT, VTT with word-level timestamps
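
To make the subtitle output in the list above more concrete, here is a minimal Python sketch of how timestamped segments could be rendered as SRT on the client side. The `start` and `end` fields (in seconds) are assumptions made for illustration and are not confirmed by this page; check the API docs for the actual response schema.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list[dict]) -> str:
    """Render segments as numbered SRT cues separated by blank lines.

    Assumes each segment has hypothetical `start`/`end` keys (seconds)
    plus a `text` key.
    """
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(cues)


# Hand-written sample data, not real API output.
example = [
    {"start": 0.0, "end": 2.5, "text": "Welcome, everyone."},
    {"start": 2.5, "end": 5.0, "text": "Let's get started."},
]
print(segments_to_srt(example))
```

In practice the built-in SRT and VTT exports make this unnecessary; the sketch is mainly useful when you request JSON and want to post-process the cues yourself.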

API Docs Playground

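# Python example. Assumes the API key is stored in the STT_API_KEY
# environment variable (the variable name is illustrative).
import os

import requests

API_KEY = os.environ["STT_API_KEY"]

# Open the file in a context manager so the handle is closed after upload.
with open("meeting.mp3", "rb") as audio:
    response = requests.post(
        "https://api.stt.ai/v1/transcribe",
        headers={"Authorization": f"Bearer {API_KEY}"},
        files={"file": audio},
        data={
            "model": "large-v3-turbo",
            "language": "auto",
            "diarize": "true",
            "response_format": "json",
        },
    )
response.raise_for_status()  # fail early on HTTP errors

# Print each diarized segment as "speaker: text".
result = response.json()
for seg in result["segments"]:
    print(f"{seg['speaker']}: {seg['text']}")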

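// Node.js example (Node 18+, run as an ES module). Assumes the API key is in
// the STT_API_KEY environment variable (the variable name is illustrative).
import { readFile } from "node:fs/promises";

const API_KEY = process.env.STT_API_KEY;

// The built-in FormData takes Blob parts, not fs read streams.
const form = new FormData();
form.append("file", new Blob([await readFile("meeting.mp3")]), "meeting.mp3");
form.append("model", "large-v3-turbo");
form.append("language", "auto");
form.append("diarize", "true");

const res = await fetch("https://api.stt.ai/v1/transcribe", {
  method: "POST",
  headers: { Authorization: `Bearer ${API_KEY}` },
  body: form,
});
if (!res.ok) throw new Error(`Request failed: ${res.status}`);

// Print each diarized segment as "speaker: text".
const { segments } = await res.json();
segments.forEach(s =>
  console.log(`${s.speaker}: ${s.text}`)
);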
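
Diarized output often contains several consecutive segments from the same speaker. As a rough sketch of one way to tidy that up for meeting notes, the hypothetical helper below merges adjacent segments into speaker turns. It relies only on the `speaker` and `text` fields used in the snippets above; the sample data is hand-written, not real API output.

```python
def merge_speaker_turns(segments):
    """Merge consecutive segments from the same speaker into one turn."""
    turns = []
    for seg in segments:
        text = seg["text"].strip()
        if turns and turns[-1]["speaker"] == seg["speaker"]:
            # Same speaker as the previous turn: extend that turn.
            turns[-1]["text"] += " " + text
        else:
            # Speaker changed (or first segment): start a new turn.
            turns.append({"speaker": seg["speaker"], "text": text})
    return turns


sample = [
    {"speaker": "Speaker 1", "text": "Thanks for joining."},
    {"speaker": "Speaker 1", "text": "Let's review the agenda."},
    {"speaker": "Speaker 2", "text": "Sounds good."},
]
for turn in merge_speaker_turns(sample):
    print(f"{turn['speaker']}: {turn['text']}")
```

The helper builds new dictionaries rather than mutating the input, so the original segment list stays available for timestamp-based exports.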

Switching from another speech-to-text service?

STT.ai vs Otter.ai | STT.ai vs TurboScribe | STT.ai vs Fireflies | STT.ai vs Rev | Compare all →

Simple, Transparent Pricing

Start free. Scale as you grow.

Free

$0/mo

600 min/month

5 Languages
TXT & SRT export
API access

Starter

$9/mo

3,000 min/month

100+ languages
All AI models
All export formats

Most popular

Pro

$19/mo

7,500 min/month

Private transcripts
Unlimited team seats
Priority processing

Business

$39/mo

20,000 min/month

All in Pro
50K min storage
Unlimited AI chat

See all plans and pricing →

Supported Languages

100+ languages →

English, Spanish, French, German, Japanese, Chinese, Arabic, Hindi, Portuguese, Russian, Korean, Italian, Turkish, Dutch, Polish, +85 more

Ready to transcribe?

Upload your first file for free. No credit card, no sign-up. 600 minutes per month on the free plan.

Start Transcribing

Frequently Asked Questions

How does speech to text work on STT.ai?

Speech to text runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Is speech to text free?

Yes. Every visitor gets 600 free minutes/month on STT.ai, usable for speech to text the same as any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

How accurate is speech to text?

Speech to text runs on the same AI models as the rest of STT.ai. Our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

What AI models can I use for speech to text?

Speech to text can run on any of STT.ai's 10+ models: STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported languages), Whisper Turbo (fast), Moonshine (lightweight), and more.

Can I get subtitles from speech to text?

Yes. Every transcript exports as SRT or VTT, which works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Does speech to text detect different speakers?

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

How long does speech to text take?

Most speech to text jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on the chosen model and current GPU load.

What input formats does speech to text support?

Speech to text accepts 20+ formats, including MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, and AVI. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Is my audio private when I use speech to text?

Yes. Audio files submitted to speech to text are processed and deleted by default. Pro plans add client-side encryption: even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Is there a speech to text API?

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor, all usable for speech to text workflows. Free API tier includes 100 minutes/month.

Can I edit a speech to text transcript afterward?

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

How do I share a speech to text transcript?

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links, which are useful for client work.

Which platforms does URL transcription support?

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly available content only; DRM-protected sources can't be transcribed.
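
For context on the accuracy figures quoted in the FAQ: Word Error Rate is conventionally the number of word substitutions, deletions, and insertions needed to turn the transcript into the reference, divided by the number of reference words, so a 3-5% WER means roughly 3 to 5 word errors per 100 reference words. The Python sketch below shows one way to compute it with a word-level edit distance. It only lowercases and splits on whitespace, which is an assumption; published benchmarks usually apply their own text normalization, so results will not match them exactly.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Compute WER = (substitutions + deletions + insertions) / reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (classic Levenshtein, one row at a time).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


reference = "the quick brown fox jumps over the lazy dog"
hypothesis = "the quick brown fox jumped over lazy dog"
# One substitution and one deletion against nine reference words.
print(f"WER: {word_error_rate(reference, hypothesis):.1%}")
```

Running the same check on a short, hand-corrected sample of your own audio is a quick way to compare models before committing to one.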