Seret lan lempar file audio utawa video (MP3, WAV, MP4, lan 20+ format). Rekaman saka mikrofon ing wektu nyata. Utawa tempel tautan saka YouTube, Vimeo, TikTok, lan 1,300+ platform.

2. AI Transcribes karo pilihan sampeyan saka Model

Pilih saka 10+ model AI kalebu Whisper, NVIDIA Canary (#1 akurasi), lan Moonshine. Auto-deteksi basa saka 100+ pilihan. Speaker diarization ngenali sapa ngandika apa.

3. Eksport, Bagi, utawa Nganggo bareng

Download minangka TXT, SRT, VTT, DOCX, JSON, utawa PDF. Share liwat tautan. Gunakake API kita kanggo nggabungake transkripsi menyang aplikasi sampeyan. Perfect kanggo subtitle, notifikasi rapat, podcast, lan liya-liyane.

Kaca Pangguna

Kabeh kasus pamakéan →

Rapat

Notes & Action Items

Podcast

Transkrip & Papar notifikasi

Subtitles

SRT, VTT lan liya-liyané

Deposisi & pengadilan

Sekabeh sing sampeyan butuhake kanggo Audio & Video

70+ piranti bébas kang dikuasa déning AI

Prasasti

Transkripsi berkas audio lan video

Transkripsi

Real-time microphone transcription

Situs resmi YouTube

Ekstrak subtitle saka video apa wae

Penyunting subtitle

Sumbang berkas SRT & VTT online

Ngaresiki Noise

Busak swara latar mburi saka audio

Konversi Audio

MP3, WAV, FLAC, OGG, AAC lan liya-liyane

Pembersih swara

Isolasi vokal utawa mbusaké

Trimmer Audio

Potong lan polah berkas audio

Pengubah Caption

SRT, VTT, SSA, SBV format

Minutes

Ekstrak item tindakan lan ringkasan

Teks dadi swara

Ngganti teks dadi swara alami

Penerjemah Subtitle

Terjemahake subtitle menyang 100+ basa

Lihat kabeh 70+ piranti →

100+

Basa kang didhukung

70+

Alat Bebas

1,300+

Platform sing didhukung

Format Eksport

Developer-First API

Ing basa Inggris, istilah iki uga bisa digunakaké kanggo nyebut piranti lunak komputer kang bisa disambungake menyang Internet.

REST + WebSocket — Upload file lan real-time streaming

Akèh model — Whisper, Canary, Enhanced lan liya- liya

Diarisisasi juru basa — Otomatis-ngadeteksi sapa kang ngomong apa

Output fleksibel — JSON, TXT, SRT, VTT karo timestamp tembung

Dokumen API Playground

import requests

response = requests.post(
    "https://api.stt.ai/v1/transcribe",
    headers={"Authorization": f"Bearer {API_KEY}"},
    files={"file": open("meeting.mp3", "rb")},
    data={
        "model": "large-v3-turbo",
        "language": "auto",
        "diarize": "true",
        "response_format": "json",
    },
)

result = response.json()
for seg in result["segments"]:
    print(f"{seg['speaker']}: {seg['text']}")

import fs from "fs";

const form = new FormData();
form.append("file", fs.createReadStream("meeting.mp3"));
form.append("model", "large-v3-turbo");
form.append("language", "auto");
form.append("diarize", "true");

const res = await fetch("https://api.stt.ai/v1/transcribe", {
  method: "POST",
  headers: { Authorization: `Bearer ${API_KEY}` },
  body: form,
});

const { segments } = await res.json();
segments.forEach(s =>
  console.log(`${s.speaker}: ${s.text}`)
);

Ngganti saking layanan basa liyane menyang layanan teks?

STT.ai vs Otter.ai STT.ai vs TurboScribe STT.ai vs Fireflies STT.ai vs Rev Ngbandingake kabeh →

Prakiraan rega sing gampang lan transparan

Mulake bebas. Skala nalika sampeyan tuwuh.

Bebas

$0/wulan

600 min/wulan

Basa
Eksport TXT & SRT
API akses

Pembuka

$9/wulan

3,000 min/wulan

Basa
Sedaya model AI
Sembarang format eksport

Populèr

Pro

$19/wulan

7,500 min/wulan

Transkrip pribadi
Sesi tim tanpa wates
Prioritas pangolahan

Bisnis

$39/wulan

20,000 min/wulan

Kabeh ing Pro
50K min storage
Chat AI tanpa wates

Lihat kabeh rencana lan rega →

Basa kang didhukung

Basa 100+ →

English Spanish French German Japanese Chinese Arabic Hindi Portuguese Russian Korean Italian Turkish Dutch Polish +85 luwih

Siap kanggo transkripsi?

Upload file pisanan gratis. Ora kredit kartu, ora signup. 600 menit saben wulan ing rencana gratis.

Ngawiwiti transkripsi

Pitakon kang asring diajukake

speech to text runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes — every visitor gets 600 free minutes/month on STT.ai, usable for speech to text the same as any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

speech to text runs on the same AI models as the rest of STT.ai — our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

speech to text can run on any of STT.ai's 10+ models — STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported langs), Whisper Turbo (fast), Moonshine (lightweight), and more.

Yes. Every transcript exports as SRT or VTT — works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

Most speech to text jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.

speech to text accepts 20+ formats — MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, and more. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files submitted to speech to text are processed and deleted by default. Pro plans add client-side encryption — even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor — all usable for speech to text workflows. Free API tier includes 100 minutes/month.

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.

Free AI Prasasti

Model basa kanggo teks

STT.ai ing sakiwa tengené.

1. Unggah, Rekam, utawa Tetep URL

2. AI Transcribes karo pilihan sampeyan saka Model

3. Eksport, Bagi, utawa Nganggo bareng

Kaca Pangguna

Sekabeh sing sampeyan butuhake kanggo Audio & Video

Developer-First API

Prakiraan rega sing gampang lan transparan

Basa kang didhukung

Siap kanggo transkripsi?

Pitakon kang asring diajukake

How does speech to text work on STT.ai?

Is speech to text free?

How accurate is speech to text?

What AI models can I use for speech to text?

Can I get subtitles from speech to text?

Does speech to text detect different speakers?

How long does speech to text take?

What input formats does speech to text support?

Is my audio private when I use speech to text?

Is there a speech to text API?

Can I edit a speech to text transcript after?

How do I share what speech to text produces?

What other platforms work beyond speech to text?