Seret jeung lebetkeun file audio atawa video (MP3, WAV, MP4, jeung 20+ format). Rekening ti mikrofon anjeun dina waktu nyata. Utawa lebetkeun tautan ti YouTube, Vimeo, TikTok, jeung 1,300+ platform.

2. AI Transkrip karo pilihan Model

Pilih ti 10+ model AI kaasup Whisper, NVIDIA Canary (#1 akurasi), sarta Moonshine. Auto-ngadeteksi basa ti 100+ pilihan. Speaker diarization ngaidentipikasi saha ngomong naon.

3. Eksport, Bagi, utawa Ngagabung

Unduh salaku TXT, SRT, VTT, DOCX, JSON, atanapi PDF. Bagikeun ku tautan. Gunakeun API kami pikeun ngahijikeun transkripsi kana aplikasi anjeun. sampurna pikeun subtitle, catatan rapat, podcast, sareng sajabana.

Kasus Pangguna Populer

Segala kasus panggunaan →

Rapat

Notice meeting & action items

Podcast

Transkrip & Papar catatan

Istilah cekak

SRT, VTT lan liya-liyane

Medis

Transkripsi aman

Leksan

Notes kelas & panduan studi

Legal

Deposit & pengadilan

Sekabeh sing sampeyan butuhake kanggo Audio & Video

70+ alat gratis sing didhukung dening AI

Basa menyang Teks

Transkripsi file audio lan video

Transkripsi langsung

Transkripsi mikrofon wektu nyata

YouTube Transkrip

Ngekstrak caption saka video apa wae

Penyunting isibadan

Edit SRT & VTT file online

Penghapus Noise

Buang swara latar belakang saka audio

Аудио конвертер

MP3, WAV, FLAC, OGG, AAC lan liya-liyane

Pembersih Suara

Isolation vokal utawa mbusak

Trimmer Audio

Potong lan motong file audio

Канал

Pribadi SRT, VTT, SSA, SBV

Minutes

Ekstrak item tindakan & ringkasan

Teks kanggo swara

Ngganti teks dadi swara alami

Translator Subtitle

Translator subtitles to 100+ languages

Lihat kabeh 70+ alat →

100+

Basa sing didukung

70+

Alat bebas

1,300+

Platform sing didukung

Format Eksport

API Pangumbang-Pinakawan

Ngagabungkeun basa-ka-teks kana aplikasi anjeun dina menit. RESTful API kalayan streaming WebSocket real-time.

REST + WebSocket — Ngunduh file lan streaming wektu nyata

Sawetara model — Whisper, Canary, Dioptimalake lan liya-liyane

Speaker diarization — Ngadeteksi kanthi otomatis sapa kang ngomong apa

Output fleksibel — JSON, TXT, SRT, VTT karo tanda wektu tembung

Dokumen API Playground

import requests

response = requests.post(
    "https://api.stt.ai/v1/transcribe",
    headers={"Authorization": f"Bearer {API_KEY}"},
    files={"file": open("meeting.mp3", "rb")},
    data={
        "model": "large-v3-turbo",
        "language": "auto",
        "diarize": "true",
        "response_format": "json",
    },
)

result = response.json()
for seg in result["segments"]:
    print(f"{seg['speaker']}: {seg['text']}")

import fs from "fs";

const form = new FormData();
form.append("file", fs.createReadStream("meeting.mp3"));
form.append("model", "large-v3-turbo");
form.append("language", "auto");
form.append("diarize", "true");

const res = await fetch("https://api.stt.ai/v1/transcribe", {
  method: "POST",
  headers: { Authorization: `Bearer ${API_KEY}` },
  body: form,
});

const { segments } = await res.json();
segments.forEach(s =>
  console.log(`${s.speaker}: ${s.text}`)
);

Ngganti saka layanan basa liyane menyang layanan teks?

STT.ai vs Otter.ai STT.ai vs TurboScribe STT.ai vs Fireflies STT.ai vs Rev Ngbandingake kabeh →

Sederhana, transparan

Mulai bebas. Skala nalika sampeyan tuwuh.

Bebas

$0/100% OFF

600 min/month

5 Basa
Eksport TXT & SRT
API akses

Pemula

$9/100% OFF

3,000 min/month

100+ basa
Seluruh model AI
Sekabeh format eksport

Paling populer

Pro

$19/100% OFF

7,500 min/bulan

Transkrip pribadi
Anggota tim tanpa wates
Prioritas

Bisnis

$39/100% OFF

20,000 min/month

Segalanya ing Pro
50K min panyimpenan
Chat tanpa wates

View all plans & pricing →

Basa sing didukung

100+ basa →

English Spanish French German Japanese Chinese Arabic Hindi Portuguese Russian Korean Italian Turkish Dutch Polish +85 luwih

Siap kanggo ngrekam?

Unggah file munggaran anjeun gratis. Teu kartu kredit, teu ngadaptar. 600 menit per bulan dina rencana gratis.

Mulai Penulisan

Takon-takon sing asring diajukake

speech to text runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes — every visitor gets 600 free minutes/month on STT.ai, usable for speech to text the same as any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

speech to text runs on the same AI models as the rest of STT.ai — our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.

speech to text can run on any of STT.ai's 10+ models — STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported langs), Whisper Turbo (fast), Moonshine (lightweight), and more.

Yes. Every transcript exports as SRT or VTT — works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

Most speech to text jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.

speech to text accepts 20+ formats — MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, and more. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files submitted to speech to text are processed and deleted by default. Pro plans add client-side encryption — even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor — all usable for speech to text workflows. Free API tier includes 100 minutes/month.

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.

Free AI Basa menyang Teks

Model swara dadi teks

STT.ai-жылы пайда болгон.

1. Unggah, Record, utawa Tepek URL

2. AI Transkrip karo pilihan Model

3. Eksport, Bagi, utawa Ngagabung

Kasus Pangguna Populer

Sekabeh sing sampeyan butuhake kanggo Audio & Video

API Pangumbang-Pinakawan

Sederhana, transparan

Basa sing didukung

Siap kanggo ngrekam?

Takon-takon sing asring diajukake

How does speech to text work on STT.ai?

Is speech to text free?

How accurate is speech to text?

What AI models can I use for speech to text?

Can I get subtitles from speech to text?

Does speech to text detect different speakers?

How long does speech to text take?

What input formats does speech to text support?

Is my audio private when I use speech to text?

Is there a speech to text API?

Can I edit a speech to text transcript after?

How do I share what speech to text produces?

What other platforms work beyond speech to text?