Free AI Speech to Text

Transcribe audio and video to text in 100+ languages. 10+ AI models. Speaker detection. No signup required.

9.3K transcriptions
235.9K minutes
100+ languages
70+ free tools

Works with publicly available audio and video. DRM-protected content is not supported.

Upgrade to Enhanced
Private transcript
Chat with transcript
Unlock with Pro →
Drop a file here or click to browse
MP3, WAV, M4A, FLAC, MP4, MKV, MOV, WebM (up to 2 GB)
Public link: 24h, text only · Sign up for 7d + audio · Pro for private links

Speech-to-text conversion. The AI corrects itself automatically as you speak, and accuracy improves with longer phrases.

Test your microphone first
❤️ Love STT.ai? Tell your friends!
You have used up your free transcription


10 free min/day · 600 free min with signup · No credit card · Encrypted
Sign up free →
Client-Side Encrypted Storage: your transcripts are encrypted in your browser. Even we cannot read them. Learn how it works →

Trusted by professionals worldwide


Three steps to accurate transcription

1. Upload, Record, or Paste a URL

Drag and drop an audio or video file (MP3, WAV, MP4, and 20+ formats). Record from your microphone in real time. Or paste a link from YouTube, Vimeo, TikTok, and 1,300+ platforms.

2. AI Transcription with Your Choice of Model

Choose from 10+ AI models, including Whisper, NVIDIA Canary (#1 accuracy), and Moonshine. Auto-detect the language from 100+ options. Speaker diarization identifies who said what.

3. Export, Share, or Integrate

Download as TXT, SRT, VTT, DOCX, JSON, or PDF. Share via link. Use our API to integrate transcription into your application. Perfect for subtitles, meeting notes, podcasts, and more.

100+ supported languages
70+ free tools
1,300+ supported platforms
7 export formats

Developer-Friendly API

Integrate speech-to-text into your application in minutes. RESTful API with real-time WebSocket streaming.

REST + WebSocket: file upload and real-time streaming
Multiple models: Whisper, Canary, Enhanced, and more
Speaker diarization: automatically detects who said what
Flexible output: JSON, TXT, SRT, VTT with word-level timestamps
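import os

import requests

# Assumption: the key is read from an environment variable (the name is illustrative).
API_KEY = os.environ["STT_API_KEY"]

# Open the audio in a context manager so the file handle is closed after upload.
with open("meeting.mp3", "rb") as audio:
    response = requests.post(
        "https://api.stt.ai/v1/transcribe",
        headers={"Authorization": f"Bearer {API_KEY}"},
        files={"file": audio},
        data={
            "model": "large-v3-turbo",
            "language": "auto",
            "diarize": "true",
            "response_format": "json",
        },
        timeout=300,  # long files can take a while to process
    )

# Fail loudly on HTTP errors instead of parsing an error body as a transcript.
response.raise_for_status()

result = response.json()
for seg in result["segments"]:
    print(f"{seg['speaker']}: {seg['text']}")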
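
The segments printed by the request example can be tidied up before display. The following is a minimal sketch, assuming each segment is a dict with `speaker` and `text` keys as in that example. The `merge_turns` helper and the `names` mapping are hypothetical illustrations, not part of the STT.ai API.

```python
from itertools import groupby


def merge_turns(segments, names=None):
    """Merge consecutive segments from the same speaker into one turn.

    `segments` is assumed to be a list of dicts with "speaker" and "text"
    keys. `names` optionally maps diarization labels to display names.
    """
    names = names or {}
    turns = []
    # groupby only groups adjacent items, which is what a "turn" means here.
    for speaker, group in groupby(segments, key=lambda seg: seg["speaker"]):
        text = " ".join(seg["text"].strip() for seg in group)
        turns.append({"speaker": names.get(speaker, speaker), "text": text})
    return turns


# Hand-written sample data, not real API output.
sample = [
    {"speaker": "Speaker 1", "text": "Hello everyone."},
    {"speaker": "Speaker 1", "text": "Let's get started."},
    {"speaker": "Speaker 2", "text": "Sounds good."},
]
turns = merge_turns(sample, names={"Speaker 1": "Alice"})
for turn in turns:
    print(f"{turn['speaker']}: {turn['text']}")
```

Labels without an entry in `names` are left unchanged, so a partial mapping is safe.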

Ready to transcribe?

Upload your first file for free. No credit card, no signup. 600 minutes per month on the free plan.

Start Transcribing

Frequently Asked Questions

Speech to text runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Every visitor gets 600 free minutes per month on STT.ai, and speech to text draws on that allowance like any other workflow. Paid plans starting at $5/month unlock longer files, private transcripts, and priority queueing.

Speech to text runs on the same AI models as the rest of STT.ai. Our best models reach 95-97% accuracy on clean speech (a 3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass falls below your target.
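
To make the accuracy figure concrete: Word Error Rate is conventionally the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below is a generic illustration of that definition, not STT.ai's benchmark code. The lowercasing and whitespace tokenization are simplifying assumptions, and the sample sentences are made up.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word dropped (deletion)
                curr[j - 1] + 1,     # extra hypothesis word (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Made-up 20-word reference with a single substituted word.
reference = ("we will ship the new release on friday after the final review "
             "and then update the docs for all users")
hypothesis = reference.replace("friday", "fridays")

wer = word_error_rate(reference, hypothesis)
print(f"WER: {wer:.1%}, accuracy: {1 - wer:.1%}")
```

One wrong word in twenty gives a 5% WER, the lower end of the accuracy range quoted above.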

Speech to text can run on any of STT.ai's 10+ models: STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported languages), Whisper Turbo (fast), Moonshine (lightweight), and more.

Yes. Every transcript exports as SRT or VTT, which work with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.
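
For readers who fetch JSON and want to build subtitles themselves, here is a minimal sketch of the SRT cue layout. It assumes each segment carries `start` and `end` times in seconds alongside `text`. Those two field names are assumptions, since the request example on this page only shows `speaker` and `text`. Both helper functions are hypothetical.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Render segments as SRT cues: index, time range, text, blank line."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}"
        cues.append(f"{index}\n{time_range}\n{seg['text'].strip()}\n")
    # Joining with a newline leaves one blank line between cues.
    return "\n".join(cues)


# Hand-written sample data with assumed "start"/"end" fields (seconds).
sample = [
    {"start": 0.0, "end": 2.5, "text": "Welcome to the meeting."},
    {"start": 2.5, "end": 5.0, "text": "Let's begin."},
]
srt = segments_to_srt(sample)
print(srt)
```

WebVTT uses nearly the same cue layout, but the file starts with a `WEBVTT` header line and timestamps use a period instead of a comma before the milliseconds.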

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

Most speech to text jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.
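
As a rough back-of-the-envelope reading of those numbers, a 1-hour file finishing in 2-3 minutes is about 20x to 30x faster than real time. The sketch below extrapolates that to other durations. Linear scaling with audio length is an assumption here, not something the service guarantees, and `estimate_minutes` is a hypothetical helper.

```python
def estimate_minutes(audio_minutes: float, speedup: float) -> float:
    """Estimated processing time, assuming time scales linearly with length."""
    return audio_minutes / speedup


# Speedups implied by "1 hour in 2-3 minutes".
slow_speedup = 60 / 3  # 20x real time
fast_speedup = 60 / 2  # 30x real time

# Example: a 90-minute recording under the same assumed speedups.
best_case = estimate_minutes(90, fast_speedup)
worst_case = estimate_minutes(90, slow_speedup)
print(f"90 min of audio: roughly {best_case:.1f} to {worst_case:.1f} minutes")
```

Queue time and GPU load are not modeled, so treat the result as a lower bound.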

Speech to text accepts 20+ formats, including MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, and AVI. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files submitted to speech to text are processed and deleted by default. Pro plans add client-side encryption — even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor, all of which can be used for speech to text workflows. The free API tier includes 100 minutes/month.

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.