AI Speech to Text

Transcribe audio and video to text in 100+ languages with 10+ AI models. Speaker detection. No signup required.

9.3K transcriptions
235.9K minutes transcribed
100+ languages
70+ free tools

Works with publicly available audio and video. DRM-protected content is not supported.

Supported uploads: MP3, WAV, M4A, FLAC, MP4, MKV, MOV, WebM, up to 2 GB.
Public link: 24 hours, text only · Sign up for 7 days plus audio · Upgrade for private links

Real-time speech to text. The AI corrects itself as you speak, and accuracy improves as the speech gets longer.


10 free minutes, or 600 free minutes per month with signup. No credit card. Encrypted.
Sign up free →
Zero-knowledge encryption: your transcripts are encrypted in your browser, and we cannot train on them. Learn how it works →

Trusted by professionals worldwide

How STT.ai works

Three steps to accurate transcription

1. Upload, Record, or Paste a URL

Drag and drop an audio or video file (MP3, WAV, MP4, and 20+ formats). Record from your microphone in real time. Or paste a link from YouTube, Vimeo, TikTok, and 1,300+ platforms.
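
Before uploading, a client can check a file locally against the limits stated on this page. The sketch below is one way to do that, not part of STT.ai's SDK: `check_upload`, `SUPPORTED_EXTENSIONS`, and `MAX_UPLOAD_BYTES` are hypothetical names, the extension set covers only the formats named here (the service lists 20+), and reading "2GB" as binary gigabytes is an assumption.

```python
from pathlib import Path

# Extensions named on this page; the service lists 20+ formats, so this set is partial.
SUPPORTED_EXTENSIONS = {
    ".mp3", ".wav", ".m4a", ".flac", ".ogg",
    ".mp4", ".mkv", ".mov", ".webm", ".avi",
}
# "Up to 2GB" per the upload widget; binary gigabytes are an assumption.
MAX_UPLOAD_BYTES = 2 * 1024**3


def check_upload(name: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks uploadable."""
    problems = []
    ext = Path(name).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported extension: {ext or '(none)'}")
    if size_bytes > MAX_UPLOAD_BYTES:
        problems.append("file is larger than 2 GB")
    return problems


print(check_upload("meeting.MP3", 50_000_000))  # prints []
print(check_upload("notes.txt", 1_000))         # prints ['unsupported extension: .txt']
```

Returning a list of problems rather than raising lets a caller show every issue at once.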

2. AI transcribes with your choice of model

Choose from 10+ AI models including Whisper, NVIDIA Canary (#1 accuracy), and Moonshine. The language is auto-detected from 100+ options. Speaker detection identifies who said what.

3. Export, Share, or Integrate

Download as TXT, SRT, VTT, DOCX, JSON, or PDF. Share via link. Use our API to integrate transcription into your app. Ideal for subtitles, meeting notes, podcasts, and more.
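
To illustrate how the JSON and SRT exports relate, here is a minimal sketch that turns transcript segments into SRT cues. It is not STT.ai's exporter: the `start` and `end` keys (in seconds) are assumed field names, and the sample segments are made up.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used by SRT."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: list[dict]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        start = srt_timestamp(seg["start"])  # "start"/"end" are assumed keys
        end = srt_timestamp(seg["end"])
        cues.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(cues)


# Made-up sample data, not real service output.
sample = [
    {"start": 0.0, "end": 2.5, "text": "Welcome to the meeting."},
    {"start": 2.5, "end": 5.04, "text": "Let's get started."},
]
print(segments_to_srt(sample))
```

VTT uses nearly the same cue layout, with a `WEBVTT` header line and a period instead of a comma before the milliseconds.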

100+ supported languages
70+ free tools
1,300+ supported platforms
7 export formats

Developer-first API

Integrate speech to text into your app in minutes. RESTful API with real-time WebSocket streaming.

REST + WebSocket: file upload and real-time streaming
Multiple models: Whisper, Canary, Enhanced, and more
Speaker diarization: automatically detects who said what
Output formats: JSON, TXT, SRT, and VTT with word-level timestamps
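import os

import requests

# Read the key from the environment (the variable name is an assumption).
API_KEY = os.environ["STT_API_KEY"]

# Upload the file with the transcription options; the handle closes afterwards.
with open("meeting.mp3", "rb") as audio:
    response = requests.post(
        "https://api.stt.ai/v1/transcribe",
        headers={"Authorization": f"Bearer {API_KEY}"},
        files={"file": audio},
        data={
            "model": "large-v3-turbo",
            "language": "auto",
            "diarize": "true",
            "response_format": "json",
        },
        timeout=300,
    )
response.raise_for_status()  # fail early on HTTP errors

# Print each segment with its speaker label.
result = response.json()
for seg in result["segments"]:
    print(f"{seg['speaker']}: {seg['text']}")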
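
Diarized output often contains several short segments in a row from the same speaker. As a possible follow-up to the request above, this sketch merges consecutive segments into speaker turns. It assumes only the `speaker` and `text` keys used in the loop above; `merge_speaker_turns` is a hypothetical helper and the sample data is made up.

```python
def merge_speaker_turns(segments: list[dict]) -> list[dict]:
    """Join consecutive segments from the same speaker into one turn."""
    turns = []
    for seg in segments:
        text = seg["text"].strip()
        if turns and turns[-1]["speaker"] == seg["speaker"]:
            # Same speaker as the previous segment: extend the current turn.
            turns[-1]["text"] += " " + text
        else:
            # New speaker: start a new turn.
            turns.append({"speaker": seg["speaker"], "text": text})
    return turns


# Made-up sample data in the shape the loop above expects.
sample = [
    {"speaker": "Speaker 1", "text": "Hi everyone."},
    {"speaker": "Speaker 1", "text": "Thanks for joining."},
    {"speaker": "Speaker 2", "text": "Happy to be here."},
]
for turn in merge_speaker_turns(sample):
    print(f"{turn['speaker']}: {turn['text']}")
```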

Switching from another speech-to-text service?

Ready to transcribe?

Upload your first file. No credit card, no signup. 600 minutes per month on the free plan.

Start transcribing

Frequently asked questions

Speech to text runs in your browser: paste a URL, upload a file, or record from your mic. STT.ai picks the AI model and typically returns the transcript in under 5 minutes. Export as TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Every visitor gets 600 free minutes per month on STT.ai for speech to text. Paid plans start at $5/month and unlock longer files, private transcripts, and priority queueing.

Speech to text runs on the same AI models as the rest of STT.ai. Our best models reach 95-97% accuracy on clean speech (3-5% Word Error Rate on benchmarks). Switch models on the fly if the first pass is below your target.
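
Word Error Rate is conventionally the word-level edit distance (substitutions, deletions, and insertions) divided by the number of words in the reference transcript, so 3-5% WER means roughly 3 to 5 word errors per 100 reference words. The sketch below implements that standard definition. It is not STT.ai's benchmark code, and the lowercase-and-split normalization is a simplifying assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of six reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
print(f"WER: {wer:.1%}")
```

Comparing a hand-corrected excerpt against each model's output this way is one way to decide whether a first pass meets your target.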

Speech to text can run on any of STT.ai's 10+ models: STT.ai Enhanced (most accurate), Whisper Large V3 (99 languages), NVIDIA Canary (#1 WER on supported languages), Whisper Turbo (fast), Moonshine (lightweight), and more.

Yes. Every transcript exports as SRT or VTT — works with YouTube, Vimeo, TikTok, VLC, and every major video player. The burn-subtitles tool overlays them onto video as hardsubs.

Yes. Speaker diarization automatically labels each voice (Speaker 1, Speaker 2, ...) and you can rename them in the built-in editor. Works across all models and languages.

Most speech to text jobs finish in under 5 minutes. A 1-hour audio file typically completes in 2-3 minutes with our fastest models. Speed depends on chosen model and current GPU load.
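
As a quick check on these figures, processing one hour of audio in 2-3 minutes corresponds to roughly 20-30 times real-time speed. The sketch below does that arithmetic. The helper names are hypothetical, and the estimate assumes processing time scales linearly with audio length, which the page does not state.

```python
def speedup(audio_minutes: float, processing_minutes: float) -> float:
    """How many minutes of audio are processed per minute of compute."""
    return audio_minutes / processing_minutes


def estimate_processing_minutes(audio_minutes: float, factor: float) -> float:
    """Estimated processing time, assuming linear scaling (an assumption)."""
    return audio_minutes / factor


for processing in (2, 3):
    print(f"{processing} min -> {speedup(60, processing):.0f}x real time")

# A 90-minute recording at the slower 20x rate.
print(f"{estimate_processing_minutes(90, 20):.1f} min")
```

Queueing and GPU load add to these numbers, so treat them as a lower bound.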

Speech to text accepts 20+ formats, including MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, and AVI. Output to TXT, SRT, VTT, DOCX, JSON, or PDF.

Yes. Audio files submitted for speech to text are processed and then deleted by default. Pro plans add client-side encryption: even if STT.ai's database is breached, your transcripts are unreadable without your key. Data is never used for model training without explicit opt-in.

Yes. STT.ai offers a REST API with Python and Node.js SDKs, plus an MCP server for Claude and Cursor. The free API tier includes 100 minutes/month.

Yes. Every transcript opens in the built-in editor where you can correct words, rename speakers, adjust timestamps, and add notes. All changes save automatically.

Every transcript gets a unique shareable URL. Export to DOCX or PDF for email. Pro plans add password-protected and permanent links — useful for client work.

STT.ai handles 1,300+ platforms including YouTube, Vimeo, TikTok, SoundCloud, Zoom, Google Meet, podcast hosts, and more. URL transcription works with publicly-available content only — DRM-protected sources can't be transcribed.