Transcribe with Whisper Large V3
- WER: 4.2%
- Languages: 99
- Speed: 8.0x
- License: MIT
About Whisper Large V3
Model Info
- Provider: OpenAI
- Architecture: -
- License: MIT
- Updated: Mar 2026
Frequently Asked Questions
Whisper Large V3 is a speech-to-text model developed by OpenAI. STT.ai hosts Whisper Large V3 on GPU infrastructure so you can use it without provisioning your own hardware: upload audio or video and pick Whisper Large V3 from the model selector.
On standard benchmarks, Whisper Large V3 achieves a Word Error Rate (WER) of about 4.2%. Real-world accuracy depends on audio quality, accent, and subject matter; for noisy or heavily accented audio, expect a WER a few percentage points higher.
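For readers unfamiliar with the metric, here is a minimal sketch of how WER is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of words in the reference transcript. The function name and the simple lowercase/whitespace normalization are illustrative assumptions; benchmark suites usually apply more elaborate text normalization, so this will not reproduce published figures exactly.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    # Assumption: lowercase + whitespace split is enough normalization for a demo.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] = edit distance between the reference words processed so far and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One substituted word out of four reference words
print(word_error_rate("the quick brown fox", "the quick brown box"))  # 0.25
```

A WER of 4.2% therefore corresponds to roughly four word-level errors per hundred reference words.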
Whisper Large V3 runs on STT.ai's free tier: every visitor gets 600 minutes per month at no cost. Paid plans add higher per-file limits, private transcripts, and priority processing.
Whisper Large V3 is released under the MIT license, a permissive open-source license. You can self-host Whisper Large V3 on your own hardware or use our hosted version; both are cleared for commercial use.
Whisper Large V3 supports 99 languages. Auto-detection picks the correct language for most audio; you can also set the language manually for a small accuracy gain.
Whisper Large V3 processes audio at about 8.0x real time on our GPUs. A 1-hour audio file finishes in about 7.5 minutes; for longer files, you receive an email notification when processing is done.
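The speed figure converts to a processing-time estimate by simple division: audio duration divided by the real-time factor. The helper below is a hypothetical illustration, not part of any STT.ai SDK, and it ignores upload time and queueing.

```python
def estimated_processing_minutes(audio_minutes: float, speed_factor: float = 8.0) -> float:
    """Estimate processing time for audio transcribed at `speed_factor` x real time."""
    if speed_factor <= 0:
        raise ValueError("speed_factor must be positive")
    return audio_minutes / speed_factor


print(estimated_processing_minutes(60))   # 7.5
print(estimated_processing_minutes(180))  # 22.5
```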
Whisper Large V3 has 1.55B parameters. Larger models tend to be more accurate but slower; STT.ai hosts Whisper Large V3 on GPUs, so the parameter count does not affect performance on your side.
Whisper Large V3 accepts any format STT.ai supports: MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, and others. Export results as TXT, SRT, VTT, DOCX, JSON, or PDF.
Yes. Speaker diarization runs alongside Whisper Large V3 on every transcript: each speaker is labeled, and you can rename the speakers in the editor afterwards.
Yes. Whisper Large V3 runs in our managed environment. By default, audio stays under your control, can be deleted, and is never used for training without consent. Pro plans add client-side encryption for stored transcripts.
Use the compare-stt tool to run Whisper Large V3 against any of the other supported models on the same audio; you will see WER, segment counts, speaker labels, and confidence scores side by side. The Whisper Large V3 vs Whisper Large V3 comparison is the most common one.
Yes. Select "whisper-large-v3" as the model parameter on the /v1/transcribe endpoint. The Python and Node.js SDKs include Whisper Large V3 examples. The free API tier includes 100 minutes per month.
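As a rough sketch of what such a call might look like over plain HTTP, the snippet below builds (but does not send) a multipart POST to /v1/transcribe with the model field set to "whisper-large-v3". Only the endpoint path and the model identifier come from the text above. The base URL, the bearer-token header, and the `model` and `file` form-field names are assumptions made for illustration; check the official API documentation or SDK examples for the real values.

```python
import urllib.request
import uuid

# Hypothetical placeholder; replace with the base URL from the API documentation.
BASE_URL = "https://api.example.invalid"


def build_transcribe_request(
    audio: bytes,
    filename: str,
    api_key: str,
    base_url: str = BASE_URL,
    model: str = "whisper-large-v3",
) -> urllib.request.Request:
    """Build a multipart/form-data POST for the /v1/transcribe endpoint."""
    boundary = uuid.uuid4().hex
    body = b"".join([
        # Assumed field name "model" carrying the model identifier.
        f'--{boundary}\r\nContent-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n".encode(),
        # Assumed field name "file" carrying the raw audio bytes.
        f'--{boundary}\r\nContent-Disposition: form-data; name="file"; '
        f'filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n".encode(),
        audio,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    return urllib.request.Request(
        url=f"{base_url}/v1/transcribe",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


request = build_transcribe_request(b"fake-audio-bytes", "meeting.mp3", api_key="YOUR_API_KEY")
print(request.get_method(), request.full_url)
```

Once the real base URL and credentials are substituted, the request could be sent with `urllib.request.urlopen(request)`; the SDKs mentioned above are likely the more convenient route.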
Absolutely. Because Whisper Large V3 is MIT-licensed, you can self-host it. STT.ai's open-source page lists the project repo and weights. Most production teams use our hosted version to skip GPU procurement, model updates, and ops.