Model	WER	Speed	زبانیں	بہترین
STT.ai Enhanced	3.2%	160.0x	100	STT.ai's flagship speech-to-text model with best-in-class accuracy and speed. Optimized …
Whisper Large V3	4.2%	8.0x	99	OpenAI's largest and most accurate Whisper model. Excellent multilingual support …
Whisper Turbo	5.1%	32.0x	99	OpenAI's speed-optimized Whisper variant. 4x faster than Large V3 with …
NVIDIA Canary	3.5%	45.0x	4	NVIDIA's multi-task ASR model with top-tier accuracy on English. Built …
Moonshine	7.8%	80.0x	1	Ultra-lightweight ASR model designed for edge devices. Runs on Raspberry …
NVIDIA Parakeet	3.0%	55.0x	1	NVIDIA's CTC-based English ASR model. One of the most accurate …
SenseVoice	5.5%	50.0x	50	Multilingual speech understanding model with emotion recognition and audio event …
Distil-Whisper	5.8%	48.0x	99	Distilled version of Whisper Large V3. 6x faster with 49% …
Vosk	12.0%	100.0x	20	Lightweight offline speech recognition. Works without internet, ideal for privacy-sensitive …

WER (ورڈ ایرل ریٹ) کیا ہے؟

ورڈ ایرر ریٹ (WER) بولنے کی شناخت کی درستگي کو ما ینے کے ليے معياري میٹريک هے یہ ترنسکریپٹ ميں لفظوں کی فیصدي حساب کر تا هے جو ريفرنس سے مختلف هے 5% کا WER کا مطلب هے کہ تقريبا ہر 100 لفظوں ميں سے 5 غلطي هے کم ترين بہتر هے

پیشہ ور انسانی نقل کرنے والے عام طور پر 4-5% کی ایک WER حاصل کرتے ہیں۔ بہترین AI ماڈل اب صاف آڈیو پر انسانی سطح کی دقت سے مطابقت رکھتے ہیں یا اس کے قریب ہوتے ہیں۔

استعمال کرنے کے لئے کون سا ماڈل ہے؟

ہمارا ڈیفالٹ استعمال کریں - Whisper Large V3 Turbo رفتار اور دقت کا بہترین توازن فراہم کرتا ہے. شروع کرنے کے لئے مفت، کوئی سائن اپ کی ضرورت نہیں.

مفت نقل شروع کریں