Transcribe with SenseVoice
5.5%
WER
50
Languages
50.0x
Speed
MIT
License
About SenseVoice
Model Info
- ProviderFunAudioLLM
- Architecture-
- LicenseMIT
- UpdatedMar 2026
E pā ana ngā pātai
Ko te SenseVoice he tauira kōrero-ki-tuhi e FunAudioLLM. E noho ana te STT.ai i te SenseVoice i runga i tātau hanganga GPU kia taea ai e koe te whakamahi me te kore e whakarato i tō tātau ake pūrere — whakaata audio, video rānei me te kōwhiri i te SenseVoice mai i te tauira kōwhiria.
I runga i ngā tohutoro paerewa, e tae ana te SenseVoice ki te 5.5% Wāhi hapa Wā. E whakawhirinaki ana te tika o te ao tūturu ki te āhuatanga oro, ki te āhua, me te reo; mō ngā pūkete mārō, whakahua rānei, e tūmanako ana ki ētahi wāhanga ōrautanga tiketike ake i te WER.
E haere ana te SenseVoice i runga i te taumata wātea wātea o te STT.ai — ka whiwhi te kaiwhaiwhai i te 600 minu/whā kāore i te utu. Ka tāpiri ngā mahere utu i ngā tepe-whakahaua, ngā tāruatanga tūmataiti, me te whakarārangitanga o te arotahi.
Ka tukua te SenseVoice i raro i te MIT, he whakaaetanga pūtake tūwhera. Ka taea e koe te whakawhiwhi i te SenseVoice ki o koe ake ngā pūrere, te whakamahi rānei i tātau putanga whakawhiwhia — he whai hua rānei.
E tautoko ana te SenseVoice i ngā reo 50. Ka kōwhiria e te kite-māori te reo tika mō te nuinga o te oro; Ka taea hoki e koe te whakapūtā i te ringa mō tētahi whakahau tika iti.
SenseVoice e mahi ana i te oro i te wā tūturu 50.0x i runga i a tātau GPUs. Ka oti te pūkete oro 1-ora i raro i te 1 minu; he roa ake te raupapa o ngā pūkete, ā, ka whakamōhiotia mā te imeli ina oti.
He tika ake ngā tauira nui ake, engari he pōturi ake; Ko te STT.ai e noho ana i te SenseVoice i runga i te GPU kia kore ai te tatau parameter e pā ki tōna mahi taha o te kaiuru.
E whakaae ana te SenseVoice ki ia momo STT.ai e tautoko ana — MP3, WAV, M4A, FLAC, OGG, MP4, MKV, MOV, WebM, AVI, me ētahi atu. Ko te huaputa he TXT, SRT, VTT, DOCX, JSON, PDF rānei.
He. Ka haere te whakahua kōrero ki te SenseVoice mō ia whakahua - ka whakahuatia ia kaikōrero, ā, ka taea e koe te whakahōu i a rātou i te kaiwhakatika i muri iho.
He. SenseVoice e haere ana i roto i tātau taiao whakahaere - ka tukatuka, ka whakakoretia te oro, ā, kāore anō kia whakamahia mō te whakaakoranga me te kore kōwhiringa mārama. Ka tāpiri ngā mahere Pro ki te whakapūtātanga o te taha o te kaiuru mō ngā tāruatanga i te wā e noho ana.
Ka whakamahia te utauta o te whakataurite-stt hei whakahaere i te SenseVoice ki ētahi atu tauira tautoko i runga i te oro ōrite — ka kitea e koe te WER, te tatau wāhanga, ngā tohu kaikōrero, me ngā pūkete ātete. Ko te whakataurite SenseVoice vs Whisper Large V3 te tino pūnoa.
He. Ka whakapūtātia "sensevoice" hei tātai tauira i runga i te /v1/transcribe te wāhi mutunga. Ko ngā SDKs Python me Node.js kei roto ko ngā tauira SenseVoice. Kei roto i te taumata API wātea ko te 100 minu/whā.
He. Nā te mea he SenseVoice te MIT-licensed, ka taea e koe te whakanoho i a ia. Ko te STT.ai te pou pūtake tūwhera e whakarārangi ana i te pūnaha me ngā taumahatanga. Ko te nuinga o ngā rōpū whakanao e whakamahi ana i tātau putanga whakanoho hei whakarerekē i te GPU, i ngā whakarerekētanga tauira, me ngā taumahi.