zlacker

[return to "Voxtral Transcribe 2"]
1. XCSme+4z[view] [source] 2026-02-04 17:43:21
>>meetpa+(OP)
Is it me or error rate of 3% is really high?

If you transcribe a minute of conversation, you'll have like 5 words transcribed wrongly. In an hour podcast, that is 300 wrongly transcribed words.

◧◩
2. cootsn+Vz[view] [source] 2026-02-04 17:46:47
>>XCSme+4z
The error rate for human transcription can be as high as 5%.
◧◩◪
3. XCSme+LB[view] [source] 2026-02-04 17:53:03
>>cootsn+Vz
Oh wow, I thought humans are like 0.1% error rate, if they are native speakers and aware of the subject being discussed.
◧◩◪◨
4. rhdunn+1g1[view] [source] 2026-02-04 20:51:55
>>XCSme+LB
It can depend a lot on different factors like:

- familiarity with the accent and/or speaker;

- speed and style/cadence of the speech;

- any other audio that is happening that can muffle or distort the audio;

- etc.

It can also take multiple passes to get a decent transcription.

[go to top]