zlacker

[return to "Voxtral Transcribe 2"]
1. mdrzn+bc[view] [source] 2026-02-04 16:03:32
>>meetpa+(OP)
There's no comparison to Whisper Large v3 or other Whisper models..

Is it better? Worse? Why do they only compare to gpt4o mini transcribe?

◧◩
2. GaggiX+7d[view] [source] 2026-02-04 16:07:35
>>mdrzn+bc
Gpt4o mini transcribe is better and actually realtime. Whisper is trained to encode the entire audio (or at least 30s chunks) and then decode it.
◧◩◪
3. mdrzn+Jd[view] [source] 2026-02-04 16:10:28
>>GaggiX+7d
So "gpt4o mini transcribe" is not just whisper v3 under the hood? Btw it's $0.006 / minute

For Whisper API online (with v3 large) I've found "$0.00125 per compute second" which is the cheapest absolute I've ever found.

[go to top]