zlacker

[return to "My AI skeptic friends are all nuts"]
1. retrac+J[view] [source] 2025-06-02 21:16:59
>>tablet+(OP)
Machine translation and speech recognition. The state of the art for these is a multi-modal language model. I'm hearing impaired veering on deaf, and I use this technology all day every day. I wanted to watch an old TV series from the 1980s. There are no subtitles available. So I fed the show into a language model (Whisper) and now I have passable subtitles that allow me to watch the show.

Am I the only one who remembers when that was the stuff of science fiction? It was not so long ago an open question if machines would ever be able to transcribe speech in a useful way. How quickly we become numb to the magic.

◧◩
2. anothe+jg[view] [source] 2025-06-02 22:47:35
>>retrac+J
Using AI to generate subtitles is inventive. Is it smart enough to insert the time codes such that the subtitle is well enough synchronised to the spoken line?

As someone who has started losing the higher frequencies and thus clarity, I have subtitles on all the time just so I don't miss dialogue. The only pain point is when the subtitles (of the same language) are not word-for-word with the spoken line. The discordance between what you are reading and hearing is really distracting.

This is my major peeve with my The West Wing DVDs, where the subtitles are often an abridgement of the spoken line.

[go to top]