zlacker

[return to "Statement from Scarlett Johansson on the OpenAI "Sky" voice"]
1. HarHar+mg[view] [source] 2024-05-21 00:01:25
>>mjcl+(OP)
I found the whole ChatGPT-4o demo to be cringe inducing. The fact that Altman was explicitly, and desperately, trying to copy "her" at least makes it understandable why he didn't veto the bimbo persona - it's actually what he wanted. Great call by Scarlett Johansson in not wanting to be any part of it.

One thing these trained voices make clear is that it's a tts engine generating ChatGPT-4o's speech, same as before. The whole omni-modal spin suggesting that the model is natively consuming and generating speech appears to be bunk.

◧◩
2. leumon+Zk[view] [source] 2024-05-21 00:31:35
>>HarHar+mg
I think it is more then a simple tts engine. At least from the demo, they showed: It can control the speed and it can sing when requested. Maybe its still a seperate speech engine, but more closely connected to the llm.
◧◩◪
3. kromem+KB[view] [source] 2024-05-21 02:56:42
>>leumon+Zk
Most impressive was the incredulity to the 'okay' during the counting demo after the nth interruption.

Was quickly apparent that text only is a poor medium for the variety and scope of signals that could be communicated by these multimodal networks.

[go to top]