zlacker

[return to "Statement from Scarlett Johansson on the OpenAI "Sky" voice"]
1. HarHar+mg[view] [source] 2024-05-21 00:01:25
>>mjcl+(OP)
I found the whole ChatGPT-4o demo to be cringe inducing. The fact that Altman was explicitly, and desperately, trying to copy "her" at least makes it understandable why he didn't veto the bimbo persona - it's actually what he wanted. Great call by Scarlett Johansson in not wanting to be any part of it.

One thing these trained voices make clear is that it's a tts engine generating ChatGPT-4o's speech, same as before. The whole omni-modal spin suggesting that the model is natively consuming and generating speech appears to be bunk.

◧◩
2. leumon+Zk[view] [source] 2024-05-21 00:31:35
>>HarHar+mg
I think it is more then a simple tts engine. At least from the demo, they showed: It can control the speed and it can sing when requested. Maybe its still a seperate speech engine, but more closely connected to the llm.
◧◩◪
3. nabaki+bE[view] [source] 2024-05-21 03:19:24
>>leumon+Zk
Azure Speech tts is capable of doing this with SSML. I wouldn't be surprised if it's what OpenAI is using on the backend.
[go to top]