zlacker

1. slim+(OP) 2024-05-23 06:16:59
So how would the process of training a speaking AI go? Would you input the actor's voice samples and subtitles from a movie, then train it until the output is similar enough to the actor's voice from the movie?
replies(2): >>slim+i >>numpad+b7
2. slim+i 2024-05-23 06:19:34
>>slim+(OP)
What test data would they use?
3. numpad+b7 2024-05-23 07:17:51
>>slim+(OP)
Just a couple of minutes of data, put through 10-20 minutes of training with RVC WebUI[1] on the included base model and then loaded into VC Client[2], gets you 90% of the way there. But that's a nearly year-old method, so I'm sure OAI has its own completely novel architecture for an extra 5% fidelity.

1: https://github.com/RVC-Project/Retrieval-based-Voice-Convers...

2: https://github.com/w-okada/voice-changer
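
A rough sketch of the data-prep side, assuming you slice those few minutes of audio into short clips and hold a few back so you can compare the converted output against recordings the model never saw. The 10-second clip length, the 10% hold-out, and the function names are illustrative assumptions, not anything RVC WebUI or VC Client prescribes, and the code only plans clip boundaries without touching audio files:

```python
import random


def plan_clips(total_seconds, clip_seconds=10.0):
    """Return (start, end) boundaries, in seconds, for cutting a recording into clips."""
    clips = []
    start = 0.0
    while start < total_seconds:
        end = min(start + clip_seconds, total_seconds)
        clips.append((start, end))
        start = end
    return clips


def split_clips(clips, held_out_fraction=0.1, seed=0):
    """Shuffle clips reproducibly and hold back a fraction as test data."""
    shuffled = list(clips)
    random.Random(seed).shuffle(shuffled)
    # Always hold out at least one clip so there is something to test on.
    n_test = max(1, int(len(shuffled) * held_out_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical input: three minutes of clean speech from the target voice.
clips = plan_clips(total_seconds=180.0)
train, test = split_clips(clips)
print(f"{len(train)} clips for training, {len(test)} held out for testing")
```

The held-out clips never go into training, so listening to the model's output next to them gives a fairer check than reusing training audio.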
