I used RVC V2 to make an AI model of a character whose voice is raspy, has a lot of voice cracks and has a very "shouty" voice. I used a dataset which was 70% of voice clips of him yelling and/or talking loudly and the rest were of him talking normally. The dataset was decent quality, I did have to use background music remover software on some parts but it's overall decent.
The thing is, the model doesn't sound ANYTHING like the character. For some reason it's way too soft spoken, and even when it's supposed to be yelling or screaming it sounds kinda like he's whispering. The AI's neutral voice does sound like him but it's missing his voice cracks and voice raspiness. Is there any way I can mimic it?
[link] [comments]