Question about making your own voice models

Hi Reddit.

I have a, give or take, 4000 wav files I wanna use to create a TTS model (if anyone is curious, the dataset is from here: https://mtc.ethz.ch/publications/open-source/swiss-dial.html).

My idea was to take all of these files and merge them into a .tph file (similar to the ones found in this Hugging Face page: https://huggingface.co/QuickWick/Music-AI-Voices/tree/main) that I can just plug into a local TTS project and just run it.ç
I have seen a lot of Google Collab project to voice clone however I couldn't get it working on making the model.

How can I make that?

submitted by /u/ARacoonOnInternet
[link] [comments]