Hi Reddit.
I have roughly 4,000 WAV files that I want to use to create a TTS model (if anyone is curious, the dataset is from here: https://mtc.ethz.ch/publications/open-source/swiss-dial.html).
My idea was to take all of these files and merge them into a .pth file (similar to the ones found on this Hugging Face page: https://huggingface.co/QuickWick/Music-AI-Voices/tree/main) that I can just plug into a local TTS project and run.
I have seen a lot of Google Colab projects for voice cloning, but I couldn't get any of them to actually produce the model.
How can I make that?