Is there a FOSS machine learning voice cloning tool for TTS?

Basically, I'd like to try cloning my own voice for use with text-to-speech. Instead of paying to use a web-based service, I was wondering if I can basically download a model, train it with my own voice and then generate speech via text with it?

I'm pretty knowledgeable with computers being in IT but I'm entirely inexperienced in machine learning and AI other than using pre-made services like ChatGPT, Google Bard, UberDuck TTS, Gigapixel AI, Mid journey, DALL-E and so-on and so-forth.

If there is such a tool like this with good quality (even if I have to provide a LOT of audio to train it, that's ok), could you please point me in the direction I should be looking? If possible to a tutorial as well, hahah. Thank you!

submitted by /u/IDE_IS_LIFE
[link] [comments]