Any methods for achieving close to elevenlabs quality and inference speed with a local model?
My upcoming master thesis (on use of AI models for believable NPCs in video games) will involve the use of text to speech. Currently elevenlabs has been my go to for TTS, but the pricing model is quite inconvenient since its a monthly subscription inst…