XTTS voice cloning with only a few seconds of audio

  • With Coqui.ai shutting down its studio and API, it looks like they are reinvesting their efforts into their XTTS models and framework. A recent update to their GitHub also adds a no-code Gradio UI for fine-tuning and running inference locally. https://github.com/coqui-ai/TTS/releases/tag/v0.21.3
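
For anyone who would rather script it than use the Gradio UI, here is a rough sketch of cloning a voice from a short reference clip with the `TTS` Python package. It assumes the package is installed (`pip install TTS`) and that the XTTS v2 model name below matches what your installed version lists. The helper names `build_request` and `clone_voice`, and the file names, are made up for illustration.

```python
from pathlib import Path

# XTTS v2 model name as listed in the Coqui TTS model zoo
# (assumption: check the model list of your installed version).
XTTS_MODEL = "tts_models/multilingual/multi-dataset/xtts_v2"


def build_request(text, speaker_wav, language="en", out_path="cloned.wav"):
    """Validate inputs and return keyword arguments for TTS.tts_to_file().

    Hypothetical helper, not part of the TTS package.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "text": text,
        "speaker_wav": str(Path(speaker_wav)),  # short clip of the target voice
        "language": language,
        "file_path": str(Path(out_path)),
    }


def clone_voice(text, speaker_wav, language="en", out_path="cloned.wav"):
    """Synthesize `text` in the voice of `speaker_wav` and write a WAV file."""
    # Imported lazily so build_request() works without the TTS package installed.
    from TTS.api import TTS

    tts = TTS(XTTS_MODEL)  # downloads the model weights on first use
    tts.tts_to_file(**build_request(text, speaker_wav, language, out_path))
    return out_path
```

Calling `clone_voice("Hello from a cloned voice.", "reference.wav")` should write `cloned.wav` to the current directory, where `reference.wav` is a placeholder for your own few-second recording of the target speaker. Quality will depend on how clean that clip is.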