Zonos-v0.1 beta by Zyphra, featuring two expressive and real-time text-to-speech (TTS) models with high-fidelity voice cloning. 1.6B transformer and 1.6B hybrid under an Apache 2.0 license.

Posted by Xhehab_@reddit | LocalLLaMA | 146 comments

> "Today, we're excited to announce a beta release of Zonos, a highly expressive TTS model with high fidelity voice cloning.
>
> We release both transformer and SSM-hybrid models under an Apache 2.0 license.
>
> Zonos performs well vs leading TTS providers in quality and expressiveness."

**Details**: [https://www.zyphra.com/post/beta-release-of-zonos-v0-1](https://www.zyphra.com/post/beta-release-of-zonos-v0-1)

**Get the weights on Huggingface**: [http://huggingface.co/Zyphra/Zonos-v0.1-hybrid](http://huggingface.co/Zyphra/Zonos-v0.1-hybrid) and [http://huggingface.co/Zyphra/Zonos-v0.1-transformer](http://huggingface.co/Zyphra/Zonos-v0.1-transformer)

Download the inference code: [http://github.com/Zyphra/Zonos](http://github.com/Zyphra/Zonos)