MOSS-TTS-Nano: a 0.1B open-source multilingual TTS model that runs on 4-core CPU and supports realtime speech generation

Posted by TimeEnvironmental219@reddit | LocalLLaMA

We just open-sourced MOSS-TTS-Nano, a tiny multilingual speech generation model from MOSI.AI and the OpenMOSS team.

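Some highlights, as in the title: a 0.1B model, multilingual, runs on a 4-core CPU, and supports realtime speech generation.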

The project is aimed at practical TTS deployment: small footprint, low latency, and easy local setup for demos, lightweight services, and product integration.
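If you want to check the realtime claim on your own machine, one common way is the real-time factor (RTF): wall-clock synthesis time divided by the duration of the audio produced, where a value below 1.0 means faster than realtime. Below is a minimal sketch of that measurement. The `synthesize` callable, the `measure_rtf` helper, and the 24000 Hz sample rate are placeholders for illustration, not the MOSS-TTS-Nano API; swap in whatever the repo actually exposes.

```python
import time
from typing import Callable


def real_time_factor(elapsed_seconds: float, num_samples: int, sample_rate: int) -> float:
    """Synthesis time divided by audio duration; below 1.0 is faster than realtime."""
    if num_samples <= 0 or sample_rate <= 0:
        raise ValueError("num_samples and sample_rate must be positive")
    audio_seconds = num_samples / sample_rate
    return elapsed_seconds / audio_seconds


def measure_rtf(synthesize: Callable[[str], int], text: str, sample_rate: int) -> float:
    """Time one call to a synthesizer and return its RTF.

    `synthesize` is a hypothetical callable that takes text and returns the
    number of audio samples it generated. It stands in for a real TTS call.
    """
    start = time.perf_counter()
    num_samples = synthesize(text)
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, num_samples, sample_rate)


if __name__ == "__main__":
    # Dummy synthesizer: pretends to produce one second of audio at 24 kHz.
    def fake_synthesize(text: str) -> int:
        time.sleep(0.01)
        return 24000

    rtf = measure_rtf(fake_synthesize, "hello world", sample_rate=24000)
    print(f"RTF: {rtf:.3f}")
```

For a fair number, run a warm-up call first and average over several sentences of different lengths, since the first call often includes model loading.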

GitHub:
https://github.com/OpenMOSS/MOSS-TTS-Nano

Hugging Face:
https://huggingface.co/spaces/OpenMOSS-Team/MOSS-TTS-Nano

Online demo:
https://openmoss.github.io/MOSS-TTS-Nano-Demo/

Would love to hear feedback on quality, latency, and what use cases you’d want to try with a tiny open TTS model.