Best open source realtime tts?

There's actually a version that runs 100% locally... In your browser. It even works on mobile. The model is very small (only 82 million parameters), so running it 100% in the browser isn't a big deal.

[-]

sherlockAI@reddit

Here's a batch implementation of Kokoro for interested folks. We wanted to run it on-device but should help in any deployment. Takes about 400MB RAM if using int8 quantized version. Honestly, don't see much difference in fp32 vs int8.

https://www.nimbleedge.com/blog/how-to-run-kokoro-tts-model-on-device

[-]

GrayPsyche@reddit

can you train voices for it

[-]

g14loops@reddit

No, they ddin't public their training code.

[-]

pingwin@reddit

I run https://github.com/remsky/Kokoro-FastAPI at home, it usually eats around 2.5G VRAM.

[-]

plurch@reddit

Here are some other repos in the same neighborhood as kokoro

[-]

Osama_Saba@reddit

How does it vrams?

[-]

NAKOOT@reddit

IndexTTS, even works with 6GB VRAM and it's really easy to use.

[-]

Original_Finding2212@reddit

We ported KokoroTTS to Jetson-containers and it takes a few hundred MB RAM.. I think 300-600?

But you need one that supports working in stream or small chunks. There are other, bigger models with better voice.

[-]

YearnMar10@reddit

It takes me on jetson 3gig once everything is loaded… which container are you using?

[-]

Original_Finding2212@reddit

Use jetson-containers repo (disclaimer: I joined as a maintainer there). It completely changes how we work on jetson.

It supports old models as well!

[-]