Local TTS LLM in your browser?
Posted by DinUXasourus@reddit | LocalLLaMA | View on Reddit | 9 comments
I need to have web pages read to me due to dyslexia, and the "Read Aloud" voices all sound kinda shit. I'd love to have something where I can just select some text, right click/press to make my 4070 sweat and get something with a bit more soul.
Content-External1855@reddit
You have access to Pipertts in Read Aloud, they got some good voices. The problem with local tts is mainly the time it takes to generate, but piper's really fast and the voices are ranging between bad to pretty good.
Alternative if you know your way around python you can use Sherpa-onnx, which allow for a wider selection of voices with higher quality. But you'd have to do combine the python script with something like Autohotkey (if on windows) to easily pass data from your clipboard to the python script.
You could hook it up by setting a shortcut like alt+1 in a Autohotkey script that preforms a copy on the selected text, then sends that content to the python script as a argument.
DinUXasourus@reddit (OP)
Now we cookin! Time to redownload AHK
Content-External1855@reddit
glad I could help
DinUXasourus@reddit (OP)
Turns out the Read Aloud plugin has piper voices.
Content-External1855@reddit
yeah, that's what I use for daily tasks. But the browser extension won't let you change the speed when you use piper voices, so for that you need to set up a local solution. onnx also has sample scripts that you can use
ranker2241@reddit
AllTalk sounds great, for the browser setup you prolly need some api stuff and a little code
privacyparachute@reddit
Try https://papeg.ai/
You can also select a portion of the text and play only that part.
To change voices:
- click (+) and add the speaker AI
- select the speaker AI
- Make the chat input area bigger by dragging it up. Seven buttons to quickly switch voices will be revealed.
(or select a voice from the dropdown under Settings)
Microbzz@reddit
You may want to look into https://github.com/matatonic/openedai-speech, OpenWebUI docs also have a section on integrating with it
leelweenee@reddit
MS Edge browser