Seeking a Local/Offline Speech-to-Text with System-Wide 'Type Anywhere' Dictation
Posted by bilalazhar72@reddit | LocalLLaMA | 9 comments
[PLEASE READ BEFORE ANSWERING TO PREVENT IRRELEVANT SUGGESTIONS FOR ME.]
I'm looking to improve my workflow on Linux and am searching for a specific type of speech-to-text application to run locally on my laptop.
My requirements are:
- 100% Local & Offline: All audio processing must happen on my own machine.
- High Accuracy: Quality should be on par with a good Whisper model. I'm not interested in older models like VOSK, as their accuracy doesn't meet my needs.
- Key Use Cases: My main goals are to dictate notes directly into my "Second Brain" style notes app and to send longer prompts to Large Language Model interfaces without breaking my flow.
- System-Wide Integration: This is the most crucial part. I want to press a hotkey and dictate directly into any active application (my code editor, a browser, a document, etc.).
For context, I use Speechnotes all the time because it supports models like tiny-faster-whisper, which is very fast and works perfectly for my use case. The problem is purely its workflow: having to transcribe in one window and then constantly copy-paste the text is exactly the process I want to eliminate.
My goal is to find a seamless solution that works like native OS dictation but is powered by modern, local models. Many Whisper UIs I've found are excellent but seem to have the same limitation. The paid options are too expensive for what they are, which is why I'm focused on finding a great offline version.
Does a tool like this exist for Linux? What are you all using to achieve this kind of workflow?
Thanks for any help!
Professional-Crab234@reddit
... and the exact solution is here:
VOXD - a voice-typing / dictation app for Linux
"Out of the box" it sets you up with offline, LOCAL voice transcription, and even LOCAL AI rewriting according to your custom pre-made prompts.
Works on CPU. No GPU required. Decent speed on my Ryzen 5800. I have wired it up on whisper.cpp, llama.cpp and ydotool (btw, tribute to all those efforts)
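For anyone curious how a whisper.cpp + ydotool chain can fit together, here is a rough sketch. It is not VOXD's actual code, just one way the pieces could be glued: record a short clip with `arecord`, transcribe it with the whisper.cpp CLI, then type the result into the focused window with `ydotool type`. The binary name `whisper-cli`, the model path, and the helper names (`record_cmd`, `transcribe_cmd`, `clean_transcript`, `type_cmd`, `dictate`) are assumptions for illustration, and `ydotool` generally needs its `ydotoold` daemon running.

```python
import re
import subprocess
from typing import Callable, List

# Assumed names/paths: adjust to your own whisper.cpp build and model file.
WHISPER_BIN = "whisper-cli"
MODEL_PATH = "models/ggml-base.bin"


def record_cmd(wav_path: str, seconds: int) -> List[str]:
    """arecord call for 16 kHz, mono, 16-bit WAV (the input whisper.cpp expects)."""
    return ["arecord", "-q", "-f", "S16_LE", "-r", "16000", "-c", "1",
            "-d", str(seconds), wav_path]


def transcribe_cmd(wav_path: str) -> List[str]:
    """whisper.cpp CLI call: -nt drops timestamps, -np suppresses extra prints."""
    return [WHISPER_BIN, "-m", MODEL_PATH, "-f", wav_path, "-nt", "-np"]


def clean_transcript(raw: str) -> str:
    """Drop bracketed non-speech tags and collapse whitespace to single spaces."""
    without_tags = re.sub(r"\[[^\]]*\]", " ", raw)
    return " ".join(without_tags.split())


def type_cmd(text: str) -> List[str]:
    """ydotool call that types the text into whichever window has focus."""
    return ["ydotool", "type", text]


def dictate(wav_path: str, seconds: int,
            run: Callable = subprocess.run) -> str:
    """Record, transcribe, and type. `run` is injectable so it can be tested."""
    run(record_cmd(wav_path, seconds), check=True)
    result = run(transcribe_cmd(wav_path), check=True,
                 capture_output=True, text=True)
    text = clean_transcript(result.stdout)
    if text:  # do not send an empty string to ydotool
        run(type_cmd(text), check=True)
    return text


if __name__ == "__main__":
    # Bind this script to a desktop hotkey to get "type anywhere" dictation.
    print(dictate("/tmp/dictation.wav", seconds=5))
```

A fixed-length recording keeps the sketch short; a real tool would more likely start and stop recording on the hotkey instead.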
bilalazhar72@reddit (OP)
What model are you using that is fast here? This tool does not support NVIDIA Parakeet, right?
Professional-Crab234@reddit
By default it is Whisper base, but through the model manager inside the app you can get any Whisper model. Since you get blazing fast Whisper, served on whisper.cpp, there is absolutely no need for Parakeet.
bilalazhar72@reddit (OP)
Epicenter Whispering is what I tried today: https://github.com/epicenter-md/epicenter/tree/main/apps/whispering
It can run the NVIDIA Parakeet model (very fast on CPU).
Thanks for pointing me to a good project.
I did the same for you.
DeProgrammer99@reddit
https://github.com/cjpais/Handy
bilalazhar72@reddit (OP)
Do you by any chance use this as well?
- If yes, on which operating system?
It keeps crashing for me.
DeProgrammer99@reddit
I use it on Windows, and I haven't updated it since that thread, so my copy still adds "thank you" to the end of everything it writes, haha. But I never had it crash.
jamaalwakamaal@reddit
https://github.com/savbell/whisper-writer
jamaalwakamaal@reddit
https://github.com/dynamiccreator/whisper-typer-tool