Local AI Agent Wake words
Posted by betanu701@reddit | LocalLLaMA | View on Reddit | 8 comments
Hey all,
I am working on building a fully capable AI personal assistant that is 100% local. It is going to be a self evolving, Learning AI assistant that will integrate with things like Home assistant. I have it mostly built, still working on testing and getting satellite speakers and displays to work. it is built using the Qwen Family. However, it does not rely 100% on the LLm, there is a 3 layer architecture that essentially captures the intents and will direct things as it comes in with the LLm being the last fall through.
this is the blurb I have " ... transforms a local LLM into an intelligent home assistant that understands who's speaking, adapts to each family member, controls your smart home, and gets smarter every day — all running on your hardware, with zero cloud dependencies."
The question I have, I want to train a new wake word (I know how to train it) but I need actual audio samples of people saying the wake word. Does anyone know of a good place to crowd source people saying it?
Thanks in advance?
btw: I didn't post the link to the repo because right now, I am not trying to self promote even though It is going to be fully open source. If this is something of interest, I can post it, it just is not ready yet.
Artistic_Tension_983@reddit
Now best choice is Nanowakeword: https://github.com/arcosoph/nanowakeword
Technomancer1672@reddit
Check this notebook. I believe they generate samples of people saying the word using Vosk. Depending on the hardware you have look in to using a better TTS model to generate the samples, but synthetic data is probably the only way you're going to be able to train it. Consider adding reverb, overlaying background noise, likely false positives, etc...
rhinodevil@reddit
I also tried this a while ago. Not so easy to get workable results (I didn't).
rhinodevil@reddit
Maybe this works better? https://github.com/TaterTotterson/microWakeWord-Trainer-Nvidia-Docker
betanu701@reddit (OP)
I will give the notebook a try though!! See if I can throw my new word into that.
betanu701@reddit (OP)
Thanks, I tried using auto generated ones. I think it turned out well, and then I had about 10-20 mins of myself and other family saying it, but can't get it to pick up.
betanu701@reddit (OP)
Thanks, I tried using auto generated ones. I think it turned out well, and then I had about 10-20 mins of myself and other family saying it, but can't get it to pick up.
Impossible-Glass-487@reddit
"Kardashian"