Whisper Transcription Workflow: Home Server vs. Android Phone? Seeking Advice!
Posted by CtrlAltDelve@reddit | LocalLLaMA | 11 comments
I've been doing a lot with the Whisper models lately. I find myself making voice recordings while I'm out, and then later I use something like MacWhisper at home to transcribe them using the best available Whisper model. After that, I take the content and process it using a local LLM.
This workflow has been *really* helpful for me.
One inconvenience is having to wait until I get home to use MacWhisper. I also prefer not to use any hosted transcription services. So, I've been considering a couple of ideas:
First, seeing if I can get Whisper to run properly on my Android phone (an S25 Ultra). This is pretty involved, and I'm not much of an Android developer. I've done some reading on transformers.js, but I think it's a little beyond my ability right now.
Second, having Whisper running on my home server continuously. This server is a Mac Mini M4 with 16 GB of RAM. I could set up a watch directory so that any audio file placed there gets automatically transcribed. Then, I could use something like Blip to send the files over to the server and have it automatically accept them.
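For the second idea, a minimal sketch of the watch directory might look like the following. It is only an illustration under a few assumptions: the folder names, the `scan_once` and `watch` helpers, and the polling intervals are hypothetical, the transfer tool is assumed to drop files straight into the watched folder, and the default transcriber assumes the open-source `whisper` command-line tool (from the openai-whisper package) is installed and on the PATH. Any other transcriber can be passed in instead.

```python
import subprocess
import time
from pathlib import Path

# Extensions treated as audio; adjust to whatever the recorder produces.
AUDIO_EXTENSIONS = {".wav", ".mp3", ".m4a", ".flac", ".ogg"}


def is_stable(path, wait=1.0):
    """Return True if the file size did not change during `wait` seconds,
    which suggests the transfer into the folder has finished."""
    size = path.stat().st_size
    time.sleep(wait)
    return path.stat().st_size == size


def whisper_cli(audio_path, out_dir):
    """Assumption: the openai-whisper CLI is installed and on PATH."""
    subprocess.run(
        ["whisper", str(audio_path), "--model", "large-v3",
         "--output_dir", str(out_dir), "--output_format", "txt"],
        check=True,
    )


def scan_once(watch_dir, out_dir, transcribe, seen, settle=1.0):
    """Transcribe each new, fully written audio file once.
    Returns the list of files handled in this pass."""
    handled = []
    for path in sorted(Path(watch_dir).iterdir()):
        if not path.is_file() or path.suffix.lower() not in AUDIO_EXTENSIONS:
            continue
        if path in seen:
            continue
        if not is_stable(path, settle):
            continue  # still being written; retry on the next pass
        transcribe(path, Path(out_dir))
        seen.add(path)
        handled.append(path)
    return handled


def watch(watch_dir, out_dir, transcribe=whisper_cli, interval=5.0):
    """Poll the watch directory forever, transcribing files as they arrive."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    seen = set()
    while True:
        scan_once(watch_dir, out_dir, transcribe, seen)
        time.sleep(interval)
```

Calling `watch("inbox", "transcripts")` would then run until interrupted. Polling is used here instead of filesystem events purely to keep the sketch dependency-free. The `seen` set lives in memory, so a restart would re-transcribe files still in the folder unless they are moved out after processing.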
Does anyone have any suggestions on either of these? Or any other thoughts?