Cohere Transcribe WebGPU: state-of-the-art multilingual speech recognition in your browser
Posted by xenovatech@reddit | LocalLLaMA | 1 comment
Yesterday, Cohere released their first speech-to-text model, which now tops the OpenASR leaderboard for English (the model itself supports 14 languages).
So, I decided to build a WebGPU demo for it: running the model entirely locally in the browser with Transformers.js. I hope you like it!
Link to demo (+ source code): [https://huggingface.co/spaces/CohereLabs/Cohere-Transcribe-WebGPU](https://huggingface.co/spaces/CohereLabs/Cohere-Transcribe-WebGPU)
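
For anyone wondering what "entirely locally in the browser with Transformers.js" roughly involves, here is a minimal sketch. It is not the demo's actual source (that is at the link above). `MODEL_ID` is a hypothetical placeholder, the helper names `pickDevice`, `toMono`, and `transcribe` are made up for illustration, and it assumes the model can be loaded through the library's generic `automatic-speech-recognition` pipeline, which may not match how the demo does it.

```typescript
// Sketch only. MODEL_ID is a hypothetical placeholder, not the demo's real checkpoint name.
const MODEL_ID = "your-org/your-asr-model";

type Device = "webgpu" | "wasm";
type AsrFn = (audio: Float32Array) => Promise<{ text: string }>;
// Simplified shape of a pipeline factory, passed in so the sketch has no hard dependency.
type PipelineFactory = (
  task: string,
  model: string,
  options: { device: Device },
) => Promise<AsrFn>;

// Prefer WebGPU when the browser exposes `navigator.gpu`, otherwise fall back to WASM.
// Note: the presence of `gpu` does not guarantee that a usable adapter exists.
function pickDevice(nav: unknown): Device {
  return typeof nav === "object" && nav !== null && "gpu" in nav ? "webgpu" : "wasm";
}

// Average all channels into a single mono Float32Array.
function toMono(channels: Float32Array[]): Float32Array {
  const length = channels.length > 0 ? channels[0].length : 0;
  const mono = new Float32Array(length);
  for (const channel of channels) {
    for (let i = 0; i < length; i++) {
      mono[i] += channel[i] / channels.length;
    }
  }
  return mono;
}

// Build the pipeline on the chosen device and return the transcribed text.
async function transcribe(
  createPipeline: PipelineFactory,
  audio: Float32Array,
  nav: unknown = (globalThis as { navigator?: unknown }).navigator,
): Promise<string> {
  const asr = await createPipeline("automatic-speech-recognition", MODEL_ID, {
    device: pickDevice(nav),
  });
  const result = await asr(audio);
  return result.text;
}
```

In a real page you would pass in the `pipeline` function exported by `@huggingface/transformers` as `createPipeline` (a type cast may be needed, since the library's own typings are richer than the simplified `PipelineFactory` here) and feed it mono samples decoded from a file or the microphone at whatever sample rate the model expects.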
1 Comment
urekmazino_0@reddit