CrisperWhisper ranks #2 on Open ASR Leaderboard

Posted by vaibhavs10@reddit | LocalLLaMA | View on Reddit | 10 comments

Hi All,

I'm VB, GPU Poor at Hugging Face. We ran the speech recognition benchmarks for a relatively new Whisper-large-v3 fine-tune and it now ranks #2 on the Open ASR Leaderboard. 🔥

CrisperWhisper aims to transcribe every spoken word exactly as it is, including fillers, pauses, stutters and false starts.

Fine-tuned from Whisper Large V3 it beats it by roughly ~1 WER margin ⚡

Kudos NyraHealth team - Open Speech Recognition scene is heating up!

You can find the Leaderboard here: https://huggingface.co/spaces/hf-audio/open_asr_leaderboard

What would you like to see on the leaderboard next? Keen on your feedback!

[-]

keniget@reddit

CW is awesome, thanks a lot for providing it out in the open!

for my iOS app recently I went through all the transcription services (elevenlabs, etc) and models, and landed finally on crispy whisper + openai tts.

My only challenge left is how to highlight the text when they are rendered in markdown as the text transcribed and the markdown rendered are different.

[-]

CrisperWhisper ranks #2 on Open ASR Leaderboard

keniget@reddit

Zemanyak@reddit

az226@reddit

rangerrick337@reddit

herozorro@reddit

TheActualStudy@reddit

Psychedelic_Traveler@reddit

grim-432@reddit

NoJellyfish6949@reddit

YearZero@reddit