Best model for speech to text Transcription for including filler words ?

Posted by Similar-Camp9685@reddit | LocalLLaMA | View on Reddit | 2 comments

Hey everyone, I want to perform speech-to-text transcription in which I have to include filler words like: um, ah, so etc. which highlight confidence. Is there any type of model which can help me? I tried WhisperX but the results are not favorable. This is very important for me as I'm writing a research paper.