Nvidia Parakeet-Realtime-EOU-120m-v1

Posted by nuclearbananana@reddit | LocalLLaMA | View on Reddit | 10 comments

Parakeet-Realtime-EOU-120m-v1 is a streaming speech recognition model that also performs end-of-utterance (EOU) detection. It achieves low latency (80ms~160 ms) and signals EOU by emitting an token at the end of each utterance. The model supports only English and does not output punctuation or capitalization.