Is real-time voice-to-voice still science fiction?

Posted by junior600@reddit | LocalLLaMA | View on Reddit | 21 comments

Hi everyone, as the title says: is it possible to have real-time voice-to-voice interaction running locally, or are we still not there yet?
I'd like to improve my speaking skills (including pronunciation) in English and Japanese, and I thought it would be great to have conversations with a local LLM.
It would also be nice to have something similar in Italian (my native language) for daily chats, but I assume it's not a very "popular" language to train on. lol