How is the new Grok AI girlfriend animation implemented?

Posted by EvilKY45@reddit | LocalLLaMA

Looks pretty expressive: https://www.youtube.com/shorts/G8bd-uloo48. I tried it in their app, and everything (text, audio, lip sync, body movement) is generated in real time.

How do they implement that? Is there any open-source work that achieves similar results?