Gemma 4 running fully offline on WebGPU with Transformers.js, controlling Reachy Mini over WebSerial.
Posted by xenovatech@reddit | LocalLLaMA | View on Reddit | 10 comments
lordhiggsboson@reddit
very cool!
Anthamon@reddit
Gotta get a new voice for them.
This one doesn't match very well with the cute aesthetic
Sliouges@reddit
Marvin?
thetaFAANG@reddit
ohh wow I thought this was 3D rendering in the browser whoops
would have been impressive as well, but we already know they can do 3D models
__JockY__@reddit
This has inspired me! I have a Reachy Mini waiting for a rainy day to build with the kids. Gonna do my rain dance now!
WebSerial is new to me, what a great idea. Does the reachy expose serial over BT?
scottgal2@reddit
Very nice!
jake_that_dude@reddit
the WebSerial bit is the part I like here. WebGPU local inference is cool, but wiring the action loop straight to browser APIs keeps the trust boundary small: model sees camera/frame state, JS does the motor command, nothing leaves the machine.
if you want to make the demo easier to verify, overlay the actual prompt + selected tool call before each movement. even just
observe -> decide -> serial.write()on screen would make it way clearer that Gemma is driving it instead of a hidden script.tchek@reddit
Gemma 4, how many parameters?
xenovatech@reddit (OP)
This demo is running Gemma 4 E2B, since it works pretty well in-browser (you can try it out using this demo I posted a couple weeks ago: https://huggingface.co/spaces/webml-community/Gemma-4-WebGPU)
Last_Bad_2687@reddit
Super cool, but wish you did more to show it actually processing the imagesĀ