A 4B Model That Outperforms 32B on GUI Tasks, Fully Open-Source

Posted by Successful-Bill-5543@reddit | LocalLLaMA

The release includes:

  1. A 4B GUI agent model that can run on a local computer.
  2. Plug-and-play inference infrastructure that handles ADB connections, dependency installation, and task recording/replay.
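
To give a rough idea of what "ADB connections and task recording/replay" can look like in practice, here is a minimal sketch. It is not the project's actual code or API: `adb_command`, `TaskRecorder`, `replay`, and the action dictionary format are all hypothetical names made up for illustration. The only thing assumed about the real world is that the standard `adb shell input tap|swipe|keyevent` commands are available on your machine.

```python
import json
import subprocess
from typing import Callable, List, Optional

Runner = Callable[[List[str]], None]


def adb_command(action: dict, serial: Optional[str] = None) -> List[str]:
    """Translate one action dict (hypothetical format) into an adb argv list."""
    base = ["adb"] + (["-s", serial] if serial else [])
    kind = action["type"]
    if kind == "tap":
        return base + ["shell", "input", "tap", str(action["x"]), str(action["y"])]
    if kind == "swipe":
        coords = [str(action[k]) for k in ("x1", "y1", "x2", "y2")]
        return base + ["shell", "input", "swipe"] + coords + [str(action["duration_ms"])]
    if kind == "keyevent":
        return base + ["shell", "input", "keyevent", str(action["code"])]
    raise ValueError(f"unsupported action type: {kind!r}")


def run_adb(argv: List[str]) -> None:
    """Default runner: actually invoke adb (requires a connected device)."""
    subprocess.run(argv, check=True)


class TaskRecorder:
    """Executes actions and keeps a log so the task can be replayed later."""

    def __init__(self, runner: Runner = run_adb) -> None:
        self.runner = runner
        self.actions: List[dict] = []

    def perform(self, action: dict) -> None:
        self.runner(adb_command(action))
        self.actions.append(action)

    def dumps(self) -> str:
        # Serialize the recorded actions as JSON.
        return json.dumps(self.actions)


def replay(recording: str, runner: Runner = run_adb) -> None:
    """Re-issue every action from a JSON recording, in order."""
    for action in json.loads(recording):
        runner(adb_command(action))
```

In a real agent loop the model would pick each action from a screenshot; here the runner is injectable so you can dry-run it without a device, e.g. `TaskRecorder(runner=print)`.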