AI desktop agent that controls your OS (opensource, crossplatform)

Posted by IlyaZelen@reddit | Python | View on Reddit | 5 comments

https://github.com/777genius/os-ai-computer-use

What This Project Does

Local AI agent that lets control your entire desktop: mouse, keyboard, drag-and-drop across any application, with built-in vision of what's on the screen. Python backend + Flutter UI, runs fully on your machine.

Target Audience

Developers and users experimenting with computer-use AI. Functional MVP, actively developed.

Comparison

Browser agents (Browser Use, Playwright-based) only work inside browsers. OS AI operates at the OS level - automate Finder, Photoshop, System Settings, or any native app. Cross-platform (macOS/Windows/Linux), provider-agnostic architecture, no Docker required.

Built with Python. Provider-agnostic architecture - currently uses Anthropic, but designed to support OpenAI, Gemini and others. Plans: offline mode, execute cli commands on request. Your support motivates to develop the project ❤️