UFO
Active
Description
UFO is a Windows GUI automation agent by Microsoft that understands screen interfaces and executes complex OS tasks through natural language commands.
Open-source agentic framework that uses computers like a human, capable of completing complex GUI tasks with autonomous learning and experience accumulation.
Open-source computer-use agents that operate across platforms, including Windows, macOS, Ubuntu, and Android. ICLR 2026 Oral paper project.
Open-source Computer-Use-Agent that automates GUI interactions through natural language instructions, enabling intelligent desktop automation.
The first open-source Artificial Narrow Intelligence generalist agent that fully operates GUIs using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning for spatial perception and HID simulation.