browser-use

Active

Description

browser-use enables browser automation for agents, allowing LLMs to understand pages and perform complex web interactions.

Related Projects

ScaleCUA

1.1k · Python

Stale

Open-sourced computer use agents that can operate on cross-platform environments including Windows, macOS, Ubuntu, and Android. ICLR 2026 Oral paper project.

browserpythonagent +1

TuriX CUA

2.9k · Python

Active

Open-source Computer-Use-Agent that automates GUI interactions through natural language instructions, enabling intelligent desktop automation.

browseragentpython +1

PyWinAssistant

1.3k · Python

Stale

The first open-source Artificial Narrow Intelligence generalist agent that fully operates GUIs using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning for spatial perception and HID simulation.

browseragentpython +2

PPT Master

14.8k · Python

Active

AI-powered PPT generation tool that creates natively editable PPTX from any document, producing real PowerPoint shapes instead of images.

browserpythonagent +2

Browser AgentWeb 自动化Playwright

Browser Agents in Practice: Architecture and Pitfalls of AI-Controlled Browsers

Breaking down three abstraction layers for browser automation—from raw Playwright to structured extraction—with production patterns, runnable code, and common pitfalls.

browser-useWeb AutomationAgent

Web Automation Agent in Practice: Limits and Best Practices of browser-use

A practical breakdown of browser-use strengths and limits in web task automation, with strategies for stable execution and failure recovery.

browser-use

Description

Tags

Categories

Related Projects

ScaleCUA

TuriX CUA

PyWinAssistant

PPT Master

Related Articles

Browser Agents in Practice: Architecture and Pitfalls of AI-Controlled Browsers

Web Automation Agent in Practice: Limits and Best Practices of browser-use