Best Browser Agent Top 20
Top 20 most popular open-source Browser Agent projects, ranked by GitHub Stars.
browser-use
93.4k Starsbrowser-use enables browser automation for agents, allowing LLMs to understand pages and perform complex web interactions.
Scrapling
48.9k StarsAn adaptive web scraping framework that intelligently handles anti-bot measures, from single requests to full-scale crawls, designed for AI agent data collection.
Chrome DevTools MCP
39.3k StarsMCP server providing Chrome DevTools capabilities to coding agents, enabling web debugging, performance analysis, and DOM manipulation automation.
GitNexus
37.7k StarsThe Zero-Server Code Intelligence Engine — a client-side knowledge graph creator running entirely in your browser with a built-in Graph RAG Agent for code exploration.
UI-TARS Desktop
33.0k StarsByteDance's open-source multimodal AI agent stack connecting cutting-edge AI models with agent infrastructure for GUI automation and computer control.
Agent Browser
32.7k StarsAn open-source browser automation CLI for AI agents by Vercel, built with Rust for high performance and programmability.
Lightpanda Browser
30.2k StarsA lightweight browser runtime designed for automation and scraping scenarios, offering lower overhead than traditional browsers for headless tasks.
AgenticSeek
26.3k StarsFully local Manus AI alternative that autonomously browses the web, writes code, and interacts via voice, with no API costs
Open-AutoGLM
25.2k StarsOpen-AutoGLM is an open phone agent model and framework enabling AI to autonomously operate smartphone interfaces, unlocking the AI Phone experience for everyone.
Stagehand
22.6k StarsThe SDK for browser agents by Browserbase. Provides act, extract, and observe primitives for AI agents to naturally browse and interact with web pages.
Skyvern
21.6k StarsSkyvern is an agent platform for browser task automation, using page understanding and action planning to complete complex web workflows such as forms and back-office tasks.
Agent Reach
19.2k StarsGive your AI agent eyes to see the entire internet. Read and search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu with one CLI and zero API fees.
Deep Research
18.9k StarsAI-powered research assistant that performs iterative deep research on any topic by combining search engines, web scraping, and LLMs
Page Agent
17.7k StarsPage Agent is a JavaScript in-page GUI agent by Alibaba that controls web interfaces with natural language, enabling automated form filling, page navigation, and element interaction.
Browser Use Web UI
16.0k StarsA web interface for running AI agents in the browser, providing a visual experience for browser automation operations.
CUA
16.0k StarsCUA provides open-source infrastructure for Computer-Use Agents, including sandboxes, SDKs, and benchmarks to train and evaluate AI agents that control full desktops (macOS, Linux, Windows).
PPT Master
14.8k StarsAI-powered PPT generation tool that creates natively editable PPTX from any document, producing real PowerPoint shapes instead of images.
Browserless
13.1k StarsDeploy headless browsers in Docker. Run on cloud or bring your own infrastructure. Provides powerful web automation and rendering capabilities for AI agents. Free for non-commercial uses.
Midscene.js
13.0k StarsAI-powered vision-driven UI automation that lets you describe actions in natural language instead of writing selectors, supporting browser and mobile platforms
NanoBrowser
13.0k StarsNanoBrowser is an open-source Chrome extension for AI-powered multi-agent browser automation, supporting web task workflows with your own LLM API key.
Related Articles
AI Coding Agents Deep Dive: Architecture Trade-offs from CLI to IDE-Integrated
A deep architectural comparison of seven open-source coding agents across three paradigms — CLI-first, IDE-integrated, and fully autonomous — examining context management, tool access, and autonomy levels to help you pick the right tool for each development scenario.
Browser Agents in Practice: Architecture and Pitfalls of AI-Controlled Browsers
Breaking down three abstraction layers for browser automation—from raw Playwright to structured extraction—with production patterns, runnable code, and common pitfalls.
Web Automation Agent in Practice: Limits and Best Practices of browser-use
A practical breakdown of browser-use strengths and limits in web task automation, with strategies for stable execution and failure recovery.