Cappuccino

Normal

GitHub Python Apache-2.0

Description

A research project exploring how models understand web interfaces, break tasks down into action steps, and complete complex online tasks as browser agents.

Related Projects

Vibium

2.8k · Go

Active

A browser automation tool for AI agents and humans, built in Go for high-performance web interaction.

browser-automation · web-agent · go +1

Magnitude

4.0k · TypeScript

Stale

An open-source, vision-first browser agent that drives web automation through visual understanding, supporting complex web interaction tasks for QA testing and workflow automation.

vision-first · browser-automation · web-agent +2

Mind2Web

988 · Jupyter Notebook

Stale

The first LLM-based web agent and benchmark for generalist web agents, providing datasets, evaluation frameworks and baseline methods for building agents that operate on real websites.

web-agent · benchmark · llm +2

AgentLab

576 · Python

Normal

An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

web-agent · benchmark · evaluation +2