Cappuccino
NormalDescription
A research project exploring how models understand web interfaces, decompose action steps, and complete complex online tasks through browser agent capabilities.
A research project exploring how models understand web interfaces, decompose action steps, and complete complex online tasks through browser agent capabilities.
Browser automation tool for AI agents and humans, providing high-performance web interaction capabilities built in Go
An open-source, vision-first browser agent that drives web automation through visual understanding, supporting complex web interaction tasks for QA testing and workflow automation.
The first LLM-based web agent and benchmark for generalist web agents, providing datasets, evaluation frameworks and baseline methods for building agents that operate on real websites.
An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.