AgentList
HomeProjectsArticlesAbout
Explore Projects
HomeProjectsArticlesAbout
Explore Projects
Home / Projects / Bananalyzer

Bananalyzer

Normal
GitHub Python MIT

Description

Open source AI Agent evaluation framework for web tasks to measure and compare AI agent performance on web operations.

Tags

agent-evaluation web-tasks benchmark observability python

Categories

📊 Observability 🌐 Browser Agent
Visit GitHub

Project Metrics

Stars 327
Forks 0
Watchers 0
Issues 0
Created January 1, 2025
Last commit February 28, 2026

Deployment

Local

Related Projects

LM Evaluation Harness

12.3k · Python
Active

A framework for few-shot evaluation of language models by EleutherAI, providing standardized evaluation pipelines supporting hundreds of benchmark tasks and widely adopted as a core LLM evaluation tool in the community.

llm-evaluationbenchmarkevaluation-framework +2

HolmesGPT

2.2k · Python
Active

A CNCF Sandbox SRE Agent that automatically analyzes infrastructure logs and metrics to assist with incident diagnosis and system operations.

observabilitypythonagent +2

SwanLab

3.8k · Python
Active

An open-source, modern-design AI training tracking and visualization tool. Supports PyTorch, Transformers and more. Monitor and evaluate AI agent training processes.

pythonobservabilityevaluation +2

AgentDiff

27 · Python
Active

Interactive sandboxes for AI agent evaluations and reinforcement learning on third-party APIs like Slack, LinkedIn, and more.

agent-evaluationsandboxreinforcement-learning +2
AgentList

Curated directory of open-source AI agent projects

Quick Links

  • Project List
  • Featured Articles
  • Browse Categories

Contact

  • About
  • Privacy Policy
  • Contact Us

© 2026 AgentList. All rights reserved.

Made with for the open source community