AgentList
HomeProjectsArticlesAbout
Explore Projects
HomeProjectsArticlesAbout
Explore Projects
Projects Hugging Face Evaluate

Hugging Face Evaluate

Active
GitHub Python Apache-2.0

Description

A library by Hugging Face for easily evaluating machine learning models and datasets, providing a wide range of metrics and evaluation methods.

Tags

evaluation llm python huggingface framework

Categories

📊 Observability
Visit GitHub Visit Website

Project Metrics

Stars 2.5k
Forks 320
Watchers 2.5k
Issues 279
Created March 30, 2022
Last commit May 26, 2026

Deployment

Local

Related Projects

Argilla

5.0k · Python
Active

Argilla is a collaboration platform for AI engineers and domain experts to build high-quality datasets, collect human feedback, and evaluate models.

evaluationdata-processingllm +2

Weave

1.1k · Python
Active

A toolkit by Weights & Biases for developing AI-powered applications, providing LLM call tracing, evaluation experiment management, and versioning from prototype to production.

observabilityevaluationllm +2

PrompToMatix

957 · Python
Active

An automatic prompt optimization framework by Salesforce AI Research that leverages LLMs to search for and refine prompts for improved model performance.

prompt-engineeringevaluationllm +1

SwanLab

4.0k · Python
Active

An open-source, modern-design AI training tracking and visualization tool. Supports PyTorch, Transformers and more. Monitor and evaluate AI agent training processes.

pythonobservabilityevaluation +2
AgentList

The most comprehensive directory of open-source AI Agent projects. Discover and compare top Agent frameworks like LangChain, CrewAI, and more.

Quick Links

  • Project List
  • Featured Articles
  • Browse Categories

Contact

  • About
  • Privacy Policy
  • Contact Us

© 2026 AgentList. All rights reserved.

Made with for the open source community