Hugging Face Evaluate
A library by Hugging Face for easily evaluating machine learning models and datasets, providing a wide range of metrics and evaluation methods.
Argilla is a collaboration platform for AI engineers and domain experts to build high-quality datasets, collect human feedback, and evaluate models.
A toolkit by Weights & Biases for developing AI-powered applications, providing LLM call tracing, evaluation experiment management, and versioning from prototype to production.
An automatic prompt optimization framework by Salesforce AI Research that leverages LLMs to search for and refine prompts for improved model performance.
A comprehensive benchmark for evaluating LLMs as agents (ICLR 2024), covering operating systems, databases, knowledge graphs, digital card games, and more.