AgentList
HomeProjectsArticlesAbout
Explore Projects
HomeProjectsArticlesAbout
Explore Projects
Home / Projects / Argilla

Argilla

Active
GitHub Python Apache-2.0

Description

Argilla is a collaboration platform for AI engineers and domain experts to build high-quality datasets, collect human feedback, and evaluate models.

Tags

evaluation data-processing llm python framework

Categories

📊 Observability
Visit GitHub Visit Website

Project Metrics

Stars 4.9k
Forks 479
Watchers 4.9k
Issues 26
Created April 28, 2021
Last commit April 13, 2026

Deployment

Docker

Related Projects

Hugging Face Evaluate

2.4k · Python
Active

A library by Hugging Face for easily evaluating machine learning models and datasets, providing a wide range of metrics and evaluation methods.

evaluationllmpython +2

Weave

1.1k · Python
Active

A toolkit by Weights & Biases for developing AI-powered applications, providing LLM call tracing, evaluation experiment management, and versioning from prototype to production.

observabilityevaluationllm +2

PrompToMatix

948 · Python
Stale

An automatic prompt optimization framework by Salesforce AI Research that leverages LLMs to search for and refine prompts for improved model performance.

prompt-engineeringevaluationllm +1

AgentBench

3.3k · Python
Normal

A comprehensive benchmark to evaluate LLMs as agents (ICLR 2024), covering operating systems, databases, knowledge graphs, digital card games and more.

evaluationpythonagent +1
AgentList

Curated directory of open-source AI agent projects

Quick Links

  • Project List
  • Featured Articles
  • Browse Categories

Contact

  • About
  • Privacy Policy
  • Contact Us

© 2026 AgentList. All rights reserved.

Made with for the open source community