AgentList

Weave

Active
GitHub · Python · Apache-2.0

Description

A toolkit from Weights & Biases for building AI-powered applications. It provides LLM call tracing, management of evaluation experiments, and versioning, covering the path from prototype to production.

Tags

observability evaluation llm python framework

Categories

📊 Observability

Project Metrics

Stars 1.1k
Forks 151
Watchers 1.1k
Issues 368
Created June 7, 2023
Last commit April 18, 2026

Deployment

Cloud

Related Projects

Argilla

4.9k · Python
Active

Argilla is a collaboration platform for AI engineers and domain experts to build high-quality datasets, collect human feedback, and evaluate models.

evaluation data-processing llm +2

OpenInference

924 · Python
Active

OpenTelemetry instrumentation for AI observability, providing standardized tracing, metrics collection, and span definitions for LLM inference processes to help developers monitor and debug AI agent systems.

observability python llm +2

Hugging Face Evaluate

2.4k · Python
Active

A library by Hugging Face for easily evaluating machine learning models and datasets, providing a wide range of metrics and evaluation methods.

evaluation llm python +2

Purple Llama

4.1k · Python
Active

Meta's set of tools to assess and improve LLM security, including safety benchmarks, prompt injection detection, and output auditing to help evaluate and enhance the safety of large language models.

security evaluation python +2
Curated directory of open-source AI agent projects

© 2026 AgentList. All rights reserved.