AgentList
HomeProjectsArticlesAbout
Explore Projects
HomeProjectsArticlesAbout
Explore Projects
Projects Promptfoo

Promptfoo

Active
GitHub TypeScript MIT

Description

Promptfoo is an evaluation and regression testing tool for LLM apps and agents, useful for comparing prompts, tool-call results, and model outputs over time.

Tags

evaluation testing prompts typescript

Categories

📊 Observability
Visit GitHub Visit Website View Docs

Project Metrics

Stars 21.8k
Forks 1.9k
Watchers 21.8k
Issues 311
Created April 28, 2023
Last commit June 2, 2026

Deployment

Local

Related Projects

Agenta

4.2k · TypeScript
Active

Agenta is an open-source LLMOps platform providing prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

observabilityllmopsprompt-management +2

Deep Research Bench

738 · Python
Active

Comprehensive benchmark for deep research agents, providing systematic evaluation framework for assessing deep research agent performance.

benchmarkevaluationdeep-research +2

Giskard

5.4k · Python
Active

An open-source evaluation and testing library for LLM agents providing automated model scanning, bias detection, performance benchmarking, and compliance checks.

evaluationtestingllm-safety +3

AgentLabs

550 · TypeScript
Stale

AgentLabs is a toolkit for agent development and testing, focused on experimentation, replay, and workflow support to improve iteration speed.

testingdeveloper-toolsevaluation +1
AgentList

The most comprehensive directory of open-source AI Agent projects. Discover and compare top Agent frameworks like LangChain, CrewAI, and more.

Quick Links

  • Project List
  • Featured Articles
  • Browse Categories

Contact

  • About
  • Privacy Policy
  • Contact Us

© 2026 AgentList. All rights reserved.

Made with for the open source community