UQLM

Active

GitHub Python Apache-2.0

Description

CVS Health's open-source uncertainty quantification library for language models, providing UQ-based hallucination detection with confidence scoring and mitigation tools to identify and reduce unreliable LLM outputs.

Related Projects

Guardrails AI

7.0k · Python

Active

Guardrails AI adds programmable guardrails to large language models, ensuring reliability and safety through input/output validation, structured data extraction, and custom validators.

guardrailsllm-safetyvalidation +2

OpenAI Evals

18.6k · Python

Normal

OpenAI's framework for evaluating LLMs and LLM systems, providing an open-source registry of benchmarks and tools for systematic model assessment.

llm-evaluationbenchmarkevals +2

Inspect AI

2.2k · Python

Active

A framework for large language model evaluations developed by the UK AI Safety Institute (AISI), providing comprehensive model capability assessment tools with support for safety and alignment testing.

llm-evaluationai-safetyevaluation-framework +2

NeMo Guardrails

6.3k · Python

Active

NVIDIA NeMo Guardrails is an open-source toolkit for adding programmable guardrails to LLM-based conversational systems, supporting topic control, safety enforcement, and dialog guidance.

guardrailsllm-safetynvidia +2

UQLM

Description

Tags

Categories

Related Projects

Guardrails AI

OpenAI Evals

Inspect AI

NeMo Guardrails