AutoHarness
Automated harness engineering for AI agents: auto-generates test harnesses to evaluate agent safety and reliability across different scenarios.
An open-source evaluation and testing library for LLM agents, providing automated model scanning, bias detection, performance benchmarking, and compliance checks.
NVIDIA NeMo Guardrails is an open-source toolkit for adding programmable guardrails to LLM-based conversational systems, supporting topic control, safety enforcement, and dialog guidance.
NVIDIA's open-source LLM vulnerability scanner that automatically probes language models for security issues, including safety vulnerabilities, hallucination tendencies, jailbreak risks, and prompt injection attacks.
A security toolkit for LLM interactions, providing prompt injection detection, PII anonymization, content safety auditing, and more to secure production LLM deployments.