AI Red Teaming Playground Labs

Normal

GitHub TypeScript MIT

Description

Microsoft's open-source AI red teaming playground labs with infrastructure for running AI red teaming trainings and hands-on security exercises.

Related Projects

Vigil

478 · Python

Stale

Vigil is an LLM security detection tool that identifies prompt injections, jailbreaks, and other potentially risky LLM inputs through multi-dimensional analysis for real-time safety protection.

prompt-injectionsecurityllm-safety +2

EasyJailbreak

851 · Python

Normal

An easy-to-use Python framework for generating adversarial jailbreak prompts, helping researchers systematically evaluate LLM safety defenses with multiple attack method combinations.

jailbreakadversarialllm-safety +2

AgentDojo

560 · Python

Normal

A dynamic environment by ETH Zurich to evaluate attacks and defenses for LLM agents, providing standardized benchmarks for measuring agent system security.

security-benchmarkagent-evaluationattack-defense +2

Open-Prompt-Injection

439 · Python

Stale

An open-source benchmark for prompt injection attacks and defenses in LLMs, systematically evaluating the effectiveness of different attack strategies and defense mechanisms.

prompt-injectionbenchmarkllm-safety +2

AI Red Teaming Playground Labs

Description

Tags

Categories

Related Projects

Vigil

EasyJailbreak

AgentDojo

Open-Prompt-Injection