相关项目
LLM-Jailbreaks
603 ·
A comprehensive collection of LLM jailbreak techniques and prompts for ChatGPT, Claude, Llama, and other models — essential reference for LLM security research.
llmsecurityprompt-engineering
LLM Guard
2.8k · Python
LLM 交互安全工具包,提供提示词注入检测、敏感信息脱敏、内容安全审计等防护能力,保障生产环境 LLM 调用的安全性。
securityllmpython +2
Rebuff
1.5k · TypeScript
针对 LLM 的提示词注入检测器,结合启发式规则、向量相似度和语言模型多重防御策略,有效识别和阻止恶意提示注入攻击。
securityllmtesting +2
AgentShield
510 · TypeScript
AI agent security scanner that detects vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, and GitHub App integration.
typescriptsecurityllm +2