OpenClaw
OpenClaw is an open-source personal AI assistant platform supporting 25+ messaging channels (WhatsApp, Telegram, Slack, etc.) with multi-LLM integration and personal knowledge management.
An agentic skills framework and software development methodology that provides reusable skill modules and engineered workflows for AI coding agents.
The agent harness performance optimization system with skills, instincts, memory, security and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
n8n is a powerful workflow automation platform with native AI agent nodes, enabling multi-step agent workflow orchestration and hundreds of external service integrations.
AutoGPT is an autonomous AI agent that can complete user-defined tasks end-to-end. It plans and executes steps on its own and is considered a milestone in agent autonomy.
An autonomous AI agent framework from NousResearch that supports multiple LLM backends and grows with user needs.
OpenCode is an open-source terminal coding agent that supports multiple LLM providers, offering AI-powered code generation and editing in the terminal.
Langflow is a visual AI agent and workflow builder platform with drag-and-drop design, multi-LLM integration, and tool composition to simplify agent development.
Official Anthropic repository for Agent Skills, providing ready-to-use Claude agent skill examples and templates.
Dify is an open-source LLM application development platform with a visual agent orchestration interface, supporting workflows, knowledge bases, and multiple models.
Open WebUI is a feature-rich, user-friendly self-hosted AI platform supporting Ollama and OpenAI-compatible APIs, with RAG, agents, and MCP capabilities.
Full system prompts, internal tools and AI models from 40+ popular AI tools including Cursor, Devin, Windsurf, Manus, Lovable, and more.
LangChain is a framework for building applications powered by language models. It provides core capabilities such as chaining, memory management, and agent orchestration, making it a go-to choice for AI agent development.
Claude Code is an agentic coding tool by Anthropic that lives in your terminal, understands your codebase, and helps you code faster through natural language commands.
Firecrawl is a web scraping and search engine designed for AI agents, converting any webpage into structured Markdown data with search, scrape, and clean capabilities for building web-data-powered AI applications.
Firecrawl is the Web Data API for AI, turning web pages into clean, structured, LLM-friendly data with crawl, scrape, and search capabilities.
100+ AI Agent and RAG apps you can actually run — clone, customize, and ship. A great reference for quickly building LLM-powered applications.
Gemini CLI is a terminal-based AI agent tool from Google that supports code generation, file operations, and multi-turn conversations with a free usage tier.
browser-use enables browser automation for agents, allowing LLMs to understand pages and perform complex web interactions.
A cross-platform desktop All-in-One assistant tool for managing Claude Code, Codex, OpenCode, OpenClaw and Gemini CLI agents in one place.
A curated collection of hundreds of community-verified MCP server implementations spanning databases, search engines, dev tools, browser automation, and more, helping developers quickly discover and integrate MCP services for their use cases.
NextChat is a lightweight, cross-platform AI assistant client supporting GPT-4, Claude, Gemini and more, with Web, desktop, and mobile experiences.
Codex CLI is OpenAI's open-source coding-agent command-line tool for code understanding, refactoring, generation, and terminal collaboration in developer workflows.
Awesome DESIGN.md is a collection of DESIGN.md files inspired by popular brand design systems. Drop one into your project to give coding agents a design system reference for generating a matching UI.
MCP Servers provides a large collection of reusable Model Context Protocol server implementations, giving agents standardized tool capabilities.
AI research automation agent by Andrej Karpathy that automatically runs nanochat training research experiments on a single GPU.
TradingAgents is a multi-agent trading framework built with LangGraph that mirrors real-world trading firm dynamics with specialized LLM-powered agents for fundamental analysis, sentiment analysis, risk management, and more.
A leading open-source RAG engine that fuses cutting-edge retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.
A high-throughput and memory-efficient inference and serving engine for LLMs, featuring PagedAttention, continuous batching, and optimized KV cache management for production deployments.
A Claude Code plugin that automatically captures coding session context, compresses it with AI, and injects relevant context back into future sessions for persistent memory.
Lobe Chat is an open-source ChatGPT-style chat application with a plugin system and multi-model support, suitable as an agent conversation interface.
The ultimate space for work and life to find, build and collaborate with agent teammates that grow with you, enabling multi-agent collaboration and team design.
Generate short videos with one click using an LLM. A multi-step automation workflow from script generation to video composition.
Run Local LLMs on Any Device. Open-source and available for commercial use. Provides fully offline local inference and chat for AI agents.
OpenHands is an open-source AI software engineering agent platform that can automatically execute development tasks, modify code, and support collaborative iteration.
Comprehensive guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Turn screenshots, mockups, and Figma designs into clean code using AI models. Supports HTML/Tailwind, React, Vue, and other frontend frameworks.
Daytona provides secure development-environment infrastructure for coding agents and automation workflows, serving as a runtime base for remote execution tasks.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs. Supports LoRA, QLoRA, RLHF and more for building custom agent models.
Multi-functional interface for GPT/GLM LLMs with optimized academic paper reading, polishing and writing. Supports multi-model parallel, plugin extensions and local deployment.
An open-source long-horizon SuperAgent harness by ByteDance that researches, codes, and creates with sandboxes, memories, tools, skills, subagents and message gateway for complex tasks.
The Multi-Agent Framework for building the first AI Software Company, enabling natural language programming with multi-role collaboration for automated requirement analysis, design, coding, and testing.
A financial data platform for analysts, quants and AI agents, providing comprehensive financial data access across stocks, crypto, economics and more.
12 Lessons to Get Started Building AI Agents by Microsoft. Hands-on curriculum covering core agent concepts, tool use, and multi-agent collaboration.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.
Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, and gpt-oss locally, providing model fine-tuning and deployment capabilities for agent developers.
A nano, Claude Code-like agent harness built from scratch, demonstrating how to build AI coding assistants from zero to one.
A light-weight and powerful meta-prompting, context engineering, and spec-driven development system for AI coding agents like Claude Code.
Open Interpreter is a natural language interface for computers that lets LLMs run code locally to perform file operations, data analysis, and system management tasks.
Pathway is a Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG applications.
A curated list of awesome Claude Skills, resources and tools for customizing Claude AI workflows.
Cline is an autonomous coding agent in your IDE that can create/edit files, execute commands, use the browser, and more with your permission every step of the way.
AnythingLLM is an all-in-one AI productivity app with a self-hosted chat UI, RAG knowledge base, AI agents, and multi-model management, privacy-first with zero configuration.
The agentic development environment built for coding with multiple AI agents, providing a next-generation terminal experience.
Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.
An open-source agent harness platform providing the best agent toolkit, supporting multiple AI coding agents.
Ready-to-run cloud templates for RAG, AI pipelines and enterprise search with live data, always in sync with Sharepoint, Google Drive, S3, Kafka and more.
An adaptive web scraping framework that intelligently handles anti-bot measures, from single requests to full-scale crawls, designed for AI agent data collection.
Pi Mono is a comprehensive AI agent toolkit including a coding agent CLI, unified LLM API, TUI and web UI libraries, Slack bot, and vLLM pod management for end-to-end agent development.
AI-driven public opinion and trend monitor with multi-platform aggregation, RSS subscriptions, smart keyword filtering, AI-powered news analysis and briefings, supporting MCP integration and push notifications via WeChat, Feishu, DingTalk, Telegram and more.
Microsoft AutoGen is a multi-agent conversation framework that lets you create multiple agents to collaborate through dialogue and solve complex tasks.
AI coding assistant skill that turns any folder of code, docs, papers, images, or videos into a queryable knowledge graph. Works with Claude Code, Codex, Cursor, Gemini CLI, GitHub Copilot CLI, and more.
The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code/Codex integration.
Embedchain is a universal memory layer for AI agents, enabling quick integration of diverse data sources into LLMs for context-aware AI applications.
Mem0 is a long-term memory layer for AI agents, supporting cross-session memory management and personalized context retrieval.
Context7 is Upstash's context-engineering toolkit for agents, helping applications manage long context windows, retrieval injection, and history compression.
From vibe coding to agentic engineering — a practice guide helping developers master Claude Code best practices and advanced techniques.
A comprehensive tutorial on AI agent principles and practice, systematically covering core concepts, framework usage and hands-on projects.
GPT Engineer is an AI tool that generates entire codebases based on natural language descriptions. Just describe what you want to build, the AI asks for clarification, and then builds it.
Platform for experimenting with an AI software engineer: specify software in natural language, watch the AI write and execute code, then iterate on improvements.
MemPalace is an open-source AI memory system providing a persistent long-term memory layer for AI agents, with ChromaDB vector storage and MCP protocol integration.
Flowise is a low-code builder for LLM apps that lets you create agent workflows and RAG applications with drag-and-drop interfaces.
A multi-agent collaboration framework where AI agents form crews to accomplish complex tasks together. Role definition, task assignment, tool sharing, and process orchestration.
OpenSpec is a spec-driven development (SDD) platform that guides AI coding assistants to generate code through specification definitions, improving development efficiency and code quality.
LlamaIndex is a data framework for building LLM applications. It provides data connectors, indexing, query engines, and agent workflow orchestration — a core tool in the RAG ecosystem.
LlamaIndex is a data framework that provides the data connection layer for LLM applications, with strong RAG capabilities across diverse data sources and vector databases.
Build agents that monitor and act on your behalf. Create automated agents for Twitter, weather monitoring, web scraping, and many other scenarios.
LiteLLM provides a unified interface and proxy gateway for LLM calls, simplifying multi-model switching, routing, and cost control.
AI-powered job search system built on Claude Code with 14 skill modes, Go dashboard, PDF generation and batch processing.
Agent Skills is a curated collection of production-grade engineering skills for AI coding agents, maintained by Addy Osmani, providing battle-tested best practices and operational conventions.
Open-source frontier voice AI from Microsoft, providing high-quality speech synthesis and recognition for building real-time conversational voice agent applications.
The original local LLM interface supporting text generation, vision, tool-calling, and training with both a web UI and API. Runs 100% offline and private.
Cherry Studio is an AI productivity studio with smart chat, autonomous agents, and 300+ assistants, providing unified access to frontier LLMs.
Open-source AI engine to run any model — LLMs, vision, voice, image, video — on any hardware without GPU. Provides OpenAI-compatible API for fully local, privacy-first AI inference.
An AI-driven low-code platform with zero-code and code-generation modes, featuring built-in AI chat, knowledge base, workflow orchestration and MCP plugin system.
Open-source extensible AI coding agent that goes beyond code suggestions — install, execute, edit, and test with any LLM.
AI pair programming in your terminal. Collaborate with LLMs to edit code, manage Git, and refactor across multiple files with deep developer workflow integration.
CowAgent (formerly chatgpt-on-wechat) is a powerful AI assistant framework built on LLMs with autonomous planning, tool use, long-term memory, multi-agent collaboration, and multi-channel integration for WeChat, Feishu, DingTalk, and more.
Claude Cookbooks is Anthropic's official collection of notebooks and recipes showcasing fun and effective ways of using Claude, covering tool use, RAG, multimodal, and various agent application scenarios.
Milvus is a high-performance open-source vector database built for AI applications. It supports storage, indexing, and similarity search of large-scale vector data, ideal for RAG, recommendation systems, and more.
nanobot is an ultra-lightweight personal AI agent that supports multiple LLM backends for quickly deploying a private intelligent assistant.
The cloud-native API and AI Gateway providing LLM request routing, rate limiting, load balancing and observability for AI agent applications.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer, supporting multiple local LLMs with a full desktop chat UI and API server.
MCP server providing Chrome DevTools capabilities to coding agents, enabling web debugging, performance analysis, and DOM manipulation automation.
Fabric is an open-source framework for augmenting humans using AI, providing a modular system of crowdsourced AI prompts for solving specific problems anywhere.
CLI-Anything aims to make all software agent-native by transforming applications into unified CLI interfaces, enabling AI agents to naturally interact with and operate any software through a centralized CLI Hub.
An accessible multi-agent sentiment analysis assistant that breaks filter bubbles, reveals true public opinion, and predicts trends — built from scratch without external frameworks.
The Zero-Server Code Intelligence Engine — a client-side knowledge graph creator running entirely in your browser with a built-in Graph RAG Agent for code exploration.
Agno is a high-performance agent framework for building multimodal AI agents with memory, knowledge, and tool-use capabilities, supporting multiple LLM providers.
Phidata is a framework for building AI agents with memory, knowledge, and tool integration to make agents more capable and useful.
Chatbox is a powerful cross-platform AI client supporting OpenAI, Claude, Gemini, and other LLMs with desktop and mobile apps.
LLM-powered stock analysis system for A/H/US markets with multi-source quotes, real-time news, LLM decision dashboard and multi-channel push notifications.
An installable library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI and more, with installer CLI, bundles and workflows.
A generative speech model for daily dialogue, providing AI agents with natural and fluent voice synthesis with fine-grained prosody control.
MindsDB is a query engine for AI analytics that enables building self-reasoning agents across live data, connecting diverse data sources with AI models.
Opinionated RAG framework for integrating GenAI into your apps. Works with any LLM, any vectorstore, any files — so you can focus on your product instead of building RAG pipelines.
A local knowledge base RAG and Agent application platform built on LangChain with support for ChatGLM, Qwen, Llama and other LLMs, offering conversation, knowledge base management, and agent capabilities.
Open-source low-code platform for building internal tools, dashboards, business applications, workflows and AI agents with visual drag-and-drop development.
LibreChat is an enhanced open-source ChatGPT clone featuring Agents, MCP tools, multi-model support, code interpreter, AI search, and more.
A Python library by Google for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization, designed for data annotation and knowledge extraction workflows.
New API is a unified AI model hub for aggregation and distribution, supporting cross-conversion of various LLMs into OpenAI, Claude, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management.
Intelligent automation and multi-agent orchestration for Claude Code. Supports automated workflows, task coordination, and intelligent agent system building.
AgentGPT is a platform for assembling, configuring, and deploying autonomous AI Agents in your browser, allowing users to create goal-driven agents that execute tasks autonomously.
LightRAG is a simple and fast Retrieval-Augmented Generation framework using graph-enhanced retrieval, published at EMNLP 2025.
ByteDance's open-source multimodal AI agent stack connecting cutting-edge AI models with agent infrastructure for GUI automation and computer control.
Wrap Gemini CLI, Antigravity, ChatGPT Codex, Claude Code as an OpenAI/Gemini/Claude/Codex compatible API service for unified agent access.
A modern open-source VPS control panel with native AI agent support, enabling Ollama model deployment, AI agent management and full server stack control.
Teams-first multi-agent orchestration for Claude Code. Designed for team collaboration with support for multi-agent coordination, task distribution, and result integration to enhance team AI development efficiency.
In-depth tutorials on LLMs, RAG, and real-world AI agent applications. Rich notebook examples for learning AI engineering practices.
An AI-powered answering engine with multi-model integration, web search and local knowledge base, providing a Perplexity-like search experience.
An open-source browser automation CLI for AI agents by Vercel, built with Rust for high performance and programmability.
Multica is the open-source managed agents platform that turns coding agents into real teammates with task assignment, progress tracking, and compound skill accumulation.
Khoj is a self-hostable AI second brain that answers questions from the web or your docs, builds custom agents, schedules automations, and performs deep research.
DSPy is a declarative LLM programming framework focused on optimizable prompts and program structure, suitable for complex agent workflows.
One API is an LLM API management and redistribution system that unifies OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, and more under a single API. Supports key management, redistribution, and one-click Docker deployment.
AI-powered PDF scientific paper translation with preserved formats, supporting Google/DeepL/Ollama/OpenAI services via CLI/GUI/MCP/Docker/Zotero.
GitHub's official community-contributed collection of instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.
Agent skills for Obsidian. Teach your agent to use Markdown, Bases, JSON Canvas, and the CLI.
LangGraph is an agent workflow orchestration framework from the LangChain team, using graph structures to model agent state and transitions.
An AI Agent assistant that integrates multiple IM platforms, LLMs, plugins and AI features, supporting QQ, Telegram, Discord and more.
Tabby is a self-hosted AI coding assistant supporting code completion, code generation, and enterprise-grade deployment, compatible with major IDEs.
Continue is an open-source AI code assistant extension for VS Code and JetBrains IDEs. It can autocomplete, refactor, and explain code, helping developers improve programming efficiency.
A modular graph-based Retrieval-Augmented Generation system by Microsoft that uses LLMs to extract structured knowledge graphs from text, enabling global and local community summarization queries.
Playwright MCP is a Microsoft MCP server exposing Playwright browser automation capabilities to AI agents, supporting web interaction, screenshots, and structured data extraction.
ChatDev 2.0 enables full-lifecycle software development through LLM-powered multi-agent collaboration, simulating role-based teamwork in a virtual software company.
Chatbot UI is an open-source AI chat interface supporting OpenAI, Claude, Gemini and more, with a modern conversation UI and flexible deployment options.
The Frontend Stack for Agents & Generative UI. React + Angular component library, makers of the AG-UI Protocol, enabling AI agents to work directly in application UIs.
CopilotKit is an open-source framework for building AI agent frontends, supporting Generative UI and the AG-UI Protocol to help developers quickly integrate agent capabilities into apps.
An event-driven agentic orchestration platform providing a durable and highly resilient execution engine for applications and AI agents.
Qdrant is a high-performance vector database widely used as the retrieval layer for RAG and agent memory search scenarios.
Fast, small, and fully autonomous AI personal assistant infrastructure built with Rust. Deploy anywhere, swap anything, on any OS and platform.
Marketing skills for Claude Code and AI agents, covering CRO, copywriting, SEO, analytics and growth engineering.
A curated collection of 500 AI agent use cases across industries including healthcare, finance, education, and retail. Showcases practical applications with open-source project links.
A lightweight browser runtime designed for automation and scraping scenarios, offering lower overhead than traditional browsers for headless tasks.
GitHub's official MCP Server providing standardized access to GitHub APIs for AI agents, supporting repository management, issue handling, and PR operations.
An AI prompt optimizer that helps users write better prompts and achieve improved AI results.
Open source AI platform with enterprise-grade AI chat, advanced RAG and AI search capabilities that works with every LLM.
An AI agent that automates the job application process, analyzing job requirements and tailoring applications for personalized mass submission.
A lightweight AI assistant platform running securely in containers. Connects to WhatsApp, Telegram, Slack, Discord, Gmail with memory, scheduled jobs, and built on Anthropic's Agents SDK.
Hugging Face's official Agents Course covering fundamentals, frameworks (LangChain, LlamaIndex), and hands-on projects from beginner to advanced level.
Void is an open-source AI code editor built on VS Code architecture, supporting Claude, GPT, and other models, delivering a Cursor-style intelligent coding experience.
The Monorepo Platform that amplifies both developers and AI agents, optimizing builds, scaling CI and automatically fixing failed PRs.
LLM Frontend for Power Users with multi-model support, rich role-playing features, extensible plugin system, and local deployment.
Sim is a platform to build, deploy, and orchestrate AI agents with a visual low-code workflow editor, supporting OpenAI, Anthropic, DeepSeek and more for enterprise agent orchestration.
Composio is a tools and SaaS integration layer for agents, helping applications connect quickly to services like Gmail, Slack, and GitHub for multi-tool workflows.
Open-source LLM engineering platform providing tracing, evaluations, prompt management, and dataset management with integrations for LangChain, OpenAI, Anthropic, and more.
Langfuse is an open-source observability platform for LLM applications, supporting tracing, evaluation, prompt versioning, and cost analytics.
Open-source coding agent CLI supporting OpenAI, Gemini, DeepSeek, Ollama, Codex, GitHub Models, and 200+ models via OpenAI-compatible APIs.
FastGPT is a knowledge-based platform built on LLMs, offering out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration for easily developing and deploying complex question-answering systems.
Chroma is an open-source AI-native embedding database designed for building LLM applications. It provides simple APIs to store embeddings and perform similarity search, making it ideal for RAG applications.
A curated list of AI autonomous agents. A comprehensive collection of open-source agent projects for discovering and understanding the agent ecosystem.
Microsoft Semantic Kernel is a lightweight SDK for combining large language models with conventional programming languages to build AI agent applications.
An open-source low-code platform for building AI agents, automations and business applications, model agnostic with drag-and-drop visual development.
A multi-agent LLM-powered Chinese financial trading framework, enhanced Chinese version of TradingAgents with multi-source market data, real-time news, and LLM decision-making.
A comprehensive showcase of advanced Retrieval-Augmented Generation (RAG) techniques with detailed notebook tutorials and code examples, covering foundational to cutting-edge RAG implementations.
smolagents is a lightweight agent framework from Hugging Face for quickly building tool-using LLM agents.
Vercel's official collection of agent skills, providing practical skill modules and tools for AI coding agents.
GPT Researcher is an autonomous research agent that can gather, organize, and analyze information to produce detailed research reports.
A free, local, open-source 24/7 cowork app supporting multiple coding agents like Gemini CLI, Claude Code, and Codex with unified management and collaboration features.
Hundreds of models and providers. One command to find what runs on your hardware. Provides local LLM runtime for AI agents.
A set of ready-to-use Agent Skills for research, science, engineering, analysis, finance and writing across multiple coding agents.
An AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket and the web, then synthesizes a grounded summary.
An MCP-based open-source chatbot for ESP32 embedded devices, supporting voice interaction, multi-model integration, and IoT control for building smart hardware agents.
Graphiti is a temporal knowledge-graph engine for agent memory, helping systems continuously accumulate long-term context.
A lightweight, powerful framework from OpenAI for building multi-agent workflows with tool calling, agent handoffs, and guardrails.
An autonomous agent for deep financial research. Automatically analyzes financial reports, market data, company filings, generates investment recommendations and risk assessment reports.
A unified CLI for Google Workspace covering Drive, Gmail, Calendar, Sheets, Docs and more, with built-in AI agent skills for automation.
Get 10X more out of Claude Code, Codex or any coding agent. Manage agent tasks through kanban boards, track progress, and optimize workflows.
An AI-based Python scraper that uses LLMs and knowledge graphs to automatically build web data extraction pipelines.
Fully local Manus AI alternative that autonomously browses the web, writes code, and interacts via voice, with no API costs.
MLflow is the open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing AI agents and LLM applications, with model and data access management.
Open-source multi-agent framework from Alibaba, enabling the construction of observable and interpretable agents with rich distributed capabilities.
Repomix packs your entire repository into a single AI-friendly file, perfect for feeding your codebase to LLMs like Claude, ChatGPT, and DeepSeek for analysis, review, or code generation.
FastMCP is a fast, Pythonic library for building MCP servers and clients with over 1 million daily downloads, making it easy to create Model Context Protocol tools.
Haystack is an enterprise-grade framework for RAG and search applications, covering document processing, retrieval, generation, and evaluation end to end.
Kotaemon is an open-source RAG-based tool for chatting with your documents, featuring a clean chat interface and support for multiple LLM and embedding model backends.
Open-AutoGLM is an open phone agent model and framework enabling AI to autonomously operate smartphone interfaces, unlocking the AI Phone experience for everyone.
OpenViking is an open-source context database from Volcengine that unifies management of agent memory, resources, and skills through a filesystem paradigm, enabling hierarchical context delivery and self-evolution.
An open-source tool by OpenAI that turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.
Open-source agentic coding CLI by the Charm team, supporting multiple LLM backends for autonomous coding in the terminal.
An open-source AI coding agent that lives in your terminal, built by Qwen team with support for code generation, editing, debugging and multi-file operations.
Powerful MCP toolkit for coding that provides semantic retrieval and editing capabilities, serving as an IDE for AI agents.
An extremely fast and scalable memory engine for the AI era. Provides a unified Memory API for AI applications with large-scale knowledge storage and efficient retrieval.
Mastra is a TypeScript-first agent platform that combines workflows, memory, RAG, evaluation, and deployment for scalable full-stack AI agent applications.
The AI Toolkit for TypeScript by the Next.js creators. Build AI-powered applications and agents with streaming, tool use, multimodal, and agent orchestration.
DeepTutor: Agent-Native Personalized Learning Assistant.
A Claude Code plugin that shows what's happening — context usage, active tools, running agents, and todo progress for enhanced agent workflow visibility.
A memory upgrade for coding agents. Provides persistent contextual memory for Claude Code, Codex, and other coding agents to improve long-task consistency.
Roo Code is an autonomous coding agent extension for VS Code and JetBrains that can create/edit files and run terminal commands directly in your editor.
A2A (Agent-to-Agent) Protocol is an open protocol by Google enabling interoperability and collaborative communication between AI agents built across different frameworks and vendors.
AI-powered PPT generation tool that creates natively editable PPTX from any document, producing real PowerPoint shapes instead of images.
Agent harness built with LangChain and LangGraph. Equipped with a planning tool, filesystem backend, and ability to spawn subagents for complex agentic tasks.
A web scraping and browser automation library for Node.js to build reliable crawlers, supporting Puppeteer, Playwright, Cheerio, and raw HTTP. Extract data for AI, LLMs, RAG, or GPTs with proxy rotation and both headful and headless modes.
Write HTML. Render video. Built for agents. An open-source tool from HeyGen that turns HTML templates into video content.
Chat with your SQL database using natural language. Accurate Text-to-SQL Generation via LLMs using Agentic RAG.
A universal CLI Hub and AI-native runtime that transforms any website, Electron app, or local binary into a standardized command-line interface built for AI agents.
MCP Python SDK is the official Python implementation for building MCP servers and agent-side integrations with a standardized tool protocol.
Letta (formerly MemGPT) is an open-source framework for building stateful AI agents with advanced reasoning and transparent long-term memory. It allows you to visually test, debug, and observe agents.
Crawl4AI is a web crawling toolkit for LLM and agent systems, offering structured extraction, site traversal, cleanup, and crawl controls for external knowledge acquisition.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
The SDK for browser agents by Browserbase. Provides act, extract, and observe primitives for AI agents to naturally browse and interact with web pages.
Claude Code skill implementing Manus-style persistent markdown planning — the structured workflow pattern for agent task management.
Activepieces is an open-source AI workflow automation platform with 400+ MCP servers for AI agents, enabling no-code business process orchestration.
A workflow orchestration framework for building resilient data pipelines and AI workflows in Python, with task scheduling, state management, and failure recovery from local to distributed deployments.
GenAI Agents is a comprehensive collection of 50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
MCP server for Blender that enables AI agents to directly control the 3D modeling software for natural language-driven scene creation, model manipulation, and rendering automation.
A powerful GUI app and Toolkit for Claude Code — create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
A simple, open format for guiding coding agents. Define agent behavior, rules, and skills through structured AGENTS.md files to help AI coding assistants better understand project requirements.
Jina AI Serve is a cloud-native framework for building multimodal AI applications, supporting RAG pipelines, agent systems, and multimodal search.
Test and evaluate LLM prompts, agents, and RAG pipelines. Built-in red teaming and security evaluation for reliable AI applications.
Promptfoo is an evaluation and regression testing tool for LLM apps and agents, useful for comparing prompts, tool-call results, and model outputs over time.
Skyvern is an agent platform for browser task automation, using page understanding and action planning to complete complex web workflows such as forms and back-office tasks.
Open-source vector similarity search extension for PostgreSQL, enabling native vector storage and ANN retrieval in relational databases, a foundational component for building agent memory and RAG systems.
OpenAI Swarm is a lightweight multi-agent collaboration framework focused on simplicity and controllability, ideal for learning and prototyping.
Guidance is a programming framework for controlling LLM output, supporting structured generation, constrained decoding, and templated prompts to ensure outputs conform to predefined formats.
An MCP server for Claude Desktop, Claude Code, Windsurf, and Cursor to build n8n workflows through natural language interaction.
MaxKB is an open-source knowledge base Q&A and agent building platform powered by LLMs, with vector retrieval, workflow orchestration, and multi-model support out of the box.
Open-source AI agent development platform from Coze, providing visual tools to simplify agent creation, debugging, and deployment with one-click publishing to multiple channels.
All-in-one RAG framework supporting text, images, tables, equations and more document formats for retrieval-augmented generation with unified knowledge QA.
Give your AI agent eyes to see the entire internet. Read and search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu with one CLI and zero API fees.
Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents. Manage multiple coding agent sessions efficiently.
A persistent memory system for AI coding agents, designed around real-world benchmarks to preserve context across sessions.
Turn Claude Code into a full game dev studio with 49 AI agents, 72 workflow skills, and a complete coordination system mirroring real studio hierarchy.
Garry Tan's opinionated OpenClaw/Hermes Agent Brain with optimized system prompts and agent workflows.
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
Dyad is a local AI app builder that lets users create and iterate on web applications through natural language conversations, supporting multiple LLM providers.
Vercel's official AI SDK-based chatbot template featuring streaming output, model invocation, message persistence, and a modern UI.
Google Agent Development Kit (ADK) is Google's agent development framework for building complex AI agent systems with tool integration and multimodal processing capabilities.
An autonomous AI agent loop that runs repeatedly until all PRD items are complete, automating the entire development cycle.
OWL (Optimized Workforce Learning) is a multi-agent collaboration framework for real-world task automation, decomposing and executing complex tasks through agent interaction.
An autonomous company operating system powered by AI agents, providing intelligent workflow automation for research, data analysis, customer communication and other business processes.
KiloCode is an all-in-one open-source coding agent platform for VS Code and JetBrains, integrating 200+ models with autonomous coding, debugging, and iteration capabilities.
Open-source agentic software engineer and Devin alternative with planning, reasoning, web browsing, and multi-model support.
Opik is an open-source LLM observability platform providing agent tracing, evaluation testing, and prompt experiment management to help developers monitor and optimize AI agent systems.
bolt.diy is an open-source platform to prompt, run, edit, and deploy full-stack web applications using any LLM you want, providing a visual development environment for AI-powered app creation.
SWE-agent takes a GitHub issue and automatically generates fixes using your LLM of choice, also applicable to cybersecurity auditing and competitive coding. NeurIPS 2024 paper.
Open-source deep research agent from Alibaba Tongyi Lab, using multi-stage iterative information retrieval and reasoning to conduct deep analysis, synthesis, and summarization of complex topics with web search and document analysis.
AI-powered research assistant that performs iterative deep research on any topic by combining search engines, web scraping, and LLMs.
Open source real-time audio/video infrastructure for AI agents. WebRTC transport, agent framework, SIP telephony, and real-time transcription.
DB-GPT is an open-source agentic AI data assistant framework integrating multi-agent collaboration, RAG, and AWEL workflow engine, purpose-built for AI+Data applications.
OpenAI's framework for evaluating LLMs and LLM systems, providing an open-source registry of benchmarks and tools for systematic model assessment.
An autonomous agent framework for everyone, built in TypeScript with multi-platform deployment support and a rich plugin ecosystem for conversational AI agents and social bots.
Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click. Features multi-role AI teachers, intelligent Q&A, and personalized learning paths to redefine online education.
Page Agent is a JavaScript in-page GUI agent by Alibaba that controls web interfaces with natural language, enabling automated form filling, page navigation, and element interaction.
Official Google Gemini fullstack quickstart using LangGraph. Complete React + Python implementation for building production AI agent applications.
Turn any API into a paid MCP service instantly. Helps developers wrap existing APIs into MCP-compatible agent tools for AI agent capability extension.
High-performance in-browser LLM inference engine — run large language models directly in the browser using WebGPU, no server-side computation needed.
The interaction control harness for customer-facing AI agents, optimized for building controlled, consistent and predictable customer interactions with LLMs.
Use Neovim like the Cursor AI IDE. AI-powered code generation, editing, and chat deeply integrated into the Neovim ecosystem.
A private AI platform for agents, assistants, and enterprise search with built-in agent builder, deep research, document analysis, and multi-model support.
Agent Zero is an open-source Python framework for building autonomous general-purpose AI agents with tooling and OS-level integration.
Agent Zero is a general-purpose AI agent framework supporting autonomous task planning, tool use, and code execution for building self-directed AI assistants.
PUA is a highly proactive AI agent skill that motivates agents to continuously improve and deliver high-quality results within 30 days, using a performance-driven persona approach.
Open-source Agent Operating System.
A knowledge engine for AI agent memory that builds knowledge graphs and memory layers in 6 lines of code, supporting graph databases, vector stores, and more for knowledge extraction and retrieval.
SuperAGI is a dev-first open-source autonomous AI agent framework for building, managing, and running useful autonomous agents quickly and reliably.
CUA provides open-source infrastructure for Computer-Use Agents, including sandboxes, SDKs, and benchmarks to train and evaluate AI agents that control full desktops (macOS, Linux, Windows).
PydanticAI builds agents on top of type systems, emphasizing verifiable data structures, tool calling, and production-grade reliability.
PydanticAI — an AI agent framework built with the Pydantic way. Provides type-safe structured output, dependency injection, and multi-model support for building reliable AI agents.
Fully autonomous AI Agents system capable of performing complex penetration testing tasks using multi-agent architecture with support for multiple LLM providers.
Google Gemini official Cookbook with examples and tutorials for building agents, function calling, and multimodal applications.
Agent Lightning is Microsoft's open-source training framework for AI agents, using reinforcement learning to enhance agent capabilities.
A flexible framework for experiencing heterogeneous LLM inference and fine-tuning optimizations — run large language models efficiently on consumer hardware with kernel-level optimizations.
CAMEL is an open-source framework for multi-agent collaboration, supporting role-play, task decomposition, and coordinated execution.
232+ skills & agent plugins for Claude Code, Codex, Gemini CLI, Cursor, and 8 more coding agents — covering engineering, marketing, product, compliance, and C-level advisory.
Agent framework built on Qwen LLM, featuring function calling, MCP tool integration, code interpreter, RAG, and browser extension support.
ChatALL lets you concurrently chat with ChatGPT, Bing, Bard, Claude, ChatGLM, and many more LLMs to discover the best answers through side-by-side comparison.
Context Mode is a context window optimization tool for AI coding agents that sandboxes tool output for 98% context reduction across 12 major platforms.
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems.
Weaviate is an open-source vector database that stores objects and vectors, allowing for combining vector search with structured filtering. It has built-in vectorization modules and supports multimodal data search.
Production-grade platform for building agentic IM bots supporting Discord, Slack, LINE, Telegram, WeChat, Feishu, DingTalk, QQ, and more with Agent orchestration, knowledge base, and plugin system.
RagaAI Catalyst is an observability, monitoring, and evaluation framework for Agent AI, supporting agent/LLM/tool tracing, multi-agent debugging, and self-hosted dashboard analytics.
Clone any website with one command using AI coding agents. Built with Next.js, React, and shadcn-ui with web scraping and automated code generation.
A web interface for running AI agents in the browser, providing a visual experience for browser automation operations.
A multi-agent orchestration system inspired by ancient governance structures, featuring 9 specialized AI agents with a real-time dashboard, model configuration, and full audit trails for complex multi-agent collaboration scenarios.
Tencent's open-source LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG.
DeepEval is an open-source evaluation framework for LLM applications. It provides rich evaluation metrics and tools, supporting unit testing and integration testing to help developers build reliable LLM applications.
DeepCode is an open agentic coding platform supporting Paper2Code, Text2Web, and Text2Backend, leveraging agent technology for automated software development workflows.
Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic access to NotebookLM features including capabilities the web UI doesn't expose, via Python, CLI, and AI agents like Claude Code, Codex, and OpenClaw.
Multi-agent workspace manager that supports agent team collaboration, task scheduling, and resource allocation. Provides a unified workspace view for efficient multi-agent coordination.
An orchestration platform for developing, producing, and observing data assets and AI workflows, with built-in asset definitions, scheduling, and monitoring.
MemVid is a long-term memory layer for AI agents that uses video encoding for lightweight single-file storage, replacing complex RAG pipelines with instant retrieval.
Hindsight is an agent memory system that learns autonomously, supporting memory retention, recall, and reflection to give AI agents persistent experiential memory.
MCP Toolbox is an open-source MCP server for databases by Google, enabling agent access to PostgreSQL, MySQL, BigQuery, Spanner, and more.
Open source AI coding agent designed for large projects and real world tasks, providing terminal-based code generation with multi-step planning and file management.
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language and get accurate SQL, charts, and BI insights. Supports 12+ data sources and any LLM.
ChuanhuChatGPT is a lightweight GUI for ChatGPT API and many LLMs, supporting agents, file-based QA, web search, and GPT finetuning with a neat UI.
Trigger.dev is an open-source platform for background jobs and workflow automation, well suited for long-running asynchronous agent execution in production.
Agent-native memory infrastructure that turns agent execution and conversation into structured, persistent state with an LLM-agnostic memory layer, MCP integration, and Python/TypeScript dual SDK support.
OpenAI Agents SDK is OpenAI's official agent development toolkit for building multi-step workflow AI agents, with core features like tool calling and state management.
MCP server that provides Figma layout information to AI coding agents like Cursor, enabling precise design-to-code conversion.
An open-source AI coworker with persistent memory, supporting multi-turn conversations and context retention for knowledge management and collaborative task completion.
llmware is a unified enterprise RAG framework for deploying small specialized models, featuring knowledge graphs, document parsing, vector indexing, and agent toolchains for building private, compliant AI applications.
Unstructured provides document parsing and cleaning capabilities, commonly used in RAG ingestion and preprocessing pipelines.
Botpress is an open-source conversational AI platform with a visual flow editor, knowledge base integration, multi-channel deployment, and GPT/LLM agent building capabilities for enterprise chatbot development.
Agentic AI Infrastructure for magnifying HUMAN capabilities.
GuiZang PPT Skill is an AI agent skill for generating polished HTML slide decks, featuring editorial magazine and Swiss layouts, image prompts, social covers, and a WebGL/low-power presentation runtime.
A CLI tool for code structural search, lint, and rewriting based on AST. Written in Rust, supports 20+ languages, providing precise code pattern matching for AI coding agents.
Browser Harness: a self-healing harness that enables LLMs to complete any task.
Ragas is a framework for evaluating RAG (Retrieval Augmented Generation) systems. It provides various evaluation metrics including faithfulness, answer relevance, context precision, helping developers optimize RAG application performance.
QAnything is an open-source local knowledge base Q&A system by NetEase Youdao, supporting any file format with offline RAG capabilities for building private knowledge Q&A.
AG-UI is the open-source implementation of the Agent-User Interaction Protocol, defining a standardized interaction protocol between AI agents and frontend applications, initiated by the CopilotKit team.
Structured text generation library using regex, JSON Schema, and context-free grammars to constrain LLM outputs and ensure generated content matches specified formats.
A memory system for 24/7 proactive agents with MCP protocol integration, providing long-term memory management, skill storage, and proactive reasoning capabilities for continuously running AI agents.
agency-agents-zh is a community-maintained knowledge base and tutorial hub for AI agent development, RAG, and LLM engineering.
754 structured cybersecurity skills for AI agents mapped to 5 frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND and NIST AI RMF. Works with Claude Code, Codex CLI, Cursor, Gemini CLI and 20+ platforms.
AI-powered vision-driven UI automation that lets you describe actions in natural language instead of writing selectors, supporting browser and mobile platforms.
An automated penetration testing agentic framework powered by large language models for security testing and vulnerability discovery.
OpenHarness is an open agent harness platform with a built-in personal agent called Ohmo, providing an integrated solution for agent development, testing, and deployment.
An introductory guide to context engineering - the systematic approach to building high-quality context for AI coding assistants, centered on Claude Code but applicable to any AI coding tool.
The official Lark/Feishu CLI tool maintained by the larksuite team, covering Messenger, Docs, Base, Sheets, Calendar, Mail, Tasks, Meetings and more with 200+ commands and 20+ AI Agent Skills for both humans and AI agents.
Microsoft's R&D automation agent focused on data-driven and model-driven AI development processes, automating high-value R&D to enhance industrial productivity.
Deploy headless browsers in Docker. Run on cloud or bring your own infrastructure. Provides powerful web automation and rendering capabilities for AI agents. Free for non-commercial use.
Instructor is a Python library providing structured outputs for LLMs using Pydantic models, enabling AI agents to receive reliable typed responses — a key building block for agent tool-use.
NanoBrowser is an open-source Chrome extension for AI-powered multi-agent browser automation, supporting web task workflows with your own LLM API key.
An agent framework connecting digital humans (2.5D/3D/mobile/web) with business systems, compatible with OpenAI and DeepSeek LLMs.
A powerful AI coding agent. Built for the terminal. Supports code generation, refactoring, debugging with intelligent suggestions and automated workflows.
A framework for few-shot evaluation of language models by EleutherAI, providing standardized evaluation pipelines supporting hundreds of benchmark tasks and widely adopted as a core LLM evaluation tool in the community.
All-in-one AI framework for semantic search, LLM orchestration, and language model workflows with agent support, RAG, and a vector database.
Pipecat is an open-source framework for voice and multimodal conversational AI, enabling real-time voice assistants, video bots, and multimodal agents with integrated TTS, STT, and LLM services.
MCP TypeScript SDK is the official TypeScript implementation for building MCP servers and clients, standardizing protocol integrations across JS/TS agent ecosystems.
A collection of projects showcasing RAG, agents, workflows, and other AI use cases with practical examples and tutorials.
Industry-first professional AI Agent platform for controllable film and video production, covering the entire pipeline from shorts to live-action.
Waoowaoo is the industry-first professional AI agent platform for controllable film and video production, offering Hollywood-standard workflows from shorts to live-action with multi-agent collaboration for scriptwriting, storyboarding, and generation.
E2B provides secure cloud sandboxes for AI agents, supporting code execution, file operations, and isolated compute as an execution layer for coding and automation workflows.
A self-evolving agent framework that grows a skill tree from a 3.3K-line seed, achieving full system control with 6x less token consumption.
Run any open-source LLMs such as DeepSeek and Llama as OpenAI-compatible API endpoints in the cloud. Supports fine-tuning, quantization, and distributed inference for production-grade LLM deployment.
OCR and document extraction tool using vision models, efficiently converting PDFs and images into structured text.
LangChain4j is a Java library that simplifies LLM integration through a unified API, supporting popular models and vector databases with built-in RAG, tool calling, MCP, and agent capabilities that integrate seamlessly with enterprise Java frameworks.
Python framework for building production-grade conversational AI interfaces with multimodal support, file uploads, intermediate step visualization, and agent workflow display.
Chainlit is an open-source UI and development framework for LLM and agent chat applications, enabling fast delivery of interactive assistants.
Locally runnable version of Claude Code with cross-platform desktop app and Computer Use capabilities, including core module analysis.
LLM is Simon Willison's open-source CLI and plugin framework for working with multiple models through one interface, with embeddings, templates, tool extensions, and lightweight agent workflows.
Portkey AI Gateway is a blazing fast AI gateway with integrated guardrails, routing to 200+ LLMs with 50+ AI guardrails through a single fast and friendly API.
eBPF-powered network observability for Kubernetes. Indexes L4/L7 traffic with full K8s context, queryable by AI agents via MCP and humans via dashboard.
Creative Tim UI provides open-source components, blocks, and AI agents designed to speed up your workflow, importable seamlessly into your favorite tools through Registry and MCPs.
Library to expose FastAPI endpoints as Model Context Protocol tools with authentication support, enabling AI agents to call existing APIs directly.
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device. Published at MLSys 2026.
A Chrome extension-based MCP server that exposes browser functionality to AI assistants, enabling complex browser automation, content analysis, and semantic search.
Open-source BGE series embedding models and retrieval tools from BAAI, providing state-of-the-art text embeddings and rerankers for Chinese and English, widely used in RAG systems and agent retrieval pipelines.
Open-source agentic framework that uses computers like a human, capable of completing complex GUI tasks with autonomous learning and experience accumulation.
A minimalistic AI-powered search engine that helps you find information on the internet and cites it too. Powered by Vercel AI SDK.
Code search MCP for Claude Code and coding agents. Makes entire codebases available as context for AI coding assistants using vector-based semantic code search for precise understanding of large projects.
Trae Agent is an LLM-based agent for general purpose software engineering tasks.
Eino is an open-source Go framework from ByteDance for building LLM applications, offering type-safe orchestration, streaming, tool calling, and RAG pipelines for high-performance AI agent applications.
Open-source web UI for Claude Code, Cursor CLI, and Codex enabling remote management of AI coding sessions and projects from mobile and web.
PAL MCP Server unifies Claude Code, Gemini CLI, and Codex CLI with multiple LLM providers (Gemini, OpenAI, OpenRouter, Azure, Grok, Ollama, and custom models) into a single collaborative MCP service.
Code editor for the AI agents era that runs an army of Claude Code, Codex, and other coding agent instances in parallel.
The original open-source AI PR reviewer. Automatically analyzes pull requests and generates code review feedback, improvement suggestions, and PR descriptions across GitHub, GitLab, and Bitbucket.
TensorZero is an open-source inference gateway and optimization platform for LLM apps and agent systems, focused on high-performance serving, experimentation, routing, and production observability.
Open-source LLM DevOps platform providing one-stop AI application development with GenAI workflow, RAG, Agent, model management, evaluation, and enterprise system administration.
ARIS (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in.
The open-source Agentic browser that transforms your browser into an AI-powered operating system. Alternative to ChatGPT Atlas, Perplexity Comet, and Dia.
Promptflow is a development and evaluation toolkit for LLM applications and agent workflows, with visual orchestration and debugging.
Microsoft's comprehensive multi-language framework for building, orchestrating, and deploying AI agents and multi-agent workflows with support for Python and .NET.
LlamaGPT is a self-hosted, offline ChatGPT-like chatbot powered by Llama 2. 100% private with no data leaving your device, with Code Llama support and one-click deployment via Umbrel.
OpenSandbox is an open-source, secure, fast, and extensible sandbox runtime for AI agents, developed by Alibaba.
HumanLayer provides a human-in-the-loop layer for AI coding agents, enabling them to seek human approval and guidance when solving hard problems in complex codebases.
A Postgres-based backend platform built for coding agents, combining auth, storage, compute, hosting, and an AI gateway for rapid app development.
LiveKit Agents is LiveKit's real-time voice and multimodal agent framework for phone, assistant, and interactive use cases that need low-latency experiences.
ValueCell is a community-driven multi-agent platform for financial applications, enabling collaborative financial analysis, trading strategies, and market research through multi-agent orchestration.
HuggingChat UI is the open-source chat interface by HuggingFace powering the HuggingChat service, supporting conversations with various open-source LLMs.
PocketFlow is a minimalist 100-line LLM framework that lets Agents build Agents, enabling complex AI agent workflows through a clean abstraction layer.
GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal. Supports code generation, command suggestions, error fixing and more.
TEN Framework is an open-source framework for building conversational voice AI agents with real-time multi-modal interaction support.
An open-source embedded retrieval library for multimodal AI with zero server configuration, using the Lance columnar format for efficient vector search and filtering, ideal for agent memory and RAG applications.
Hive is a production-ready multi-agent execution harness providing state management, failure recovery, observability, and human-in-the-loop control with auto-generated multi-agent topologies for complex business workflows.
TypeScript/React component library for building AI chat interfaces with customizable, production-ready UI components supporting multiple AI providers.
A complete search engine and RAG pipeline in your browser, server, or edge network. Supports full-text, vector, and hybrid search in less than 2 KB. Perfect for building AI-powered search experiences anywhere.
Universal skills loader for AI coding agents. One-command installation of skill packages. Extends agent capabilities with code review, test generation, documentation writing and more.
An MCP bridge connecting AI assistants to the Unity Editor, enabling LLMs like Claude and Cursor to manage assets, control scenes, edit scripts, and automate game development tasks through natural language.
Crucix is a personal intelligence agent that watches the world from multiple data sources and pings you when something changes, helping you stay on top of information in real time.
A beautiful, highly customizable statusline for Claude Code CLI with powerline support, themes, and more.
All-in-one LLM CLI tool with Shell Assistant, Chat-REPL, RAG, AI Tools & Agents, supporting OpenAI, Claude, Gemini, Ollama, Groq, and more.
MCP Use is a Model Context Protocol orchestration project that helps agents connect to MCP servers, unify tool invocation, and improve portability across toolchains.
Phoenix is an open-source observability and evaluation tool for LLM and agent applications, supporting online tracing and offline diagnosis.
MCP Inspector is a debugging and inspection tool for the Model Context Protocol ecosystem, useful for validating MCP server behavior and troubleshooting integrations.
An open-source asynchronous coding agent by LangChain built on LangGraph, autonomously handling software engineering tasks including code generation, debugging, and file editing.
High-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other document formats.
ART (Agent Reinforcement Trainer) trains multi-step agents for real-world tasks using GRPO reinforcement learning, enabling on-the-job training for models like Qwen, Llama, and more.
A research prototype of a human-centered web agent from Microsoft Research, emphasizing human-in-the-loop interaction for collaborative web browsing and data collection tasks.
Spring AI Alibaba is an Agentic AI Framework for Java developers, built on the Spring ecosystem to provide multi-agent collaboration, workflow orchestration, and RAG capabilities.
Your favorite Terminal Coding Agent, now in Rust. Provides high performance and memory safety. Supports code generation, file editing, command execution, and a complete development workflow.
A lightweight, lightning-fast, in-process vector database by Alibaba with C++ core, Node.js and Python bindings, designed for RAG, agent memory, and vector search use cases.
OpenRLHF is a high-performance agentic RL framework based on Ray and vLLM, offering PPO, DAPO, and REINFORCE++ algorithms for large-scale training of agents and vision-language models.
Vibe-Trading is an open-source personal trading agent stack that combines LLMs, MCP tooling, and multi-agent workflows for algorithmic trading and backtesting.
MemOS is a Memory Operating System for LLMs and AI agents that unifies the storage, retrieval, and management of long-term memory, with built-in KB, multi-modal, and tool memory support.
AI-powered autonomous web browsing framework that enables agents to click, type, navigate, and extract data like a human, with support for OpenAI, Anthropic, and Google models.
Fully-automated and zero-code LLM agent framework that enables users to build and deploy custom AI agents through natural language without writing code.
LangChainGo is the Go implementation of LangChain, providing the easiest way to write LLM-based programs in Go with chains, agents, and tool integrations.
VoltAgent is an agent platform for the modern TypeScript ecosystem, focused on workflows, tool orchestration, and application integrations for production web agents.
A data-agent-ready warehouse unifying analytics, search, AI, and a Python sandbox in one system. Runs on your S3 with built-in vector search, full-text search, and Python execution for AI-powered data analysis.
HexStrike AI is an advanced MCP server that lets AI agents autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, and security research.
Official MCP server collection from AWS, providing AI agents with integration to core AWS services including Lambda, S3, DynamoDB, and Bedrock.
Next Generation Multi-tenant AI One-Stop Solution with built-in admin and billing. Enterprise-grade unified LLM gateway supporting 200+ models and 35+ providers.
AI Data Runtime for Agents. Provides serverless Postgres with a multimodal datalake, enabling scalable retrieval and training. Unifies vector storage, dataset management, and streaming data loading for AI agent workflows.
MCP server for Ghidra reverse engineering platform, enabling AI agents to autonomously perform binary analysis and vulnerability discovery.
A frontier, first-principles handbook for moving beyond prompt engineering to the wider discipline of context design, orchestration, and optimization — inspired by Karpathy and 3Blue1Brown.
UFO is a Windows GUI automation agent by Microsoft that understands screen interfaces and executes complex OS tasks through natural language commands.
An application framework for AI engineering from the Spring team, providing unified LLM integration, vector storage, function calling, RAG, and agent development for Java and Spring ecosystems with support for OpenAI, Anthropic, Ollama, and more.
Accelerate local LLM inference and finetuning on Intel XPU. Supports LLaMA, Mistral, Qwen, DeepSeek and more. Seamlessly integrates with LangChain, LlamaIndex, and other agent frameworks.
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs for building logical reasoning and factual Q&A solutions for professional domain knowledge bases, effectively overcoming the limitations of traditional RAG vector similarity models.
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services. Supports multiple programming languages and platforms with unified Agent API interface.
Go implementation of the Model Context Protocol SDK, enabling seamless integration between LLM applications and external data sources and tools.
Microsoft's TypeScript library replacing prompt engineering with type definitions for structured LLM output, using TypeScript interfaces to define AI response contracts.
A private and local AI personal knowledge management app. All data and processing stay on-device with built-in RAG, semantic search, and knowledge graph features for managing personal knowledge bases with full privacy.
An autonomous LLM agent framework from the OpenBMB team for complex task solving, with automatic task decomposition, tool usage, and multi-step reasoning.
Enterprise-grade agentic workflow platform from iFlytek, offering commercial-friendly SuperAgent building capabilities with complex workflow orchestration and multi-agent coordination.
Polyglot document intelligence framework with a Rust core, extracting text, metadata, and structured data from PDFs, Office documents, images and 91+ formats via MCP server, CLI, and REST API.
An amazing UI for OpenAI's ChatGPT with enhanced conversation management, prompt templates, and model parameter tuning across Web, Windows, macOS, and Linux.
mcp-agent is LastMile AI's toolkit for building agents around Model Context Protocol integrations, making it easier to connect MCP tools into multi-step workflows.
BAML is an AI framework that adds engineering rigor to prompt engineering, offering type-safe prompt definitions, automatic testing, version management, and multi-model support across Python, TypeScript, Ruby, Java, C#, Rust, and Go.
Model Context Protocol (MCP) is an open protocol initiated by Anthropic, defining standardized interfaces for AI models to interact with external tools and data sources — the infrastructure of the agent tool-use ecosystem.
The first GitHub Copilot, Codeium, and ChatGPT Xcode Source Editor Extension, bringing AI code completion and chat directly into Apple development workflows.
A 24/7 online AI agent team that automates information collection, data analysis and content generation for continuous operations.
A deep research agent framework optimized for complex research and prediction tasks, with MiroThinker-1.7 and MiroThinker-H1 models achieving 74.0 and 88.2 on BrowseComp benchmark, supporting multi-step reasoning and information retrieval.
GitMCP is a free remote MCP server that enables AI agents to understand and access any GitHub project repository, eliminating code hallucinations.
An extensible workflow development framework with built-in canvas, form, variable, and material components that helps developers build AI workflow platforms faster and more simply.
An open-source, code-first Go toolkit by Google for building, evaluating, and deploying sophisticated AI agents with flexible tool integration, multi-turn conversation management, and streaming responses.
PraisonAI is a low-code multi-agent framework with handoffs, guardrails, memory, RAG, and 100+ LLM providers, deployable to Telegram, Discord, and WhatsApp.
NVIDIA's open-source LLM vulnerability scanner that automatically detects security issues in language models including safety vulnerabilities, hallucination tendencies, jailbreak risks, and prompt injection attacks.
Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider, using Stream's edge network for ultra-low latency realtime interactions.
A production-ready Agentic RAG system with RESTful API, featuring multimodal document ingestion, hybrid search, knowledge graph construction, and agent-driven retrieval-augmented generation workflows.
Agent framework designed for fintech and enterprise scenarios, providing task orchestration, tool integration, and production-grade reliability with multi-LLM backend support.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
A next-generation Android RPA agent framework that enables intelligent device control through agent-driven automation, with smart UI element recognition and automated operations for mobile testing and intelligent assistants.
An open-source AI presentation generator and API that creates professional slides from text, as an alternative to Gamma, Beautiful AI and Decktopus.
GSD-2 is a powerful meta-prompting, context engineering, and spec-driven development system that enables agents to work autonomously for long periods without losing track of the big picture.
An open-source RAG chatbot powered by Weaviate vector database, supporting multiple data import methods, LLM backends, and embedding models for out-of-the-box retrieval-augmented generation.
Sweep is an AI coding assistant for JetBrains IDEs that automatically resolves GitHub issues and submits code changes, automating software development workflows.
Manage multiple AI terminal agents like Claude Code, Codex, OpenCode, and Amp in a unified terminal interface.
A vector search SQLite extension. Add vector similarity search to SQLite with float32/int8 vectors — ideal for local RAG applications.
Flexible and powerful framework for managing multiple AI agents and handling complex conversations.
Agent Squad is an open-source multi-agent orchestration framework from AWS for managing multiple AI agents and handling complex conversations.
The GEP-Powered Self-Evolution Engine for AI Agents — enables agents to autonomously optimize and evolve using Genome Evolution Protocol for continuous capability improvement.
Mintlify is a developer documentation and AI-search platform that gives agent toolchains, SDKs, and APIs a structured knowledge surface for both humans and assistants.
Evidently is an open-source ML and LLM observability framework with 100+ metrics for evaluating, testing, and monitoring any AI-powered system or data pipeline.
Yao is a single-binary runtime to build and run autonomous agents: no Python, no Node.js, just define the role. Provides a lightweight, high-performance agent development framework.
Build modular and scalable LLM Applications in Rust. Provides agent orchestration, tool-use, RAG pipelines, and other core capabilities for high-performance AI agent systems.
The fastest and most accurate file search toolkit for AI agents, Neovim, Rust, C, and Node.js environments.
An agentic orchestrator for parallel coding agents that plans tasks, spawns agents, and autonomously handles CI fixes, merge conflicts, and code reviews for complex development workflows.
A desktop app providing a graphical interface for OpenClaw AI agents — turns CLI-based AI orchestration into a desktop experience without using the terminal.
Refly is the first open-source agent skills builder. Define skills through vibe workflows and run them on Claude Code, Cursor, Codex and more. Skills are infrastructure, not prompts.
Claude Code skill for generating production-quality SVG and PNG technical diagrams — supports 8 diagram types, 5 visual styles, and deep AI/Agent domain knowledge.
🪓 An orchestration engine for background tasks, AI agents, and durable workflows.
Fast, flexible LLM inference engine built in Rust — supports multiple model architectures and quantization schemes for high-performance local LLM deployment.
OpenLLMetry is an open-source observability tool for LLM applications based on OpenTelemetry, providing tracing, metrics, and monitoring capabilities.
Official Python SDK from Anthropic for building Claude-powered AI agents with tool use, multi-turn conversations, and agent orchestration.
Open-source AI agent platform for financial analysis using LLMs, featuring intelligent research, market forecasting, and automated financial report generation.
Steel Browser is an open-source browser sandbox purpose-built for AI agents and applications. It provides a full browser API with session management, proxy integration, and built-in anti-detection, enabling web automation without infrastructure headaches.
An open-source AI chat frontend for self-hosted models and multi-model chat scenarios, providing a clean conversation interface and basic configuration.
Dynamic, resilient AI orchestration platform. Coordinate data, models, and compute as you build AI workflows with scalable ML pipeline and production-grade workload management.
OpenCompass is a comprehensive LLM evaluation platform supporting a wide range of models including Llama, Mistral, GPT-4, Qwen, GLM, and Claude across 100+ benchmark datasets.
big-AGI is a feature-rich AI suite providing multi-model parallel chats, AI personas, text-to-image, voice synthesis, code highlighting and execution, PDF import, and more. Deploy on-prem or in the cloud.
Guardrails AI adds programmable guardrails to large language models, ensuring reliability and safety through input/output validation, structured data extraction, and custom validators.
Open-source AI code generation tool built on Llama models, generating complete code projects from natural language descriptions — a local alternative to Claude Artifacts.
The official community-driven registry service for Model Context Protocol servers, providing discovery, publishing, and version management for the MCP ecosystem.
A demonstration of advanced agentic patterns built on top of OpenAI's Realtime API, showcasing real-time voice interaction and multi-agent collaboration.
An LLM-based multi-agent framework for web search engines, similar to Perplexity.ai Pro and SearchGPT, enabling intelligent web search.
An autonomous novel writing AI agent where multiple agents write, audit and revise novels with human review gates for quality control.
TalkToFigma is an MCP integration tool that enables AI agents (Cursor, Claude Code) to communicate with Figma for reading designs and modifying them programmatically.
Turn any AI agent into a living microservice that is interoperable, observable and composable, enabling standardized agent communication and orchestration.
Swarms is an enterprise-grade production-ready multi-agent orchestration framework for deploying and scaling collaborative AI agent swarms.
EverOS is a platform for building, evaluating, and integrating long-term memory for self-evolving agents, enabling AI agents to continuously accumulate experience and optimize themselves.
Smart model routing for personal AI agents that cuts costs up to 70% by dynamically selecting the optimal LLM for each request.
AppAgent is an LLM-based multimodal agent framework designed to operate smartphone apps like a human, supporting touch interaction and autonomous exploration.
OpenGPTs is an open-source alternative to GPTs by the LangChain team. Provides a self-hostable agent platform with multiple LLM backends, RAG, and tool-use capabilities.
stagewise is a purpose-built browser for developers with a coding agent integrated right in, enabling direct code interaction from the web interface.
Token compression tool that compresses tool outputs, logs, files, and RAG chunks before they reach the LLM, cutting token counts by 60-95% while maintaining answer quality. Available as a library, proxy, or MCP server.
LLM-driven extraction of unstructured data, built for API deployments and ETL pipeline workflows. Automates document parsing, PDF extraction, and intelligent data processing with LLM-powered intelligence.
OpenShell is the safe, private runtime for autonomous AI agents, developed by NVIDIA. Provides controlled execution environments and resource management.
Superagent protects AI applications against prompt injections, data leaks, and harmful outputs, embedding safety directly into your app.
Julep is a serverless AI workflow deployment platform for building and scaling AI agent applications, described as Firebase for AI agents.
BrowserMCP is a browser extension-based MCP server that allows AI applications like Claude and Cursor to directly control and automate your browser.
Rediscover your social memories with local, AI-powered analysis. Import chat histories from multiple platforms and analyze them with AI agents for insights and visualization.
An AI-native proxy and data plane for agentic apps with built-in orchestration, safety, observability, and smart LLM routing so developers can focus on agent core logic.
An AI agent for Street Fighter II Champion Edition that demonstrates autonomous gameplay through visual recognition and reinforcement learning techniques.
IntentKit is an open-source, self-hosted cloud agent cluster that manages a collaborative team of AI agents for complex task completion.
Microsoft AI call center solution. Place phone calls from an AI agent via API, or call the bot directly from a configured phone number.
Official Firecrawl MCP Server that adds powerful web scraping and search capabilities to Cursor, Claude, and other LLM clients.
Ship AI Agents to Google Cloud in minutes, not months. Production-ready templates with built-in CI/CD, evaluation, and observability.
An attempt to engineer prompts that help us understand AI agents. Research into agent reasoning mechanisms through prompt engineering.
OpenSpace is a platform that makes your agents smarter, lower-cost, and self-evolving, optimizing agent architectures and reasoning workflows for efficient autonomous evolution.
A production-focused Agentic RAG course teaching how to build scalable, reliable RAG agent systems with indexing strategies, retrieval optimization, and monitoring.
Secure, local, cross-platform and programmable sandboxes for AI agents. Provides strict resource isolation using microVM technology.
A customer service demo built with the OpenAI Agents SDK, demonstrating tool use, context management, and intelligent customer support workflows.
Open-source context retrieval layer for AI agents that automatically extracts, indexes, and retrieves structured context from diverse data sources.
LaVague is a Large Action Model (LAM) framework for developing AI web agents, combining RAG techniques for natural-language-driven browser automation.
An LLM playground you can run on your laptop. Compare models side-by-side for prompt testing and model evaluation in a local environment.
NVIDIA NeMo Guardrails is an open-source toolkit for adding programmable guardrails to LLM-based conversational systems, supporting topic control, safety enforcement, and dialog guidance.
Materialize is the live data layer for apps and AI agents, creating up-to-the-second views into business data using SQL, providing agents with real-time, accurate data querying capabilities.
Open Multi-Agent is a lightweight TypeScript multi-agent framework that auto-decomposes tasks and executes them in parallel with a single runTeam() call. Only 3 dependencies, deploys anywhere Node.js runs.
From a goal to a task DAG, automatically. TypeScript-native multi-agent orchestration with MCP and live tracing. Three runtime dependencies.
An open-source deep research clone. An AI agent that reasons over large amounts of web data extracted with Firecrawl.
Camofox Browser is a headless browser automation server powered by Camoufox, a Firefox fork with C++-level fingerprint spoofing. It bypasses Google, Cloudflare, and most bot detection, providing token-efficient accessibility snapshots and stable element references for AI agents.
A Claude Skill that gives your AI coding agent the ability to use a web browser for browser automation.
Transforms PDF, documents and images into enriched structured data with table recognition, reading order restoration, and Markdown output.
TaskWeaver is Microsoft's open-source code-interpreter-style agent framework, suitable for data analysis and complex task automation.
DesktopCommanderMCP is an MCP server that gives AI assistants like Claude terminal control, file system search, and diff file editing capabilities.
A high-performance, secure sandbox service for AI agents by Tencent Cloud, built on RustVMM and KVM with hardware-level isolation, sub-60ms cold start, <5MB memory overhead, and E2B SDK compatibility.
Zero-Config Code Flow for Claude Code and Codex, providing one-click project initialization and context management.
Strands Agents SDK is an AWS open-source agent framework using a model-driven approach to build AI agents with built-in tool use, conversation memory, and multi-agent collaboration.
DevOpsGPT is a multi-agent system for AI-driven software development that combines LLMs with DevOps tools to convert natural language requirements into working software, supporting any development language and extending existing codebases.
A completely locally running search aggregator using LLM agents. Users can ask questions and the system uses a chain of LLMs to find answers without any external API keys.
Atomic Agents is a modular AI agent building framework with an atomic design philosophy, providing composable components including tools, pipelines, and memory management for constructing agent systems.
Curated collection of system prompts for top AI tools. Perfect for AI agent builders and prompt engineers. Includes ChatGPT, Claude, Perplexity, Manus, Claude Code, Lovable, v0, Grok, same.new, Windsurf, Notion, and Meta AI.
Multi-model AI agent desktop client that connects to any AI provider, extends with MCP and skills, and supports remote control from your phone. Built with Electron and Next.js.
Windows MCP is an MCP server for the Windows desktop, providing AI agents with computer-use capabilities for desktop automation and system operations.
AIOS: AI Agent Operating System - a foundational runtime for large-scale deployment and management of LLM agents with scheduling, memory management, and tool registration.
MCP server and CLI by Sentry providing AI agents with build, test, and development tools for iOS and macOS projects.
Open-source LLM observability platform with one-line integration, providing request logging, caching, rate limiting, cost tracking, and experimentation.
Helicone is an open-source proxy and observability platform for LLM applications, offering request tracing, caching, and cost analytics.
Deep-dive research reports on AI Agent source code — systematic analysis of mainstream AI agent framework architectures, core principles, and implementation details.
MCP integration platform that lets AI agents use tools reliably at any scale, providing MCP servers, clients, and integration solutions for production agent workflows.
PySpur is a visual agent workflow editor that supports drag-and-drop construction of AI agent pipelines with built-in evaluations and human-in-the-loop support.
WhatsApp MCP Server provides AI assistants with WhatsApp messaging capabilities, enabling LLMs like Claude to interact with WhatsApp directly via the MCP protocol.
A TypeScript tool for orchestrating sandboxed coding agents with secure execution environments powered by sandcastle.run.
A tool for managing project collaboration between humans and AI agents in a Git ecosystem.
An MCP server and CLI that turns the browser into an API, allowing AI agents to control Chrome with existing login sessions for web operations, data scraping, and automation tasks without re-authentication.
Ottomator Agents is a collection of runnable agent examples and automation patterns covering research, browser actions, tool use, and multi-step flows for practical learning.
AgentOps is an observability platform for AI agents, providing monitoring, debugging, and evaluation to help developers optimize agent performance.
Another curated list of awesome MCP servers, collecting a large number of open-source Model Context Protocol server implementations covering databases, file systems, API integrations, and more.
Cross-platform chatbot framework made with love. Supports Discord, Telegram, QQ and more through a highly extensible plugin architecture.
A low-code MCP framework for building complex and innovative RAG pipelines. Combines visual pipeline design with MCP protocol integration for end-to-end RAG — from data ingestion and chunking to retrieval and generation.
Rivet Actors are the primitive for stateful workloads. Built for AI agents, collaborative apps, and durable execution.
Native macOS harness for AI agents with any model, persistent memory, autonomous execution, and cryptographic identity. Fully offline.
Playwright Model Context Protocol server for automating browsers and APIs in Claude Desktop, Cline, Cursor IDE, and other AI coding tools.
AgentGuide is a community-maintained knowledge base and tutorial hub for AI agent development, RAG, and LLM engineering.
Next-generation AI agent optimization platform providing full-lifecycle management, from development and debugging to evaluation and monitoring, with prompt management, agent evaluation, and LLM observability.
AI Chat Browser: Fast, full webapp access to ChatGPT, Claude, Bard, Bing, Llama and more. Quick switching and parallel usage across models.
The leading workflow orchestration platform. Run stateful step functions and AI workflows on serverless, servers, or the edge with durable execution and event-driven architecture.
An observability and gateway platform for LLM applications, providing request tracing, model routing, logging, and cost analysis for agent workflows.
OpenClaw-RL: Train any agent simply by talking.
ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.
An open-source evaluation and testing library for LLM agents providing automated model scanning, bias detection, performance benchmarking, and compliance checks.
Microsoft's open-source browser and web task agent that uses large models to understand pages, plan actions, and complete real web workflows.
A proactive context-aware AI partner from ByteDance Volcengine that uses context engineering to provide AI agents with precise project understanding and code context management.
A high-performance LLM inference API and Chat UI that integrates DeepSeek R1's chain-of-thought reasoning traces with Anthropic Claude models.
A multi-tenant agent harness platform integrating LightRAG knowledge base and knowledge graphs, built with LangChain, Vue, and FastAPI, supporting DeepAgents, MinerU PDF parsing, Neo4j graph database, and MCP protocol.
MCP server for Atlassian tools (Confluence, Jira) that enables AI agents to directly read and interact with Jira issues, Confluence pages, and enterprise collaboration data.
A CLI for Git worktree management, designed for parallel AI agent workflows. Run multiple AI coding agents simultaneously across branches.
Superduper: End-to-end framework for building custom AI applications and agents.
Claude Coder is an autonomous coding agent as a VSCode extension. It transforms mockups to code, auto-fixes lint errors, writes tests, and performs complex multi-file edits with an agent mode for autonomous task execution.
The RL bridge for LLM-based agent applications, providing a simple and flexible reinforcement learning framework to optimize agent performance.
A cross-platform desktop AI assistant and MCP client compatible with major LLM providers, featuring local knowledge base support and MCP server integration for a unified chat and tool-use experience.
An AI-powered custom node for ComfyUI that enhances workflow automation through natural language interaction, with intelligent node recommendations and parameter configuration.
A 24/7 all-scenario AI agent platform by NetEase Youdao that automates various tasks with multi-model scheduling, tool integration, and intelligent workflow orchestration.
Sparrow is a structured data extraction tool that supports instruction calling with ML, LLM, and Vision LLM for extracting structured information from documents, suitable for document parsing in RAG pipelines.
Self-hosted AI agent orchestration platform for dispatching tasks, running multi-agent workflows, monitoring spend, and governing operations.
An open-source enterprise-level AI knowledge base and MCP management platform with integrated knowledge retrieval, model management, and agent chat for enterprise AI applications.
Next-generation personal AI assistant powered by LLM, RAG and agent loops, supporting computer-use, browser-use and coding agent capabilities.
Model Context Protocol server for mobile automation and scraping on iOS, Android, emulators, simulators, and real devices.
Kode CLI — Design for post-human workflows. One unit agent for every human & computer task.
ROMA (Recursive-Open-Meta-Agent) is a meta-agent framework for building high-performance multi-agent systems with recursive task decomposition and coordination.
SWE-bench is a benchmark for evaluating language models on real-world GitHub issue resolution, featuring genuine problems from popular Python repositories, now a core standard for measuring AI coding agent capabilities.
An LLM-based intelligent agent as a digital lifeform that values warmth, authenticity and genuine connection, with long-term memory and personalized conversation.
AgentVerse is a multi-agent deployment framework by Tsinghua OpenBMB, offering task-solving and simulation paradigms for collaborative multi-LLM-agent systems.
Cloudflare Agents is Cloudflare's platform for building agents on the edge runtime, combining Workers, durable state, and tool execution for low-latency production services.
Argilla is a collaboration platform for AI engineers and domain experts to build high-quality datasets, collect human feedback, and evaluate models.
21st Magic MCP is a frontend-focused MCP server providing v0-like AI component generation capabilities inside Cursor, Windsurf, Cline, and other IDEs.
RouteLLM is a framework for serving and evaluating LLM routers, enabling cost reduction without compromising quality through intelligent request routing across multiple model tiers.
The most powerful AI agent and AI chat software on Android. Supports local LLM execution, terminal operations, file management and more.
Kodezi Chronos is a debugging-first language model achieving state-of-the-art performance on SWE-bench, capable of autonomously handling software debugging and code repair tasks.
Build production-ready agentic workflows with natural language, supporting browser automation, computer use, and RAG workflows.
A simple SWE-style browser agent framework that achieves SOTA results on long-horizon web tasks.
Open-source platform for quickly building operational AI portals that launch agents, prompts, and tools. Integrates with Coze and Dify.
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container, providing a secure isolated execution environment for agents.
Kiln is an AI system building platform integrating evals, RAG, agents, fine-tuning, and synthetic data generation for end-to-end AI development.
Desktop and web interface for OpenCode AI agent, providing a graphical UI for managing agent sessions, MCP server configurations, and task execution.
Build local voice agents with open-source models. An end-to-end speech-to-speech pipeline from HuggingFace for fully local voice AI agent deployment.
A zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles, unifying tools, skills, memory, and orchestration with built-in constraints and feedback loops.
Open-source all-in-one AI productivity platform combining a generalist AI agent, workflow engine, instant messaging, and online documents.
Mini SWE-Agent is a minimalist AI agent in just 100 lines of code that solves GitHub issues or assists developers in the command line, demonstrating core coding agent capabilities with minimal implementation.
A protocol where two conversational AI agents switch from English to a sound-level protocol after confirming they are both AI agents, improving inter-agent communication efficiency.
A blazing fast inference solution for text embeddings models built in Rust, serving as core infrastructure for building RAG systems and vector retrieval pipelines with high throughput and low latency.
ByteRover CLI provides persistent structured memory for autonomous coding agents. It features context tree management, git-like version control, and cloud sync, compatible with Cursor, Claude Code, Windsurf, and 22+ coding agents via MCP integration.
AutoRAG is an open-source RAG evaluation and optimization framework using AutoML-style automation to help developers automatically find the best RAG pipeline configurations and benchmark them.
ACI.dev is an open-source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server.
Warcraft III Peon voice notifications for Claude Code, Codex, IDEs, and any AI agent — stop babysitting your terminal, employ a Peon today.
An interactive visualization tool for large embeddings by Apple. Explore, cross-filter, and search embeddings and metadata to understand and debug embedding models, vector retrieval, and RAG system behavior.
Open-source agentic development environment (YC W26) that runs multiple coding agents in parallel with any LLM provider.
An open-source Collaborative Multi-Agent OS for transparent, human-in-the-loop task coordination via Matrix rooms. Features real-time task tracking, agent status monitoring, and collaborative decision-making.
Solace Agent Mesh is an event-driven multi-agent AI framework for building and orchestrating multi-agent systems with MCP integration and complex multi-step workflows.
A memory library for building stateful agents. Provides user-level state management and persistent memory so agents can remember and understand user preferences.
Neovim AI agent done right. Deep AI integration into the Neovim editor for intelligent coding assistance.
An open-source graph-vector database built from scratch in Rust, combining graph database and vector retrieval capabilities to provide AI agents with unified storage for both knowledge graphs and semantic search.
Claude Code Ultimate Guide is a comprehensive guide for Claude Code covering best practices, advanced tips, and workflow optimization, helping developers fully leverage the capabilities of the Claude Code coding agent.
The official Go SDK for the Model Context Protocol, maintained in collaboration with Google, enabling developers to build MCP servers and clients in the Go ecosystem.
Zep is an AI agent memory management platform providing long-term memory, context management, and conversation history understanding through knowledge graph technology.
AG2 (formerly AutoGen) is an open-source AgentOS providing a multi-agent conversation framework with flexible agent orchestration, tool integration, and distributed collaboration for building complex multi-agent systems.
Dev environments in your web app — run Node.js runtime environments in the browser with full sandboxing, no server-side execution needed.
The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs for cross-platform automation.
macOS CLI and MCP server enabling AI agents to capture screenshots with optional visual question answering via AI models.
Deep Research enables deep research using any LLM provider, offering SSE API and MCP server support with OpenAI, Gemini, DeepSeek, Ollama, and more.
Official GitHub CLI extension for Agentic Workflows, enabling definition and execution of AI agent workflows within the GitHub ecosystem for automated code reviews, issue handling, and more.
Youtu Agent is a lightweight agent framework by Tencent that delivers out-of-the-box support for open-source LLMs, simplifying agent development and deployment.
An agentic framework for reflective PowerPoint generation that automates slide creation, content arrangement, and visual design.
Infinity is an AI-native database providing incredibly fast hybrid search of dense vectors, sparse vectors, tensors, and full-text, designed for LLM applications and RAG systems.
The official Exa MCP server providing AI coding assistants and chat tools with powerful web search and crawling capabilities, including semantic search, precise content extraction, and deep crawling for real-time web information.
Claude Code Router is a model routing tool for coding-agent scenarios, unifying requests across providers to optimize cost, latency, and task-specific routing strategies.
A high-performance graph database built on GraphBLAS, optimized for LLM and GraphRAG scenarios with real-time knowledge graph construction and querying for graph-structured AI agent retrieval.
The AI-native Multi-Agent development platform built on Kotlin Multiplatform, covering all 7 phases of the SDLC. Supports automated code generation, testing, deployment, and documentation across the full development workflow.
CLI that hooks into your Git workflow to capture AI agent sessions as you work — sessions are indexed alongside commits, creating a searchable record of how code was written in your repo.
The easiest way to use Agentic RAG in any enterprise. Provides out-of-the-box retrieval-augmented generation capabilities with Docker-based deployment for simplified enterprise RAG application building and management.
Shell Superpowers for AI Agents. Enhanced CLI toolkit helping AI agents execute tasks more efficiently in terminal environments.
Agency Swarm is a reliable multi-agent orchestration framework built on OpenAI API, providing structured multi-agent collaboration and communication.
Ouroboros is a spec-driven multi-agent framework that shifts from traditional prompting to specification-driven development, supporting multi-agent collaboration, MCP tool integration, and automated workflow orchestration for building high-quality agent systems.
Cognita is a modular RAG framework for production environments by TrueFoundry, supporting flexible document parsing, vector storage, and retrieval pipeline orchestration for scalable knowledge QA systems.
The UI design language and React library for Conversational UI by Alibaba, providing complete chat interface components for building customer service and conversational applications.
The official Notion MCP server enabling AI assistants to directly read and manipulate pages, databases, and content in Notion workspaces, with full API support for search, creation, and editing.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent with long-term memory and task tracking.
An AI agent that writes actually useful code for you by writing tests first, then generating code to pass them.
Full toolkit for running an AI agent service built with LangGraph, FastAPI, and Streamlit, providing a complete reference architecture for agent service deployment.
The official C# SDK for the Model Context Protocol, maintained in collaboration with Microsoft, enabling developers to build MCP servers and clients in the .NET ecosystem.
A JVM framework by JetBrains for building predictable, fault-tolerant, enterprise-ready AI agents across all platforms — from backend services to Android, iOS, and in-browser environments, with built-in MCP and multi-provider LLM support.
MS-Agent: a lightweight framework to empower agentic execution of complex tasks.
AI observability platform for production LLM and agent systems by the Pydantic team. Provides real-time monitoring, tracing, and debugging capabilities.
An experimental lightweight isolated runtime from Anthropic for executing agent tasks in a sandboxed environment.
An open-source JupyterLab extension that connects AI agents to computational notebooks, enabling code generation, error explanation, and document Q&A.
Data processing, indexing, and retrieval service examples from the LlamaIndex ecosystem, helping developers integrate external knowledge into agent workflows.
World's first open-source, agentic video production system with 12 pipelines, 52 tools, and 500+ agent skills. Turn your AI coding assistant into a full video production studio.
A simple, secure MCP-to-OpenAPI proxy server that converts MCP tools into OpenAPI-compatible API endpoints for seamless integration with any AI application.
OpenAgentsControl is an AI agent framework for plan-first development workflows with approval-based execution. Supports TypeScript, Python, Go, and Rust with automatic testing, code review, and validation.
A spec-driven development workflow MCP server for AI-assisted software development, featuring a real-time web dashboard and VSCode extension for monitoring and managing project progress in AI coding workflows.
Demystify AI agents by building them yourself. Local LLMs, no black boxes, real understanding of function calling, memory, and ReAct patterns.
A mature open-source code execution and online judge system supporting multi-language compilation, resource limits, and API access, suitable for agent code execution tasks.
Meta's set of tools to assess and improve LLM security, including safety benchmarks, prompt injection detection, and output auditing to help evaluate and enhance the safety of large language models.
A comprehensive tutorial repository for learning AI agent development, covering OpenAI Agents SDK, LangGraph, MCP protocol, and more with hands-on projects.
Local persistent memory store for LLM applications including Claude Desktop, GitHub Copilot, Codex, and more. Provides durable context memory capabilities for AI agents.
Agenta is an open-source LLMOps platform providing prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
AdalFlow: The library to build & auto-optimize LLM applications.
USearch is a fast open-source search and clustering engine for vectors and arbitrary objects, with bindings in C++, Python, JavaScript, Rust, Java, Swift, C#, Go, and Wolfram for large-scale vector retrieval.
A curated list of Model Context Protocol (MCP) servers, collecting high-quality MCP server implementations from the community for developers to quickly discover and integrate the MCP tools they need.
A visualization MCP server by AntV with 25+ chart types, enabling AI assistants to generate line charts, bar charts, pie charts, maps, and more through MCP for data analysis and reporting.
The lightweight ingestion library for fast, efficient and robust RAG pipelines. Supports multiple chunking strategies and embedding models to significantly improve retrieval-augmented generation results.
Open Source Voice Agent Platform.
A comprehensive single-package Retrieval-Augmented Generation platform built on Langflow, Docling, and OpenSearch, providing a complete pipeline from document parsing to vector retrieval and generation with multi-model and multi-vector-database support.
Engram is a persistent memory system for AI coding agents. Agent-agnostic Go binary with SQLite + FTS5, MCP server, HTTP API, CLI, and TUI interfaces.
A strict, quality-first AI coder for enterprises, including AI Agent, AI CodeReview, and AI Completion.
An open-source, vision-first browser agent that drives web automation through visual understanding, supporting complex web interaction tasks for QA testing and workflow automation.
Latitude is the open-source agent engineering platform.
An automation workflow project in the browser-use ecosystem that enables AI agents to operate browsers and complete multi-step web tasks.
Langroid is a Python multi-agent programming framework that leverages an intuitive Agent-Task-Tool abstraction to help developers build LLM-powered multi-agent applications.
AI Agent Orchestration Dashboard for managing AI agents, assigning tasks, and coordinating multi-agent collaboration.
CozoDB is a transactional, relational-graph-vector database that uses Datalog for queries. Designed as the hippocampus for AI, it unifies graph traversal, vector search, and relational queries.
An open-source, modern-design AI training tracking and visualization tool. Supports PyTorch, Transformers and more. Monitor and evaluate AI agent training processes.
MCP server for the Godot game engine, providing tools for launching the editor, running projects, and capturing debug output for AI-assisted game development.
A beautiful Ruby API for OpenAI, Anthropic, Gemini, Azure, Ollama, and more. Built-in agents, chat, vision, audio, tools, streaming, and Rails integration.
An embedded property graph database built for speed with built-in vector search and full-text search, implementing Cypher query language for knowledge graph construction and AI agent structured knowledge retrieval.
Clawith is an open-source project for building and operating AI agent systems.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
An AI-driven local automation assistant like Manus, a computer use agent that uses natural language to make computers work autonomously.
The Python Risk Identification Tool for generative AI — an open-source framework by Microsoft for proactively identifying risks in generative AI systems through red teaming and automated probing.
Model Context Protocol server for Excel file manipulation, enabling AI agents to read, create, and modify spreadsheets.
A task-aware agent-driven prompt optimization framework from Microsoft Research that iteratively refines prompts for better LLM performance.
A self-hosted email client with an AI agent, running entirely on Cloudflare Workers.
LazyLLM is a lightweight multi-agent LLM application framework offering the easiest way to build multi-agent LLM apps, with built-in RAG, knowledge graph, fine-tuning, and integration with LangChain and LlamaIndex ecosystems.
MCP server that interacts with Obsidian via the Obsidian REST API community plugin, enabling AI agents to manage notes.
Microsoft's AI Agent Governance Toolkit providing policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents. Covers all 10 items of the OWASP Agentic Top 10.
Tencent's full-stack AI red teaming platform integrating OpenClaw security scanning, agent scanning, skills scanning, MCP scanning, AI infrastructure scanning, and LLM jailbreak evaluation.
Official Cloudflare MCP server enabling AI agents to access and manage Cloudflare services including Workers, KV, R2 and more.
Development environments for coding agents. Enable multiple agents to work safely and independently with your preferred stack. Provides isolated development environments to avoid conflicts and improve collaboration.
An AI Gateway, registry, and proxy by IBM that sits in front of any MCP, A2A, or REST/gRPC APIs, exposing a unified endpoint with centralized discovery, guardrails, and management.
Code, build, and evaluate agents, with broad model support and Skills, MCP, and ACP integration.
Motia is a TypeScript platform that models APIs, background jobs, agents, and workflows together for teams that want one structure for business logic and automation.
Context management for Claude Code with hooks for state maintenance via ledgers and handoffs. Enables MCP execution without context pollution and agent orchestration with isolated context windows for long-running conversations.
An enhanced MCP server for interactive user feedback and command execution in AI-assisted development, with dual Web UI and desktop app support, intelligent environment detection, and cross-platform compatibility.
Enterprise AI Platform with guardrails, MCP registry, gateway and orchestrator — comprehensive AI agent governance and management.
Real-time transport layer for Java AI agents supporting WebSocket, SSE, gRPC, and WebTransport/HTTP3, with native MCP, A2A, and AG-UI protocol support for building event-driven AI agent communication architectures.
An agentic LLM-powered data processing and ETL system. Enables complex data transformations using natural language-defined pipelines, turning unstructured data into structured, analyzable outputs with LLM intelligence.
A dataset of 15,140 ChatGPT prompts including 1,405 jailbreak prompts from Reddit, Discord, and other platforms, providing a large-scale benchmark for LLM safety research and jailbreak detection.
A universal local knowledge base solution based on vector databases and GPT, providing one-stop document processing with vectorization, semantic search, and intelligent Q&A for building private knowledge bases.
Leading AI Agent Context Platform that provides unified context delivery for AI agents with knowledge management, resource scheduling, and skill integration.
OpenAgents - AI Agent Networks for Open Collaboration. Build, deploy, and manage AI agent networks with multi-LLM support.
A framework for quickly building AI-native IDE products with built-in MCP client support for deep integration of coding assistants.
Open-source security automation platform for teams and AI agents. Build SOAR workflows to detect and respond to threats.
Official Polymarket autonomous trading AI agents that automatically make and execute trading decisions in prediction markets.
OpenOperator is an open-source agent project for computer and browser control, focused on GUI automation, task execution, and human-in-the-loop workflows.
Chrome extension & CLI to let agents control your browser. Runs Playwright snippets in a stateful sandbox. Available as CLI or MCP.
CodeGraphContext is an MCP server that indexes local codebases into a graph database, providing structural context to AI coding assistants for precise code understanding and navigation.
An MCP server and CLI tool that indexes local code into a graph database to provide precise code context for AI assistants.
A multi-agent personal assistant that captures real-time on-screen activities and consolidates them into structured memories, building a knowledge base that adapts to your digital experiences.
NeurIPS 2024 RAG framework inspired by human long-term memory, combining knowledge graphs with personalized PageRank for continuous knowledge integration in LLMs.
Refact is a Rust-based AI coding agent that handles engineering tasks end-to-end, integrating into developer workflows with code completion, chat, agent actions, and self-hosted deployment.
Arrow is the first UI framework for the agentic era, tiny and performant with built-in WASM sandboxes for safe code execution, purpose-built for building AI agent interfaces.
An open-source tool for analyzing and optimizing LLM context, helping developers observe how prompts, memory fragments, and retrieved content affect output.
Laravel-focused MCP server for augmenting your AI-powered local development experience. Deeply integrates AI coding assistants with the Laravel ecosystem.
Expect tests your agent's code in a real browser, providing a visual browser testing environment to verify that AI agent-generated code works as expected.
An agent framework for the JVM built in Kotlin, providing a complete toolchain for developing, orchestrating, and deploying AI agents in the Java/Kotlin ecosystem.
A comprehensive benchmark to evaluate LLMs as agents (ICLR 2024), covering operating systems, databases, knowledge graphs, digital card games and more.
SimpleMem: Efficient Lifelong Memory for LLM Agents — supports text and multimodal memory for long-term information retention and retrieval.
The official Java SDK for Model Context Protocol servers and clients, maintained in collaboration with Spring AI, enabling MCP tool calling and context management in Java applications.
Dagu is a local-first declarative workflow engine that is file-based, self-contained, and air-gapped ready. A single binary that scales from laptop to distributed cluster with a persistent Workflow Operator.
Agent-oriented programming framework for building LLM applications in Java. Provides agent abstractions, tool calling, multi-agent collaboration, and other core capabilities for enterprise Java ecosystem integration.
DeepResearchAgent is a hierarchical multi-agent system designed for deep research tasks and general-purpose problem solving, using a top-level planning agent to coordinate specialized sub-agents for automated task decomposition and efficient cross-domain execution.
A meta-learning agent framework that learns and evolves through conversation, enabling agents to autonomously acquire new skills and optimize strategies.
Code repository for AI Agents Masterclass tutorials covering multi-agent systems, tool use, RAG, and more.
Agentuity is a production-oriented agent platform focused on runtime, tool execution, and orchestration for teams building deployable agent services.
DIMOS is an agentic operating system for physical space, enabling natural language control of humanoids, quadrupeds, drones, and other hardware platforms, with multi-agent systems that seamlessly integrate cameras, lidar, and actuators.
An educational Agentic RAG project with clean code demonstrating how to build RAG systems with agent capabilities — routing, retrieval, evaluation, and iterative refinement.
Browserbase MCP server allows LLMs to control a browser with Browserbase and Stagehand, providing cloud-based browser automation capabilities for AI agents including web interaction, data scraping, and automated testing.
TruLens is an open-source tool for evaluating and tracking LLM apps. It provides specialized evaluation for RAG applications including context relevance, groundedness, and answer relevance.
II-Agent: a new open-source framework to build and deploy intelligent agents.
MTEB (Massive Text Embedding Benchmark) is a comprehensive benchmark framework for evaluating text embeddings across classification, retrieval, clustering, reranking, and more, helping select optimal embedding models for RAG systems.
Platform for LLM evaluations and AI agent testing, providing comprehensive tracing, evaluation, and quality monitoring to help teams build reliable AI applications.
Bee Agent Framework is a production-ready AI agent development framework supporting both Python and TypeScript, offering multi-modal agent building, tool integration, and observability capabilities for rapid production deployment.
🌊 AChat - An open-source/self-hosted/local-first AI platform, designed for enterprises and teams, perfectly combining powerful local processing capabilities with seamless remote synchronization.
Catalog of official Microsoft MCP server implementations for AI-powered data access and tool integration.
An open-source, developer-first LLMOps platform for streamlined prompt design, version management, real-time observability, monitoring, and team collaboration across LLM applications.
GoClaw is OpenClaw rebuilt in Go, with multi-tenant isolation, 5-layer security, and native concurrency. Deploy AI agent teams at scale without compromising on safety.
The Unofficial and Awesome Home Assistant MCP Server — enables AI assistants to interact with smart home systems via the Model Context Protocol for intelligent device control and automation.
The Application Engine for the AI Era. Multi-threaded, AI-native runtime with persistent Scene Graph for real-time agent introspection.
Dynamic AI agent automation platform with multi-provider orchestration, adaptive memory, smart features, and a versatile plugin system.
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications, integrating core components and templates needed for retrieval-augmented generation.
A fast TypeScript framework for building MCP servers with a clean, developer-friendly API for creating Model Context Protocol tools and services.
OpenAI Agents JS is the JavaScript version of the OpenAI Agents SDK, bringing tool calling, state orchestration, and runtime interfaces to JS/TS web stacks.
A general-purpose biomedical AI agent from Stanford for autonomous bioinformatics analysis, literature search, and scientific reasoning.
An open-source autonomous agent powered by Grok, capable of executing tasks, browsing the web, and generating code directly from your terminal.
Collection of Apple-native tools for the Model Context Protocol, giving AI agents access to macOS system features like Notes, Calendar, Reminders, and more.
Universal memory layer for AI Agents providing scalable, extensible, and interoperable memory storage and retrieval to streamline agent state management for autonomous systems.
A lightweight multi-agent orchestration demo showcasing how specialized agents collaborate to accomplish complex tasks.
Deepgram Agent API is a real-time interface layer for voice agents, combining speech recognition, TTS, and dialog control for phone, assistant, and voice workflow applications.
Zed Agentic is Zed's open-source project for in-editor agent collaboration, focused on code understanding, editing suggestions, and enhanced developer workflows.
Official Grafana MCP server enabling AI agents to query dashboards, manage alerts, and analyze monitoring data for intelligent ops.
Give AI agents access to your live Chrome session. Works out of the box, connects to tabs you already have open.
AutoCodeRover is a project structure-aware autonomous software engineer agent that achieves automated program repair and issue resolution by understanding the overall codebase architecture.
An AI multi-agent framework for .NET with multi-LLM backend integration, providing agent management, tool calling, and conversation state management for enterprise agent development.
Building a self-evolving ecosystem of AI agents with automatic optimization, role evolution, and multi-agent collaboration from single agent to complex systems.
Open-source Computer-Use-Agent that automates GUI interactions through natural language instructions, enabling intelligent desktop automation.
One command brings up a complete pre-wired LLM stack with hundreds of services to explore and build AI applications.
Extensible AI agent microservice framework with plugin system, custom tools, and multi-model integration for conversational AI.
PromptTools provides open-source tools for prompt testing and experimentation, supporting multiple LLMs (OpenAI, LLaMA) and vector databases (Chroma, Weaviate, LanceDB) to help developers systematically evaluate and optimize RAG systems.
ReMe: Memory Management Kit for Agents - Remember Me, Refine Me.
The security toolkit for LLM interactions, providing prompt injection detection, PII anonymization, content safety auditing, and more to secure production LLM deployments.
An intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology with a complete web UI for document upload, knowledge base management, and smart Q&A.
BuildWithClaude is a centralized hub for finding Claude Skills, Agents, Commands, Hooks, Plugins, and Marketplace collections to extend Claude Code, Claude Desktop, Agent SDK, and OpenClaw.
Assemble, configure, and deploy autonomous AI Agents in your browser. One-click free deployment of your private AutoGPT web application.
A production-ready Reinforcement Learning AI Agent Library from Meta with comprehensive algorithm implementations.
A modular high-level library from Meta to train embodied AI agents across a variety of tasks and environments.
Open Lovable is an open-source experimental project for conversational app generation, combining agent-style interaction, generative UI, and rapid prototyping.
OpenRouter Agents is OpenRouter's platform capability for multi-model agent use cases, focused on routing, tool calling, and unified access layers.
Next generation agentic proxy for AI agents and MCP servers. Provides unified traffic management, routing, and security control.
Official Python SDK for ElevenLabs voice AI services — text-to-speech, voice cloning, real-time streaming, and Conversational AI agents.
An AI Agent builder and runtime by Docker Engineering, bringing container-native isolation, portability, and standardization to AI agent lifecycle management from development through production deployment.
Framework to build resilient language agents as graphs.
Open-source AI native terminal for cloud and infrastructure management, enabling you to deploy, troubleshoot, and automate services using natural language and intelligent agents.
Agent Orchestration Command Center.
LMNR is an open-source observability platform for LLM and agent applications, focused on tracing, quality analysis, and production diagnostics.
Oxylabs AI Studio Python SDK provides an all-in-one AI-powered web scraping toolkit integrating an AI scraper, crawler, browser agent, search engine, and sitemap tool for structured data extraction driven by natural language instructions.
Amazon Bedrock AgentCore samples that accelerate AI agents into production with scale, reliability, and security for real-world deployment.
An improved implementation of the Ralph Wiggum technique for autonomous AI agent orchestration, built in Rust for reliable multi-agent task coordination and scheduling.
High-performance code intelligence MCP server that indexes codebases into a persistent knowledge graph, supporting 66 languages with sub-millisecond queries and 99% fewer tokens.
A simple general-purpose AI agent based on the OpenAI API, ideal for learning and rapid prototyping of autonomous agents.
AgentStation is an open-source platform focused on agent runtime orchestration, tool execution, and developer workflows for unifying multiple automation capabilities.
Block Open is infrastructure for the open agent ecosystem, focused on runtimes, tool connectivity, and task orchestration for teams that want standardized agent platforms.
OpenPipe Artifacts is a data and artifact management tool for agent and LLM applications, helping teams track prompts, outputs, experiments, and evaluation records.
Web app for interacting with any LangGraph agent (Python and TypeScript) via a chat interface.
Zero-dependency, token-efficient database MCP server for Postgres, MySQL, SQL Server, MariaDB, and SQLite.
Visible multi-agent CLI teams for Claude, Codex, Gemini, OpenCode, and Droid with project memory and tmux supervision.
A lightweight, cross-platform code editor built with Tauri (Rust and React) with Git support, AI agents, and vim keybindings.
Browser automation tool for AI agents and humans, providing high-performance web interaction capabilities built in Go.
Model Context Protocol server for searching and analyzing arXiv papers, enabling AI agents to retrieve and deeply analyze academic research.
HELM (Holistic Evaluation of Language Models) is Stanford CRFM's open-source framework for holistic, reproducible, and transparent evaluation of foundation models including LLMs and multimodal models.
DO Browser is a browser-task agent tool focused on page understanding, action planning, and automation, serving as a lighter alternative to browser-use or Stagehand.
Gradio Agents is Gradio's interaction-layer toolkit for agent interfaces, helping developers build demoable and testable agent UIs for prototyping and human-in-the-loop workflows.
MCP Gateway is a gateway layer for Model Context Protocol integrations, providing unified access, permission boundaries, and routing control between agents and tool services.
A framework-agnostic, git-native standard for defining AI agents.
Open-Source AI Camera Skills Platform, AI NVR & CCTV Surveillance. LLM-powered security camera agent with pluggable AI skills. Runs on Mac Mini & AI PC.
A user memory service for AI applications that extracts preferences, facts, and behavioral information from conversations and retrieves them in subsequent interactions.
AI agent teams for any project. Assemble multiple AI agents to collaborate on tasks with work division and parallel execution.
Ruler applies the same rules to all coding agents. Unify your coding rules and configurations across Claude Code, Cursor, Copilot, and more.
Model Context Protocol server for converting web pages, PDFs, Office documents, and other formats to Markdown for AI agent consumption.
A secure persistent personal agent server in Rust. One binary, sandboxed execution, multi-provider LLMs, voice, memory, and MCP tools.
A Kubernetes-based isolation solution for running AI agents, exploring secure execution environments within the K8s ecosystem.
Coval is an evaluation tool for voice and conversational agents, helping teams test response quality, interaction stability, and real dialog behavior.
LangDB is a data and operations tool for LLM and agent applications, helping teams manage prompts, traces, and experiment versions as a lightweight operational layer.
Aide is a VSCode extension for AI-powered coding assistance, featuring one-click comments, code conversions, UI-to-code generation, and AI batch file processing.
Open Agent SDK TypeScript is a fully open-source Agent SDK without CLI dependencies, serving as an alternative to the Claude Agent SDK for TypeScript developers building custom AI agents.
Multi-agent orchestration workflow platform supporting Claude Code, Codex, Gemini, OpenCode and more. Provides unified orchestration interface for cross-platform agent collaboration.
danghuangshang is a community-maintained knowledge base and tutorial hub for AI agent development, RAG, and LLM engineering.
AWS Labs workflow steering rules for AI-Driven Life Cycle development, guiding AI coding agents through adaptive software-delivery processes.
All-in-one platform for search, recommendations, RAG, and analytics offered via API. Built in Rust with vector search, full-text search, and semantic reranking for enterprise-grade AI retrieval applications.
Omnara (YC S25) - Talk to your AI agents from anywhere. A unified platform for multi-agent communication and task coordination across platforms.
A memory-first coding agent that uses Letta-style long-term memory to help developers work continuously across codebases.
The Open Source Memory Layer For Autonomous Agents. Provides long-term memory, knowledge storage, context management with support for memory retrieval, associative reasoning, and knowledge graph construction.
Terminal session manager for AI coding agents. One TUI for Claude, Gemini, OpenCode, Codex, and more, enabling session switching and parallel workflows.
FastRTC is a developer tool for real-time multimodal and voice applications, useful as a communication layer for low-latency agent conversations and interactive audio/video workflows.
Mem0 TS is the TypeScript version of Mem0, offering long-term memory management, preference extraction, and context compression for agent applications built in JS/TS stacks.
LLMTracer is a tracing tool for agent and LLM applications, helping developers capture call paths, tool execution, and state transitions for debugging and incident analysis.
Framework enabling AI agents to use real Android and iOS apps just like a human, supporting autonomous operation and interaction with mobile interfaces.
Comprehensive Google Workspace MCP server and CLI tool for controlling Gmail, Calendar, Docs, Sheets, Slides, Chat, Forms, Tasks, Search and Drive with AI.
Comprehensive Google Workspace MCP server enabling AI control of Gmail, Calendar, Docs, Sheets, Drive, and more Google applications.
PentestAgent is an AI agent framework for black-box security testing, supporting bug bounty, red-team, and penetration testing workflows.
A capability-based, multiplexing sandbox tool built for developers — run agents securely without needing any additional infra, zero setup, zero latency.
A CNCF Sandbox SRE Agent that automatically analyzes infrastructure logs and metrics to assist with incident diagnosis and system operations.
Griptape is a modular framework for building and deploying AI agents, supporting toolchains, memory, and multi-model integration.
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle helps agents handle any computer task by enabling strong reasoning abilities, self-improvement, and skill curation in a standardized general environment with minimal requirements.
Enterprise-grade multi-tenant AI agent development platform from China Unicom, featuring RAG, workflow orchestration, and MCP tool integration
Security scanner for AI agents, MCP servers, and agent skills by Snyk — detect and fix security vulnerabilities before deployment.
Blaxel AI SDK is a production-focused toolkit for agent systems, emphasizing tool definitions, execution control, tracing, and service integrations for enterprise apps.
Contextal is a context management and retrieval-enhancement tool for multi-turn agents, long conversations, and complex knowledge injection workflows.
Gweaver is an experimental platform for multi-agent collaboration and task weaving, useful for exploring decomposition, coordination, and role-based execution.
OpenLIT is an open-source AI engineering platform providing OpenTelemetry-native LLM observability, GPU monitoring, guardrails, evaluations, prompt management, and playground, integrating with 50+ LLM providers and agent frameworks.
Open-source local realtime voice AI system supporting fully offline real-time voice conversations, suitable for building private voice assistants and voice interaction applications.
AI agent and animation engine powered by Large Language Models for creating interactive animations and visual content.
Claude Agent SDK Demos is Anthropic's official collection of demos for the Claude Agent SDK, showcasing how to build various AI agent applications across multiple use cases and best practices.
Fully autonomous AI Agent that can perform complicated tasks and projects using terminal, browser, and editor with sandboxed execution.
A library by Hugging Face for easily evaluating machine learning models and datasets, providing a wide range of metrics and evaluation methods.
Official Microsoft collection of Skills, MCP servers, Custom Agents, and Agents.md for SDKs to ground coding agents.
HuggingFace's all-in-one toolkit for evaluating LLMs across multiple backends, deeply integrated with the HuggingFace ecosystem and providing flexible evaluation metrics and benchmark configuration.
Powerful MCP server providing all-in-one public web access for AI agents with web scraping and structured data extraction.
An open-source LLM observability platform providing logging, tracing, feedback, evaluation, and prompt management for chatbots and agent applications.
Versatile, UI-agnostic OpenAI-compatible plugin framework for adding custom pipelines like content filtering, RAG enhancement, and tool calling to any AI chat interface.
CLI to control iOS and Android devices for AI agents, enabling coding agents to directly interact with mobile devices for testing and automation.
An open-source library by NVIDIA for efficiently connecting and optimizing teams of AI agents with orchestration, tool calling, and workflow management.
Official spec and SDK of MCP Apps protocol - the standard for UIs embedded in AI chatbots, served by MCP servers, enabling interactive user interfaces directly from MCP tools.
An evaluation and monitoring tool for LLM applications that checks response quality, context relevance, factuality, and user feedback for agent systems.
The SOTA open-source browser agent for autonomously performing complex tasks on the web with natural language-driven web automation.
A hands-on Java and Spring AI project for building AI agents with RAG, tool calling, MCP, and ReAct-style autonomous planning.
🤖Self-Modifying Framework from the Future 🔮 World's First AMS.
Python and JS/TS SDK for running AI-generated code in secure cloud sandboxes with Jupyter-style code interpretation
A full-stack AI infrastructure tool for data, model, and pipeline orchestration. Streamlines building versatile AI-first applications with a visual pipeline editor for end-to-end workflows from data ingestion to model inference.
A spec-driven multi-agent project management framework that turns requirements, planning, execution, and review into collaborative agent workflows.
No-code multi-agent framework to build LLM agents, workflows, and applications with your own data, supporting diverse data source integrations
AI chat assistant for Obsidian with contextual awareness, smart writing assistance, and one-click edits. Features vault-aware conversations, semantic search, and local model support.
Framework for running agent evaluations and creating RL environments to measure and improve agent performance
Free open-source chat SDK for building fast, real-time apps and generative AI agents with high-performance, customizable, cross-platform UI.
LLM Agent framework within ComfyUI integrating MCP server, TTS, OCR, GraphRAG, and other AI tool nodes for visual workflow building
Official MCP server implementation for the Perplexity API Platform, enabling AI agents to leverage search capabilities.
An LLM-based multi-agent framework that lets developers easily build multi-agent applications with core abstractions for agent roles, tools, knowledge management, and collaboration patterns.
An AI agent for teams, communities, and multi-user environments. Supports intelligent conversations, task delegation, and information sharing.
Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time execution. Build agent workflows without coding.
Offline multi-agent simulation and prediction engine using Neo4j graph database and Ollama local inference for swarm intelligence simulation.
Make your agents learn from experience. A context engine that helps agents continuously improve through structured memory management and experience replay.
VectorAdmin is the universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with an intuitive web interface for data import, querying, and maintenance.
An intelligent RAG system integrating GraphRAG, LightRAG, and Neo4j graph builders with DeepSearch for reasoning-augmented retrieval and a custom evaluation framework for GraphRAG.
AI chat client implementing the Model Context Protocol (MCP) with multi-model support and cross-platform desktop experience
An AI-powered research assistant web UI that performs iterative, deep research on any topic by combining search engines with LLM reasoning.
An AI Agent workforce platform that assigns every team member an AI agent squad for multi-agent collaboration, task orchestration, and compound skill building to scale team capacity beyond headcount.
MCP server for interacting with the Financial Datasets stock market API for AI agent financial data analysis.
A framework for large language model evaluations developed by the UK AI Safety Institute (AISI), providing comprehensive model capability assessment tools with support for safety and alignment testing.
Lightweight MCP gateway that instantly transforms existing MCP Servers and APIs into MCP servers with zero code changes.
A graph-native context development platform for storing, enriching, and retrieving structured knowledge with semantic search and portable context cores, supporting RDF, SPARQL, and other standards for AI agent knowledge management.
Unified hub for centrally managing and dynamically orchestrating multiple MCP servers and APIs with flexible routing.
A platform for building semantically enhanced knowledge graphs, supporting entity modeling, relation extraction, and knowledge fusion for long-term agent memory.
Open-source MCP server for LinkedIn. Enables Claude and any MCP-compatible AI assistant to access profiles, companies, jobs, and messages.
Free, private, UI-based tech documentation MCP server designed for coders and AI coding assistants with precise doc retrieval.
JSON-driven multi-agent cadence-team development framework with intelligent CLI orchestration (Gemini/Qwen/Codex), context-first architecture, and automated workflow execution.
A portable .agent workspace that stores memory, skills, and protocols so coding agents like Claude Code, Cursor, and Windsurf can share durable knowledge.
A framework for building, running, and scaling AI agents as APIs and microservices, with built-in observability, auditability, and identity-aware access control from day one.
AI computer use powered by open source LLMs and E2B Desktop Sandbox.
Superlinked Inference Engine is an open-source inference server and production cluster for embeddings, reranking, and extraction, providing high-performance data processing pipelines for RAG systems.
Microsoft Word MCP server providing AI assistants with document creation, editing, and manipulation capabilities through the MCP protocol.
Build applications that make decisions (chatbots, agents, simulations, etc.). Monitor, trace, persist, and execute on your own infrastructure.
Claude Agent ACP enables using the Claude Agent SDK from any ACP client, providing standardized Agent Client Protocol integration for unified Claude Agent capability access across different platforms.
The open-source RAG platform with built-in citations, deep research, 22+ file formats, partitions, and MCP server.
Development platform to debug, chat, inspect, and evaluate MCP servers and apps for faster MCP development.
Witsy: desktop AI assistant and universal MCP client that connects to multiple AI models and tool services via the Model Context Protocol.
Asynchronous coordination layer for AI coding agents providing identities, inboxes, searchable threads, and advisory file leases over FastMCP, Git, and SQLite.
Notte is a framework for building web agents and deploying serverless browser automation functions, providing reliable browser infrastructure and web-aware agent capabilities.
An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework. It leverages Google's Gemini 2.5 Flash and OpenAI's GPT-5.1 on the backend for inference.
Microsoft's open-source AI red teaming playground labs with infrastructure for running AI red teaming trainings and hands-on security exercises.
OxyGent is an open-source multi-agent collaboration framework from JD.com, supporting flexible agent role definition, task decomposition, and collaborative orchestration for enterprise AI agent applications.
Neuron AI is a PHP agentic framework for building production-ready AI applications, enabling developers to connect LLMs, vector databases, and memory systems to create agents that interact with data.
An open-source template for building web agents with Stagehand on Browserbase, providing serverless browser automation for AI agents to safely execute web tasks in the cloud.
Out-of-the-box (OOTB) GUI Agent for Windows and macOS.
An AI-powered agentic red team framework that automates offensive security operations, from reconnaissance to exploitation to post-exploitation, with zero human intervention.
Shannon is a production-oriented multi-agent orchestration framework built in Go, focusing on efficient and reliable agent coordination and task scheduling for enterprise-grade multi-agent systems.
A curated directory of open-source AI agent skills for Swift and Apple platform development.
Open-source persistent memory service for AI agents, supporting LangGraph, CrewAI, and AutoGen with REST API, knowledge graph, and autonomous memory consolidation.
YoMo is a serverless AI Agent framework built on geo-distributed edge AI infrastructure, using low-latency stream processing for real-time agent orchestration and MCP tool integration, ideal for edge computing agent deployments.
Automatically generate demo applications using LLMs. Describe your idea and get an interactive prototype in Streamlit or Gradio format.
An open-source LLM vulnerability scanner and AI red teaming kit for automated security fuzzing of LLM applications, detecting jailbreaks, prompt injection, and adversarial attacks.
A Markdown-first memory system and standalone library for any AI agent. Provides memory storage and retrieval with vector search and semantic matching to help agents manage long-term context.
An open-source chat UI for Ollama providing a clean, intuitive interface for local LLM conversations with model selection and conversation management.
AutoChain is a lightweight, extensible, and testable LLM Agent framework by Forethought, providing clean abstractions for agent building with automatic tool selection, conversation history management, and automated testing workflows.
An observability platform for AI agents that tracks model calls, tool executions, task trajectories, and runtime costs.
An open-source background agents coding system that autonomously executes coding tasks in the background, including code reviews, test generation, and feature implementation.
Open Agent Platform is LangChain's open-source deployment platform for agents, focused on multi-agent execution, long-running tasks, observability, and production orchestration.
A curated collection of safety-related papers, articles, and resources focused on Large Language Models — comprehensive reference for researchers and practitioners exploring LLM safety implications and advancements.
LLM Compiler for Parallel Function Calling (ICML 2024), significantly improving agent tool calling efficiency and speed through parallel execution.
ShowUI is an open-source, end-to-end Vision-Language-Action model for GUI agents and computer use, capable of understanding screenshots and executing precise interface interactions.
An enterprise-grade platform for running and managing MCP servers with containerized deployment, security isolation, network policies, resource limits, and unified management of large-scale MCP server fleets via Kubernetes or Docker.
Sandbox your local AI agents so they can only read and write what they need. File system permission control for secure local agent execution.
Self-hosted, always-on AI agent platform running in containers. Create multiple bots with long-term memory and connect them to Telegram, Discord, Feishu, Matrix, and more.
Sourcery is an instant AI code review tool that automatically detects code issues, suggests refactoring, and improves code quality, integrating into developer workflows for real-time code review.
Powerful, self-hostable AI agent platform designed for maximum privacy and flexibility. A complete drop-in replacement for OpenAI Responses APIs running locally on consumer-grade hardware.
Elegant lightweight AI chat client with multi-workspace, plugin system, cross-platform sync, Artifacts, and MCP support — local first
Desktop AI assistant with multi-model support (GPT-5, Claude, Gemini, Ollama, etc.), featuring chat, vision, voice, RAG, image generation, agents, and MCP plugins
A collection of Microsoft chat application samples demonstrating how to build LLM conversation experiences with Azure and common frontend frameworks.
Run Claude Code, Gemini, Codex — or any coding agent — in a clean, isolated sandbox with sensitive data redaction and observability baked in.
An open-source AI assistant framework with skills and agent architecture.
An MCP client for Neovim that seamlessly integrates MCP servers into your editing workflow with an intuitive interface for managing, testing, and using MCP servers with your favorite chat plugins.
Realtime Voice AI on Arduino ESP32 with 100+ Voice AI Models for AI Toys, Companions, and Devices. Supports OpenAI Realtime, Gemini, Grok, and Eleven Labs.
PowerPoint MCP server using python-pptx, enabling AI assistants to create, edit, and manipulate PowerPoint presentations through the MCP protocol.
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing.
Official sample application for OpenAI Computer Using Agent (CUA). Learn how to use CUA via the API on multiple computer environments.
Multi-agent AI coding platform powered by Vercel Sandbox and AI Gateway.
An open-source Cultivation World Simulator using Agentic Workflow to create a dynamic, emerging Xianxia world. Showcases multi-agent collaboration in complex scenarios.
An open-source multi-agent simulation platform where multiple LLM-powered agents collaborate on complex tasks in shared environments, with customizable roles, memory systems, and environment interaction for studying multi-agent collaboration.
An autonomous agent framework for Elixir built for distributed, autonomous behavior and dynamic workflows, leveraging BEAM VM concurrency and fault tolerance for production-grade agent systems with high availability.
JVector is the most advanced embedded vector search engine, built in pure Java by DataStax. It provides high-performance ANN search for RAG and AI applications on the JVM.
Vald is a highly scalable distributed vector search engine built on cloud-native architecture, designed for high-performance approximate nearest neighbor search across massive vector datasets.
LangGraph for Java — a library for developing AI agentic architectures in the Java ecosystem, designed to work seamlessly with both LangChain4j and Spring AI, supporting stateful graph-based workflows and complex agent orchestration.
A multi-modal vector database that supports upserts and vector queries using unified MySQL-compatible SQL on structured and unstructured data, meeting high concurrency and ultra-low latency requirements.
IBM's open-source Industry 4.0 AI agent benchmark and framework with 460+ scenarios, 4 specialist agents, and multi-agent orchestration blueprints for industrial asset operations.
LLPhant - A comprehensive PHP Generative AI Framework using OpenAI GPT-4. Inspired by LangChain.
Official Microsoft Learn MCP Server and CLI tool, powering LLMs and AI agents with real-time, trusted Microsoft docs and code samples.
Framework for AI agents to build and maintain an Obsidian wiki using Karpathy's LLM Wiki pattern.
A framework that uses code execution as agent actions. Research shows code is a better action space than text for agents, powering the CodeAct paradigm.
⚡️ Superpowers for your OpenClaw. Powerful prebuilt agent workflows.
An open-source MCP client that provides unified access to Model Context Protocol tools, enabling integration of any MCP server into AI applications with simplified tool calling.
A ChatGPT web client supporting multiple users, languages, and database connections for persistent storage. Provides Docker images and quick deployment scripts.
MCP server for n8n automation platform, enabling AI agents to interact with n8n API to manage workflows and automation tasks via natural language.
Skybridge is a full-stack TypeScript framework for MCP Apps and ChatGPT Apps. Type-safe. React-powered. Platform-agnostic.
A memory platform for personal AI and agent applications, providing persistent context, semantic retrieval, and cross-session knowledge management.
OpenAdapt is an open-source agent tool for desktop automation and computer-use scenarios, capturing user interactions, replaying tasks, and enabling GUI automation workflows.
AI-powered RPA tool that records and replays user actions, combining traditional RPA with LLM agents for intelligent task automation.
Adala is an autonomous data labeling agent framework that uses AI agents to automate data annotation, classification, and quality checks, significantly improving data processing efficiency.
A command-line interface for interacting with MCP (Model Context Protocol) servers using both stdio and HTTP transport.
Brazilian public API MCP server collection, integrating 70+ Brazilian government open APIs for AI assistants to access rich Brazilian public service data.
A fast and lightweight framework for creating decentralized agents with ease.
A GenAI application development framework that simplifies agent interaction with structured data and chained-calls syntax, using event-driven flow for complex logic.
Google ADK Java is Google's Java toolkit for building, evaluating, and deploying sophisticated AI agents, filling the agent framework gap in the Java ecosystem.
An open source autonomous agent built in Rust that lives on your machines 24/7 and keeps your apps running on autopilot.
Next-gen AI+IoT framework for T2, T3, T5AI, ESP32, and more – Fast IoT and AI Agent hardware integration.
Agentic AI framework for enterprise workflow automation. Uses LLM-powered pipelines for code reviews, DevOps, and other enterprise tasks.
ChatArena is a multi-agent language game environment for LLMs, designed to develop and evaluate communication and collaboration capabilities of AI agents across diverse game scenarios.
Let your AI agent use your browser. Actionbook makes browser automation actually work through natural language instructions.
MCP Language Server gives MCP-enabled clients access to semantic code tools like go-to-definition, find-references, rename, and diagnostics, providing AI agents with precise code navigation capabilities.
The first full-stack open-source self-evolving general AI agent, offering a fully local alternative to agentic platforms like Manus and Genspark AI with autonomous thinking, task planning, tool usage, and knowledge accumulation.
A simple yet powerful agent framework for personal assistants, designed to enable intelligent interaction, multi-agent collaboration, and seamless tool integration with built-in memory and tree-of-thought reasoning.
Talk to your Mac, query your docs, no cloud required. On-device voice AI with RAG for private, local voice assistant experience.
Official data.gouv.fr MCP server for the French national open data platform, enabling AI chatbots to search, explore, and analyze French government datasets.
Open chat interface for all your models — a unified, modern frontend for connecting to various AI providers
Playwright for Windows desktop automation, enabling AI agents to control desktop applications through natural language
A cognitive architecture framework for enterprise AI agents, providing complete methodology for agent planning, execution, and learning.
Official MiniMax MCP server enabling Text-to-Speech, image generation, and video generation APIs through the MCP protocol for multimodal AI agent capabilities.
An LLM prompt injection detector that combines heuristics, vector similarity, and language model-based detection to identify and block malicious prompt injection attacks.
WebArena is a realistic benchmark environment for evaluating autonomous web agents. It provides Gym-like interactive website simulations covering e-commerce, forums, CMS, and more, enabling end-to-end task evaluation as a standard framework for web agent research.
Mirascope is a lightweight LLM development library that takes a type-safe, Pythonic approach to building LLM applications, emphasizing simplicity over framework constraints.
Open-source AI agent desktop app for Windows and macOS with one-click install of Claude Code, MCP tools, and Skills, featuring sandbox isolation, multi-model support, and Feishu/Slack integration.
The TypeScript version of the Claude Agent SDK, officially maintained by Anthropic, providing the official toolkit for TypeScript developers to build Claude Agent applications.
🤖 AI-native Visual Analytics framework built for agents.
LangMem is LangChain's memory layer for agents, helping developers add long-term memory, replay summaries, and context management to improve multi-turn performance.
The community edition of Pica, the agentic tooling platform.
Extract and convert data from any document (PDFs, images, Word, PPT, URLs) into multiple formats including Markdown, JSON, and CSV.
ColiVara is a suite of services for storing, searching, and retrieving documents based on visual embeddings. It uses vision models instead of chunking and text-processing, achieving state-of-the-art retrieval on both text and visual documents without OCR.
(BETA) AI shouldn't have a meter. Unlimited tokens. Forever. Your machine. Your agent. Use it from anywhere. Terminal-native coding agent powered by local LLMs — 100% open source, free forever, and installed with a single command. Proudly built on C#/.NET, because AI tooling s...
OctoTools is an agentic framework with extensible tools for complex reasoning, featuring a tool card system for flexible composition of diverse reasoning capabilities.
Common interface for interacting with AI agents. The protocol is tech stack agnostic - you can use it with any framework for building agents.
An automated LLM fuzzing tool by CyberArk that helps developers and security researchers identify and mitigate jailbreak vulnerabilities in LLM APIs with multiple attack vectors.
ESP-Claw, a "Chat Coding" AI agent framework for IoT devices.
KaibanJS is a JavaScript-native multi-agent framework with a Kanban-inspired approach for managing agent collaboration, supporting task assignment, role definition, and parallel execution for rapid multi-agent system development.
A real-time observability toolkit for Claude Code agents that tracks hook events to monitor multi-agent coding workflows.
SWE-Lancer is an OpenAI benchmark dataset evaluating frontier language models on freelance software engineering tasks, covering real scenarios from simple bug fixes to complex feature development.
[EMNLP 2025 Oral] MemoryOS is designed to provide a memory operating system for personalized AI agents.
CrewAI Tools provides reusable integrations for the CrewAI ecosystem, including search, scraping, database access, and code execution to extend multi-agent workflows quickly.
Phantom is an AI co-worker with its own computer, featuring self-evolving capabilities, persistent memory, and MCP server support, autonomously completing complex tasks like a virtual colleague.
Official Qdrant MCP server implementation, enabling AI assistants to interact with Qdrant vector database for semantic search and knowledge storage.
Run coding agents in sandboxes. Control them over HTTP. Supports Claude Code, Codex, OpenCode, and Amp with isolated execution environments.
OpenReview is an open-source, self-hosted AI code review bot powered by Vercel that automatically analyzes pull requests and provides code review suggestions.
Fully-featured web interface for Ollama LLMs built with Next.js. Supports local model conversations, multi-model switching, and browser-side persistent storage.
A hyper-fast local vector database for use with LLM Agents, providing lightweight vector storage and similarity search capabilities for embedding as instant memory and knowledge retrieval components in agent applications.
HyperAgent is a Playwright-based AI browser automation framework offering high-level APIs like page.ai(), page.perform(), and page.extract(). It features built-in MCP client support and action caching, enabling AI agents to browse, interact, and extract data using natural language.
Grounded Docs MCP Server providing precise technical documentation retrieval for AI coding assistants, an open-source alternative to Context7.
E2B Desktop Sandbox for LLMs. A secure sandbox with graphical environment that connects to any LLM for safe computer use operations.
Your AI agent skills, finally organized — a macOS app to browse, edit, and manage skills across Claude Code, Cursor, Codex, Windsurf, and Amp.
Official ElevenLabs MCP server providing AI assistants with high-quality speech synthesis and voice cloning capabilities.
AI video agents framework for next-gen video interactions and workflows.
A suite of tools for connecting AI to the web with a query language and Playwright integrations for precise, scalable web element interaction and data extraction.
A multi-agent system that keeps running for ~100 hours and solves very complicated coding or math problems that can be verified.
The Powerful Conversational AI JavaScript Library with UI for any LLM. Supports LangChain, HuggingFace, Vercel AI, and more. Works with React, Next.js, and plain JavaScript.
A multi-agent coding system where orchestrator, explorer, and coder agents collaborate on software tasks with shared context.
A custom AI agent platform that lets teams build and deploy AI assistants by composing multiple agents, connecting them to internal knowledge bases and tools for trusted AI-powered collaboration in enterprise workflows.
Anti-detection patches for Playwright and browser automation scenarios, helping automated browsers appear more like real user sessions.
AI-Engineering Foundation Framework built with AI and designed for AI. Hundreds of architectural and domain decisions (multi-tenancy, RBAC, event flow, pricing, sales pipeline, CRM/ERP processes) are already made as conventions and specs so agents (Cursor, Claude Code, Codex) arch...
10x is an open-source AI coding accelerator delivering up to 20x faster coding with multi-step capabilities, featuring smart model routing, BYOK, and fully self-hosted deployment.
A reactive runtime for building durable AI agents, written in Rust for high reliability and persistence.
The first open-source Artificial Narrow Intelligence generalist agent that fully operates GUIs using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning for spatial perception and HID simulation.
A Gemini chatbot example from Vercel Labs demonstrating how to implement streaming conversations with modern frontend techniques.
Samurai-inspired multi-agent system for Claude Code. Orchestrate parallel AI tasks via tmux with shogun-karo-ashigaru hierarchy.
Experimental Linux microvm setup with a TypeScript Control Plane as Agent Sandbox.
Multi-agent orchestration for AI coding agents with pluggable runtime adapters for Claude Code, Pi, and more.
A high-performance vector database designed to handle up to 1 billion vectors on a single node, delivering significant performance gains through optimized indexing and execution. Also available as a cloud service.
A deliberately vulnerable MCP server for security education, containing multiple MCP protocol vulnerability scenarios to help developers understand and prevent agent security risks.
An AWS sample project demonstrating how to build enterprise-grade chat applications with Amazon Bedrock, including conversation UI, RAG integration, and multi-model support.
MCP server that enables AI agents to extract data from social media, search engines, maps, and e-commerce sites using thousands of Apify scrapers.
The toolkit for AI devtools context engineering. Build with codebase mapping, symbol extraction, and many kinds of code search to help AI agents better understand and operate on codebases.
BrowserWing turns browser actions into MCP commands or Claude Skills, allowing AI agents to control browsers efficiently and reliably with reduced dependency on heavy LLM interactions.
A curated list of awesome LLM and AI Agent Skills, resources and tools for customising AI Agent workflows. Works with Claude Code, Codex, Gemini CLI and custom agents.
MySQL MCP server enabling secure interaction between AI assistants and MySQL databases, supporting query execution and data manipulation.
A self-organizing multi-agent collaboration platform where multiple AI agents work as an autonomous team, handling planning, executing, reviewing, and patrolling tasks with zero human intervention.
Orchestrate thousands of agents and harnesses as a graph programmatically.
TrustRAG is a RAG framework focused on reliable input and trusted output, providing complete RAG pipeline components including document parsing, chunking, retrieval, and reranking with multiple retrieval strategies and evaluation methods.
Connect AI models like Claude & GPT with robots using MCP and ROS protocols, enabling AI-driven robot control and interaction.
EmbedAnything is a highly performant, modular, and memory-safe embedding inference and indexing framework built in Rust, providing production-ready RAG ingestion and indexing pipelines for local and cloud deployment.
BaseAI is a serverless AI agent framework for web developers, enabling local-first agentic pipes, tools, and memory with one-command serverless deployment for rapid AI agent application delivery.
Agent-MCP is a multi-agent framework built on the Model Context Protocol (MCP) that enables coordinated, efficient AI collaboration with multiple specialized agents working in parallel on different aspects of a project.
Every AI Agent deserves a wallet.
Open-source agent platform for Global × China enterprises — wire every system through one agent core. Self-hosted, any LLM.
An advanced browser AI tool developed by Oxylabs AI Studio that automates real user browsing tasks using natural language instructions.
trpc-agent-go is a powerful Go framework for building intelligent agent systems with LLM integration, tool calling, multi-step reasoning, and workflow orchestration, designed for enterprise-grade agent systems in microservice architectures.
An MCP adapter that bridges the Abilities API to the Model Context Protocol, enabling MCP clients to discover and invoke WordPress plugin, theme, and core abilities programmatically.
Langtrace is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations, and metrics for popular LLMs, agent frameworks, and vector databases.
KubeAI is a Kubernetes-native AI inference operator that makes it easy to serve ML models in production, supporting LLMs, VLMs, embeddings, and speech-to-text with autoscaling.
A lightweight, fast, and secure code execution environment supporting multiple programming languages — provides sandboxed code execution for the Dify platform.
AI browser automation assistant as a Chrome extension, privacy-first with MCP support; an alternative to Claude Chrome and Manus Browser Operator.
Proma brings the smoothest universal agent experience into your workflow. Built on Claude Agent SDK with native Feishu (Lark) group chat integration and flexible LLM provider support, bringing top-tier agent capabilities to everyday work scenarios.
Browserable is a self-hostable browser automation tool purpose-built for AI agents. It provides secure Docker-based browser environments with a JavaScript SDK, achieving 90.4% accuracy on the Web Voyager benchmark for autonomous web navigation.
Multi-Agent System Framework For Complex Tasks.
A curated list of papers and resources for multi-modal Graphical User Interface agents, systematically covering computer use, mobile interaction and more.
TypeScript AI platform with AI chat, autonomous agents, software developer agents, chatbots, and more.
A Burp Suite extension that adds MCP tooling, AI-assisted analysis, privacy controls, and passive or active scanning to security testing workflows.
Git LRC is a free, unlimited AI code review tool that runs automatically on every commit, helping developers catch and fix code issues early in the development workflow.
ApeRAG is a production-ready GraphRAG system with multi-modal indexing, AI agent integration, MCP support, and scalable Kubernetes deployment, providing a complete solution for enterprise-grade RAG applications.
Weixin Agent SDK enables connecting any AI agent to WeChat (Weixin) bots, allowing quick integration of AI agents into WeChat official accounts or bots for intelligent conversation and automated services.
Official Neo4j GraphRAG Python SDK providing an integrated toolkit for knowledge graph construction, vector retrieval, and graph querying, supporting agent-driven graph retrieval-augmented generation workflows.
Build your own Cowork, AI Scientist, and other SoTA agents just by editing config files. Supports Anthropic skills. An infinite-horizon agent framework designed for long-running, complex tasks.
Zylos Core is open-source agent infrastructure for team collaboration, providing AI agents with lifecycle management capabilities to support continuous operations.
A single interface to use and evaluate different agent frameworks.
Inkeep’s agent platform for creating AI assistants and multi-agent workflows through a no-code visual builder or TypeScript SDK.
CVS Health's open-source uncertainty quantification library for language models, providing UQ-based hallucination detection with confidence scoring and mitigation tools to identify and reduce unreliable LLM outputs.
A lightweight, rollbackable, and visual long-term memory server for MCP agents, replacing traditional vector RAG with reliable context retention.
Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and version control agents across compatible frameworks.
An AI agent framework focused on agent collaboration, featuring a clean API design and documentation-driven development approach, supporting task decomposition, coordination, and result aggregation across multiple agents.
Kodus AI is an open-source AI code review tool with full control over model choice and costs, automatically analyzing pull requests and delivering high-quality code review feedback.
Agent-native flight search & booking. Saved $116 across 5 routes vs Google Flights (verified). 400+ airlines in 5 seconds.
MCP server for Jupyter that enables AI agents to interact with Jupyter kernels, execute code, and manage notebooks.
An AI workflow builder template from Vercel Labs with visual workflow orchestration. Built on Next.js and Vercel AI SDK with drag-and-drop design.
SQL-Driven RAG Engine that automatically builds knowledge graphs during querying, combining SQL query capabilities with Retrieval-Augmented Generation for efficient knowledge retrieval.
A framework for integrating AI into JSX components. Build AI applications the React way with streaming rendering, tool use, and agent composition.
Wuying AgentBay SDK is a cloud sandbox built for AI agents, providing secure isolated execution environments for agents to safely run code and operations in the cloud, suitable for production-grade agent applications requiring sandboxed execution.
VectorDBBench is a benchmarking tool for vector databases, providing standardized performance testing and comparative analysis for popular vector databases including Milvus, Qdrant, Chroma, Weaviate, and more.
Claude Memory Compiler gives Claude Code a memory that evolves with your codebase. Hooks automatically capture sessions, the Agent SDK extracts key decisions and lessons, and an LLM compiler organizes everything into structured, cross-referenced knowledge articles.
Open-sourced computer use agents that can operate on cross-platform environments including Windows, macOS, Ubuntu, and Android. ICLR 2026 Oral paper project.
A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI agents in parallel to deliver comprehensive, multi-perspective analysis on any query.
A better chatbot platform powered by Agent, MCP, and Workflows. Supports multi-model integration, visual workflow orchestration, and low-code configuration.
Open-source web data agent optimized for structured web research, capable of autonomously browsing websites and extracting structured data.
Markdown memory system for you and your AI agent. Provides persistent memory storage through structured Markdown files with context management and retrieval.
A toolkit by Weights & Biases for developing AI-powered applications, providing LLM call tracing, evaluation experiment management, and versioning from prototype to production.
Calendar sync tool and universal calendar MCP server for aggregating, syncing, and controlling calendars across Google, Outlook, Office 365, iCloud, and CalDAV.
Skales is a local AI desktop agent for Windows, macOS, and Linux. It features an agent skills system (SKILL.md), autonomous coding (Codework), multi-agent team collaboration, and desktop automation with 15+ AI providers, requiring no Docker or terminal.
A low-code tool to rapidly build and coordinate multi-agent teams for complex task execution.
Open-source, end-to-end platform for evaluating, observing, and improving LLM and AI agent applications. Tracing · Evals · Simulations · Datasets · Gateway · Guardrails. Self-hostable. Apache 2.0.
Lightweight and portable LLM sandbox runtime Python library — provides a code interpreter for safely executing AI agent-generated code in isolated environments.
Independently authored prompt templates for AI coding agents — system prompts, tool prompts, agent delegation, memory management, and multi-agent coordination. Informed by studying Claude Code.
Lightweight AI agent framework with built-in memory, tool calling, and tree-of-thought reasoning, supporting multi-agent collaboration and self-learning, compatible with OpenAI, DeepSeek, Qwen, and other major LLMs with MCP/SSE protocol integration.
LLMChat provides a unified interface for AI chat and agentic workflows, supporting multi-model conversations and agent workflow orchestration with a clean, modern user interface.
A MemAgent framework that can extrapolate to 3.5M context tokens, along with a training framework for RL training of any agent workflow.
Dynamiq is an orchestration framework for agentic AI and LLM applications.
An open-source AI Voice Agent that integrates with Asterisk/FreePBX using Audiosocket/RTP technology for low-latency AI-powered phone interactions.
An agentic memory system for LLM agents inspired by human memory mechanisms, enabling dynamic memory generation, retrieval, and consolidation with automatic memory evolution and self-organization.
An evaluation framework for LLM applications providing test set management, metric computation, and output quality assessment for agent development teams.
The first open-source testing agent that enables UI, API, security, accessibility, and visual validations without writing code or maintaining tests.
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone. Provides out-of-box RAG solution with support for knowledge base building, semantic search, and context management.
AI Agent Evaluator and Red Team Platform. Provides systematic security evaluation and adversarial testing tools to discover and fix vulnerabilities in agent systems.
Agentica is a TypeScript AI function calling framework enhanced by compiler skills, using type-safe schemas to auto-generate function calls and boost LLM tool-use capabilities for reliable AI agent backends.
Production-grade multi-agent orchestration platform with JSON-defined agents, multi-tier memory, and built-in observability, battle-tested on 200+ enterprise AI agents with full enterprise deployment support.
ByteDance's open-source code sandbox and evaluation framework for LLM code generation, function calling, and agent execution tasks, with multi-language support and batch evaluation.
AgentSociety 2 is a modern, LLM-native agent simulation platform designed for social science research and experimental design. It provides a flexible framework for creating and managing intelligent agents in simulated environments.
Open-source framework for building browser agents for real-world tasks, learning from user demonstrations to automate web interactions.
OpenTelemetry instrumentation for AI observability, providing standardized tracing, metrics collection, and span definitions for LLM inference processes to help developers monitor and debug AI agent systems.
The first LLM-based web agent and benchmark for generalist web agents, providing datasets, evaluation frameworks and baseline methods for building agents that operate on real websites.
Web UI for AutoGen (a framework for multi-agent LLM applications).
A powerful, easy-to-use Python library for implementing Google's Agent-to-Agent (A2A) protocol for inter-agent communication.
An open-source toolkit for monitoring Large Language Models, extracting signals from prompts and responses for quality and safety evaluation.
Agents-flex is a lightweight Java AI application development framework.
A version of verl that supports diverse tool use.
A security scanner for LLM agentic workflows. Automatically detects security vulnerabilities, prompt injection risks, and permission violations in agent pipelines before deployment.
Latent Collaboration in Multi-Agent Systems — exploring implicit communication and coordination mechanisms for efficient multi-agent reasoning and task allocation.
Open source autonomous software development system that automates the entire process from requirements to code using LLMs.
Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.
Hexabot v3 is an AI automation platform, combining workflows, actions, agents, and conversational channels in one runtime.
chromem-go is an embeddable vector database for Go with a Chroma-like interface and zero third-party dependencies. It supports in-memory storage with optional persistence, ideal for lightweight RAG applications.
An automatic prompt optimization framework by Salesforce AI Research that leverages LLMs to search for and refine prompts for improved model performance.
A modular RAG system with MCP Server architecture. Uses a Skill to make the AI follow each step of the spec, so that the code is written 100% by AI.
The Microsoft 365 Agent SDK simplifies building full stack, multichannel, trusted agents for platforms including M365, Teams, Copilot Studio, and Webchat.
A cross-platform, ultra-efficient SQLite extension that brings vector search capabilities to embedded databases, ideal for local-first RAG applications and agent memory storage.
A curated knowledge base on AI memory for LLMs and agents, systematically covering long-term memory, reasoning, retrieval, and memory-native system design.
Middleware providing an OpenAI-compatible API endpoint that bridges MCP tools to any client or framework supporting the OpenAI API format.
Open Agent is an open-source alternative to Claude Agent SDK, ChatGPT Agents, and Manus, providing autonomous AI agent capabilities with support for multiple LLM backends and a focus on building open, customizable agent platforms.
Easy Linux virtual machine on macOS to sandbox LLM agents — a lightweight VM solution for safely running AI-generated code in isolation.
LangSmith SDK is LangChain's observability toolkit for LLM apps and agents, covering tracing, evaluation, dataset management, and debugging for production workflows.
Layra is an enterprise-ready solution combining visual RAG with multi-step agent workflow orchestration, providing out-of-the-box document parsing, knowledge base construction, and intelligent Q&A capabilities.
An open-source platform for building AI chatbots and automated assistants, including frontend UI, bot configuration, and integration examples.
ArtifactFS is a filesystem driver designed to mount large git repos as quickly as possible, hydrating file contents on-the-fly instead of blocking on the initial clone.
A GenAI-powered multi-agent medical assistant for diagnostic support, healthcare research, and question answering through coordinated agent roles.
An implementation of agentic memory for LLM agents from the NeurIPS 2025 A-Mem paper, focused on long-term memory mechanisms.
A PostgreSQL vector database extension for building AI applications, adding high-performance vector search capabilities to PostgreSQL with support for generating and indexing embeddings directly in the database.
🦀 Crabwalk 🦀 Real-time companion monitor for OpenClaw agents.
Inngest Agent Kit is a TypeScript toolkit for agent development that combines step orchestration, tool calling, streaming execution, and event-driven workflows for production tasks.
Web, Desktop & Mobile client for Codex, Claude Code, OpenCode, Kimi, Augment Code, Qwen. Fully end-to-end encrypted cross-platform agent client.
Deep research agent to help you find the best GitHub repositories — AI-powered intelligent search to discover the most suitable open-source projects for your needs.
Official LiveKit React voice agent starter template, demonstrating how to build real-time voice interactive AI agents with LiveKit Agents framework.
Augment SWE-bench Agent is the number one open-source SWE-bench Verified implementation, demonstrating how to build high-performance software engineering agents to automatically resolve GitHub issues.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking multi-modal AI agents.
A spatial IDE for recursive multi-agent orchestration. Like an Obsidian graph-view that you work directly inside of — visually build and manage agent teams.
An easy-to-use Python framework for generating adversarial jailbreak prompts, helping researchers systematically evaluate LLM safety defenses with multiple attack method combinations.
A general memory system for AI agents powered by deep research, providing a flexible memory architecture that supports unified management and retrieval of multiple memory types including short-term, long-term, and episodic memory.
TanStack Store is a lightweight state-management tool that works well for agent UIs, workflow frontends, and real-time consoles that need to manage agent state and event flows.
A system for generalist web agents that autonomously carry out tasks on any given website, leveraging large multimodal models like GPT-4V.
An MCP server for SearXNG, providing AI agents with privacy-friendly meta search engine capabilities.
Agentic SOC Platform: A powerful, flexible, open-source, and agent-centric automated security operations platform.
A Pydantic AI framework for Claude Code-style deep agents with tool calling, sandboxed execution, multi-agent teams, skills, checkpoints, and extended context.
A programmable code execution service for AI applications, supporting script execution, online evaluation, and agent tool invocation scenarios.
Microsoft's enterprise solution accelerator for multi-agent automation workflows using Azure OpenAI, with agent orchestration and task decomposition.
SWE-AF is an autonomous software engineering fleet platform using a multi-agent factory architecture. It orchestrates planner, coder, reviewer, and verifier agents to automate the full software engineering lifecycle from issue analysis to code fix, scoring 95/100 on benchmarks.
Golf MCP is a production-ready MCP Server framework with built-in auth, observability, debugger, telemetry, and runtime for building secure AI agent infrastructure.
Sandboxed Execution Environment by WithSecure. Designed for malware analysis and security research with virtualization-based isolation, adaptable for AI agent secure execution.
A practical skill collection for Claude Code covering web development, WordPress, databases, and DevOps to enhance domain-specific task execution.
An open-source tool from Meta for LLM prompt optimization. Automates the process of continuously improving and refining LLM prompts.
Arrakis is a fully customizable and self-hosted sandboxing solution written in Go, designed specifically for AI agent code execution scenarios, providing a secure isolated runtime environment.
A flexible multi-interface AI agent framework supporting reasoning, tool use, memory, deep research, blockchain interaction, and MCP protocol, capable of building agent applications ranging from simple conversations to complex research tasks.
An end-to-end infrastructure for training and evaluating various LLM agents — provides a complete toolchain from data construction to model training and evaluation.
Automated QA testing MCP tool using Browser-Use agents, leveraging AI agents for browser-based automated quality assurance testing.
OpenHands Software Agent SDK is a clean, modular SDK for building AI agents based on OpenHands V1, providing a concise API and extensible architecture for quickly building custom agent applications.
A visualization and management tool for AI long-term memory, helping developers inspect, edit, and debug agent memory accumulated across sessions.
Security gateway for AI coding agents providing security protection, workspace isolation, and multiplexing, supporting Claude, Copilot, Cline, and other IDE extensions to prevent sensitive data leaks and malicious prompt injections.
Blades is a Go-based multimodal AI agent framework from the Kratos team, supporting vision, voice, and text interactions with built-in agent orchestration, tool calling, and memory management.
A private agent fleet platform with spec coding, where each agent gets its own GPU-accelerated desktop for running Claude, Codex, Gemini and open models.
CodeFuse-muAgent is an innovative agent framework driven by a knowledge graph engine, integrating EKG (Enterprise Knowledge Graph) technology for multi-agent collaboration, RAG-enhanced retrieval, and tool learning.
Open-source AI Agent platform that brings Local Manus to your desktop — one-click model downloads, seamless LLM integration, offline RAG knowledge bases, and DeepResearch capabilities with 100% local data.
Open Foundations for Computer-Use Agents. Provides datasets, benchmarks, and foundation models for training and evaluating AI agents that control desktop environments.
AI agent security scanner that detects vulnerabilities in agent configurations, MCP servers, and tool permissions. Available as CLI, GitHub Action, and GitHub App integration.
Agents Deep Research is a multi-agent collaborative deep research tool using specialized agents to cooperatively complete complex research tasks, supporting automated literature search, analysis, and report generation.
A prompt management and debugging platform for LLMs, providing prompt logging, request tracking, replay capabilities, and debugging tools to help teams systematically manage LLM interactions and optimize prompts.
Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.
CUGA is an open-source generalist agent harness for the enterprise, supporting complex task execution on web and APIs, OpenAPI/MCP integrations, composable architecture, reasoning modes, and policy-aware features.
Multi-agent framework for design, simulation, and auditing. Built in Rust for high-performance multi-agent collaboration.
A sample pack of GitHub Agentic Workflows showing how to organize agent workflows around GitHub-based development processes.
Comprehensive benchmark for deep research agents, providing systematic evaluation framework for assessing deep research agent performance.
Agentic Flow enables switching between alternative low-cost AI models in Claude Code/Agent SDK and deploying agents created with Claude to the cloud, providing a complete workflow from development to production.
An end-to-end RL training framework by NVIDIA for orchestrating tools and agentic workflows. Optimizes multi-step agent decision-making and tool-use policies.
State-of-the-art computer-using agent scoring 82% on OSWorld Verified; fully open-source, safe, auditable, and production-ready for desktop automation.
An open-source implementation of Programmatic Tool Calling that demonstrates how agents can execute code and invoke tools through MCP-style mechanisms.
Official AWS Python SDK for building AI agents on Amazon Bedrock with lifecycle management, tool integration, memory, and audit trails.
Every practical and proposed defense against prompt injection — a comprehensive reference for LLM security practitioners.
Dapr Agents is a framework for building autonomous, resilient, and observable AI agents with built-in workflow orchestration, security, statefulness, and telemetry for production-grade agent deployments.
Browser Use Agent SDK, from the browser-use team, is a toolkit for building browser automation agents, enabling developers to quickly create AI agents that interact with the web.
Open-source AI agent firewall for MCP security providing agent egress control, DLP, SSRF protection, and prompt injection defense.
An Agent Development Kit providing core abstractions and tools for building enterprise-grade AI agents with multiple LLM backends, tool use, and workflow orchestration.
Pipelex is a declarative language and devtool for building composable AI workflows, enabling definition, debugging, and execution of complex LLM pipelines and agent workflows.
Enterprise-ready MCP Gateway and Registry that centralizes AI development tools with secure OAuth authentication, dynamic tool discovery, and unified access with Keycloak/Entra integration.
An MCP server powered by Mem0 for long-term agent memory, supporting user preference memory, context-aware retrieval, and cross-session memory persistence, also useful as a Python MCP server development template.
A Go implementation of the Model Context Protocol SDK, providing complete tooling for building MCP servers and clients in Go.
Scaling data for SWE-agents (NeurIPS 2025 D&B Spotlight). A toolkit for automatically generating large-scale training datasets for software engineering agents.
A multi-agent framework written in Rust for building, deploying, and coordinating multiple intelligent agents, designed for high performance and memory safety in latency-sensitive production systems.
Conversational voice AI agents platform for building natural language phone interactions with multilingual speech synthesis and real-time dialogue management.
An open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, and multi-channel integration.
A comprehensive collection of LLM jailbreak techniques and prompts for ChatGPT, Claude, Llama, and other models — essential reference for LLM security research.
Desktop app to control your computer with AI using your terminal, browser, mouse & keyboard.
An agent framework for llama.cpp that supports structured function calls and JSON output, enabling easy interaction with local LLMs without fine-tuning.
Open-source automation platform providing Python libraries and cloud runtime for building and running RPA and AI agent workflows.
An open-source chatbot and AI agent development platform with visual bot editor, NLP, and deep learning integration for multi-platform deployment.
Easy-to-use RAG framework, a top-3 solution in the CCF AIOps International Challenge 2024, providing out-of-the-box RAG pipeline building capabilities.
Lightweight, cross-platform process sandboxing powered by OpenAI Codex's runtime — sandbox any command with file, network, and credential controls.
AI-powered software engineering multi-agent system with researcher and developer agents that automate code implementation through intelligent planning and execution.
AI agents with graph-based reasoning memory by Neo4j. Scaffold graph databases in seconds to give agents knowledge-graph-driven memory and reasoning capabilities.
AI-friendly semantic code search engine for large codebases — combines ripgrep speed with tree-sitter AST parsing for precise, context-aware code understanding in AI coding assistants.
Community-driven P-lexity AI desktop app powered by Electron, bringing powerful AI language intelligence straight to your desktop.
Vectra is a local vector database for Node.js with features similar to Pinecone but built using local files. It supports semantic search and document embeddings with no external service dependencies, ideal for RAG application development in Node.js environments.
A tracing and debugging platform for LLM and agent applications, recording prompts, model responses, tool calls, and chain latency for observability.
AI-first security scanner with 76 analyzers, 9,600+ detection rules, and repo poisoning detection for AI/ML, LLM agents, and MCP servers. Scan any GitHub repo.
A dynamic environment by ETH Zurich to evaluate attacks and defenses for LLM agents, providing standardized benchmarks for measuring agent system security.
An open-source memory system for AI assistants that persists user experiences, preferences, and context into retrievable long-term storage.
An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.
MCP Sequential Thinking server that recommends the most effective MCP tools at each reasoning stage, enhancing AI agent tool selection.
Open-source AI sandbox infrastructure for code execution, browser use, and AI agent runtimes.
Agent SDK Go is a powerful Go framework for building production-ready AI agents, providing core features including tool use, conversation management, and multi-model support.
Open Source Voice Agent Platform.
AgentPay SDK is an AI agent payment SDK built in Rust, providing agents with secure and reliable payment capabilities, enabling agents to autonomously complete payment and transaction operations.
A research-agent service example from the DeepLearning.AI Agentic Workflow course, demonstrating core patterns for agentic workflows.
A versatile workflow automation platform to create, organize, and execute AI workflows, from a single LLM to complex AI-driven workflows.
AgentLabs is a toolkit for agent development and testing, focused on experimentation, replay, and workflow support to improve iteration speed.
An agentic workflow tool for OpenCode that provides context-engineering support to help coding agents organize project knowledge.
Multi-agent platform built on Spring AI Alibaba with MCP protocol, skills system, memory management, dream mode, and multi-channel support.
Open-source self-hosted AI agent runtime and multi-agent framework for autonomous swarms, with persistent memory, MCP tools, schedules, delegation, and support for many LLM providers.
Audit-grade multi-agent orchestration for CLI coding agents with HMAC-chained audit logs, signed agent cards, per-artifact lineage, and air-gap deployment support.
Implementing cognitive architecture and psychological memory concepts into Agentic LLM Systems. Explores short-term, long-term, and working memory engineering for AI agents.
An MCP-native browser agent that gives AI systems a real browser for web tasks while keeping a human in the loop.
Give each AI agent its own isolated machine with root, Docker, and systemd. Active defense detects and stops threats automatically.
Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling.
Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414.
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
A local-first LLM wiki and knowledge-graph builder that can serve as a RAG knowledge base, agent memory store, and AI second brain.
OpenClaw plugin implementing the A2A (Agent-to-Agent) protocol v0.3.0 — bidirectional agent communication gateway.
Keep it Simple AI Agent framework with General-Purpose and Software Engineering assistants, following KISS design principles.
A deep research agent for medical scenarios, built on a knowledge-informed trajectory synthesis framework for deep retrieval and reasoning across medical literature.
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environments where agents must adapt their strategies as new information becomes available,...
GitAgent is a framework-agnostic, git-native standard for defining AI agents where identity, rules, memory, tools, and skills are version-controlled files in a Git repository, enabling reproducible and collaborative agent development.
An open-source multi-agent chat interface for managing multiple agents in one dynamic conversation and connecting MCP servers for deeper research.
Open-source cross-agent memory layer for coding agents via MCP, compatible with Cursor, Claude Code, Windsurf, and more.
The official Docker MCP registry for centralized discovery, distribution, and management of MCP servers, providing standardized tool access for AI agents.
Batteries for your Pydantic AI agent — official harness providing testing, evaluation, and debugging infrastructure.
Eidolon is the first AI Agent Server, providing a pluggable Agent SDK and enterprise-ready runtime supporting multi-agent orchestration, tool integration, and production deployment.
General-purpose AI agent framework and batteries-included app for building, running, and composing self-contained agents and multi-agent teams with built-in tools, sub-agents, persistent sessions, TUI, and web UI.
The open agent control plane that governs autonomous AI agents with pre-execution policy enforcement, approval gates, and audit trails. Works with LangChain, CrewAI, MCP, and more.
An active tool-discovery framework for autonomous LLM agents, helping them discover and select MCP tools at runtime.
Vigil is an LLM security detection tool that identifies prompt injections, jailbreaks, and other potentially risky LLM inputs through multi-dimensional analysis for real-time safety protection.
An MCP server for OpenAI Codex CLI, enabling Codex to call external tools and resources through the standard MCP protocol.
Modal platform Python client for running AI agents and compute-intensive tasks in the cloud with serverless GPU computing and elastic scaling.
Quantalogic is a ReAct-based coding agent framework supporting multiple LLM backends, with tool use, reasoning chain management, and an extensible plugin system.
An MCP integration for Roblox Studio that enables AI agents to participate in game-development workflows, resource editing, and automation.
An open-source benchmark for prompt injection attacks and defenses in LLMs, systematically evaluating the effectiveness of different attack strategies and defense mechanisms.
A unified Model Context Protocol server implementation that aggregates multiple MCP servers into one.
An open-source agentic AI sandbox matrix for Kubernetes and cloud-native environments, focused on isolated agent execution.
Curated systems, benchmarks, and papers on memory for LLMs and MLLMs, covering long-term context, retrieval, and reasoning for agent memory research.
Microsoft's official travel planning AI agent demo showcasing multi-agent collaboration with Azure OpenAI and Semantic Kernel for itinerary, hotel, and activity planning.
Open source AI agent security infrastructure that intercepts and blocks dangerous agent behaviors before they happen. Deploy with a single command for real-time behavior monitoring and protection.
An AI agent framework built with Rust focusing on performance and security. Supports MCP protocol integration, composable toolchains, and distributed agent execution.
An open-source evaluation tool for generative AI applications, helping teams build test suites, compare model outputs, and track quality changes over time.
MCPAdapt is an adapter library that unlocks 650+ MCP server tools for use in popular agentic frameworks like LangChain, LlamaIndex, and more.
A memory example and toolkit based on the Qdrant vector database, demonstrating how to store conversations, documents, and events for semantic recall by agents.
Stop AI agents from doing things you did not ask for. Behavior monitoring and permission control ensure agents operate only within authorized bounds.
A general-purpose AI agent framework for solving data problems, providing data analysis, processing, and visualization capabilities.
Open-source LLM toolkit for building trustworthy LLM applications with TigerArmor (AI safety), TigerRAG (embedding and RAG), and TigerTune (fine-tuning) modules.
A browser runtime and control platform for AI agents, providing programmatic access to web sessions, page interactions, and automation workflows.
A curated collection of AI tools, utilities, and resources for developers and creators building agent-powered applications.
LlamaIndex team's agent framework providing agent orchestration, service discovery, and multi-agent collaboration in a microservices architecture.
Three lines of code to give your AI agents persistent memory. Reduce token consumption by 90% while maintaining quality. High-performance memory layer built with Go.
AI agent tooling for data engineering workflows, providing intelligent agent-assisted capabilities for data processing pipelines.
An operational layer for coding agents with memory, validation, and feedback loops that compound across sessions.
Procedural memory for AI coding agents that transforms scattered session history into persistent, cross-agent memory so every agent learns from every other.
A high-performance MCP server implementation built in Elixir by CloudWalk, focused on AI agent tool calling in financial scenarios with reliable messaging.
LiveKit Agents Playground is an interactive environment for testing and debugging LiveKit voice AI agents, providing a visual agent conversation interface and debugging tools for developers to quickly validate and optimize voice agents.
A PHP implementation of the Model Context Protocol SDK, enabling PHP developers to build MCP servers and clients with standard transport and tool registration.
A functional, composable, TypeScript-first AI Agent framework for real-world applications with declarative agent definition, tool composition, and streaming workflows.
Amazon's AI agent evaluation tool for automated quality assessment of Bedrock Agents and other LLM agents with multi-dimensional metrics and benchmarks.
Runtime policy enforcement for AI agents with cryptographic audit trail, human-in-the-loop approvals, and kill switch. Zero code changes required.
Config-driven AI agent engine built on Quarkus. JSON-defined agents with multi-agent orchestration, 12+ LLM providers, MCP/A2A protocols, RAG, and persistent memory.
A lightweight open-source observability component for LLM applications, providing tracing, evaluation, and debugging capabilities.
An OWASP-aligned security plugin for AI agents, providing comprehensive security assessment and protection including prompt injection defense and access control.
A multi-modal multi-agent framework for document understanding that leverages multiple specialized agents to analyze and comprehend complex documents.
A multilingual benchmark for issue resolving. Extends SWE-bench to multiple programming languages for evaluating AI agent capabilities across diverse codebases.
Open source AI Agent evaluation framework for web tasks to measure and compare AI agent performance on web operations.
LangChain AWS is LangChain's AWS integration library, supporting building AI agents using AWS Bedrock, Lambda, and other services with seamless AWS cloud integration.
A RAG framework for Cyber Threat Intelligence integrating knowledge graphs and causal reasoning, providing security analysts with intelligent threat intelligence analysis tools.
A Python SDK for AI browser automation that enables models to locate elements, perform web actions, and extract structured data from web pages.
A local-first persistent agent memory system powered by a Recursive Memory Harness for durable context and knowledge management.
Self-improving multi-agent orchestration framework for Claude Code, Gemini CLI, and Codex CLI with TDD enforcement.
Automated harness engineering for AI agents. Auto-generates test harnesses to evaluate agent safety and reliability across different scenarios.
A graph-native memory system for AI agents backed by Neo4j. Store conversations, build knowledge graphs, and let agents learn from their own reasoning.
A powerful multi-agent orchestration framework built on LangGraph for intelligent task decomposition and collaborative execution.
AI session memory with Think-Execute-Reflect quality loops. Built on the Intelligent Distance principle to give your agent a brain that survives every session.
A multi-agent framework enabling AI agents to collaborate effectively, helping developers build powerful multi-agent systems.
Agentic Retrieval-Augmented Generation framework achieving state-of-the-art results on multi-hop QA benchmarks through hierarchical retrieval with keyword, semantic, and chunk read tools.
Security toolkit for AI agents to scan dangerous skills and MCP configs, monitor supply chain attacks, test prompt injection resistance, and audit live MCP servers for tool poisoning.
A survey of graph-based agent memory. A curated list of resources including surveys, papers, benchmarks, and open-source projects on graph-based agent memory.
Home of the AI workforce featuring multi-agent systems, AI agents, and tools for building autonomous AI workflows in enterprises.
Next-gen transparent agent architecture with full behavior audit, two-phase secure invocation, dual-level memory, and heartbeat tasks. Compatible with OpenClaw and Claude Code.
Self-hosted, open-source AI gateway providing one API for 20+ LLM providers, databases, and files with integrated RAG, voice, and guardrails.
Official Redis agent memory server providing fast and flexible persistent memory for AI agents and applications, with context management and session memory support.
A self-hardening firewall for large language models that automatically learns and adapts from attacks to continuously strengthen LLM application security.
AI and LLM Red Team Field Manual and Consultant's Handbook, systematically covering red team assessment methodologies, attack techniques, and defense strategies.
Open-source LLM security research code and results from Dropbox, covering LLM security testing methods, vulnerability analysis, and defense strategies.
Centralized agent control plane for governing runtime agent behavior at scale. Configurable, extensible, and production-ready across multiple agent frameworks.
Lasso security integrations for Claude Code, including prompt-injection defenses to protect code during AI-assisted development.
Universal AI agent memory service providing a unified memory management interface with support for multi-agent shared memory and cross-session persistent storage.
Easy-to-use agent memory by ElizaOS, powered by ChromaDB and Postgres. Provides hybrid memory with vector search and structured storage for agents.
An enterprise-ready Spring AI platform integrating RAG, tool calling, asynchronous ingestion, JWT/RBAC security, and observability.
A toolkit for making AI agents and workflows measurably reliable, with epistemic measurement, Noetic RAG, sentinel gating, and grounded calibration.
Official LiveKit Python voice agent starter template, demonstrating real-time voice AI agent construction with speech recognition, synthesis, and NLU.
A complete LangGraph-based example of multi-agent RAG, showing agents collaborating on retrieval, routing, reasoning, and answer generation.
LLM security testing framework for detecting prompt injection, jailbreaks, and adversarial attacks with 190+ probes and 28 providers in a single Go binary.
A local-first AI agent memory system with offline Markdown storage, hybrid search, MCP support, and a web dashboard.
An autonomous web browser QA agent that evaluates performance, functionality, and user experience through GUI or CLI workflows.
WebMCP starter demo with a DoorDash-style food delivery app featuring 9 AI agent tools (imperative and declarative patterns).
A Python red teaming framework for testing chatbots and GenAI systems, helping security teams discover and fix security vulnerabilities in AI systems.
An ICLR 2024 Spotlight LM-based emulation framework for identifying the risks of LM agents with tool use, helping discover safety issues in tool-using agents.
Research tool for bypassing commercial LLM guardrails to evaluate and improve the effectiveness of LLM safety defense mechanisms.
A fully-featured, GUI-powered local LLM Agent sandbox with complete MCP protocol support.
Simple Prompt Injection Kit for Evaluation and Exploitation. Helps security teams quickly validate defense effectiveness against prompt injection vulnerabilities.
A benchmark for prompt injection detection systems to evaluate and compare the effectiveness of different detection approaches.
A Top 10 list of agentic AI security vulnerabilities, serving as the core reference for OWASP and CSA red teaming work with a standardized framework for AI agent security assessment.
Safe local execution layer for AI agent tools to build, validate, and publish MCP tools with a no-password secure runtime.
A project combining browser-use agent control with Steel's cloud browser infrastructure for scalable web automation.
An in-process agentic memory system that gives applications and agents lightweight memory storage and retrieval.
An LLM-based data-analysis agent for dbt projects that automates exploration of models and project structure via a remote MCP server.
A vector-search-powered MCP server for agent memory, providing queryable long-term context storage for AI assistants.
A production-ready AI agent framework with tool calling, persistent memory, intelligent concurrency, and event-driven observability.
Advanced prompt injection defense system for AI agents with multi-language detection, severity scoring, and security auditing.
Bag of Tricks for benchmarking jailbreak attacks on LLMs. NeurIPS 2024 paper providing empirical tricks for LLM jailbreaking with standardized evaluation.
A pytest plugin for running and analyzing LLM evaluation tests, enabling systematic validation of AI agent performance.
Visual AI agent workflow automation platform with local LLM integration. Build intelligent workflows using drag-and-drop interface.
The fastest Trust Layer for AI Agents with prompt injection detection, PII filtering, and content safety guardrails.
Scan your dev machine for AI agents, MCP servers, IDE extensions, and suspicious packages in seconds. Identify potential security threats to keep your development environment safe.
Official Taskade MCP server and OpenAPI to MCP codegen for building AI agent tools from any OpenAPI specification.
Meta-project for the AI agent tooling ecosystem integrating Mulch, Seeds, Canopy, and Overstory agent tools.
A local execution sandbox for AI agents that uses Docker to isolate filesystem, network, and processes, enabling safe execution of model-generated code, commands, and tool calls.
Open-source EDR for AI agents to monitor processes, files, network, and behavior of autonomous AI agents.
Security Comprehension Awareness Measure by 1Password. An open-source benchmark testing AI agents' security awareness during realistic, multi-turn workplace tasks.
A lightweight, event-driven multi-agent framework for embodied AI systems providing efficient multi-agent collaboration for physical world applications.
Community edition of Spring AI Playground providing a safe local execution layer for AI agent tools and for building and validating MCP tools.
Curated catalog of must-have external toolkits to integrate with AI agents built with Python agent frameworks.
AI Agent Gateway to install MCP servers and skills once and share across all AI agents with unified tool management.
A toolkit for integrating graph database capabilities into LLM applications, supporting knowledge graph construction, querying, and context enhancement.
AI Agent Security Middleware with 8-layer defense, DLP data flow control, prompt injection detection, and zero-dependency security.
A+ Grade AI Agent Security Framework with military-grade protection against prompt injection, command injection, and Unicode bypass attacks.
A multi-agent framework to fully automate anomaly detection across different modalities including tabular, graph, and time-series data.
A multi-agent framework open sourced by Ant Group for creating and coordinating multiple AI agents to collaborate on complex tasks.
Monte Carlo’s official toolkit for AI coding agents, bringing data observability, triage, troubleshooting, and health checks into Claude Code, Cursor, and similar tools.
A comprehensive MCP server that gives AI assistants task management, project-specific storage, and agent memory capabilities for long-lived project context.
A lightweight AI browser automation agent framework providing a clean API for building web interaction automation tools.
An open-source AI monitoring platform supporting model performance, data drift, and production quality metric observation for LLM and agent applications.
A Rust-based sandboxed TypeScript interpreter for AI agent tool execution, designed as a fast lightweight alternative to MCP-style tool calling.
An open taxonomy and scoring framework for evaluating AI agent sandboxes with 7 defense layers and 7 threat models.
LangEvals aggregates various language model evaluators into a single platform, providing a standardized LLM evaluation interface with safety checks.
Platform to create, manage, and orchestrate stereOS AI agent sandboxes with secure isolated execution environments.
Guardrail capabilities for Pydantic AI including cost tracking, prompt injection detection, PII filtering, and safety validation.
An MCP server for AI agent memory backed by Neo4j knowledge graphs, useful for structured long-term context and relationship data.
RAG/LLM Security Scanner identifies critical vulnerabilities in AI-powered applications including misconfigurations, data leakage, and access control flaws.
A semantic knowledge platform for human-AI collaboration that can serve as a wiki, knowledge base, context graph, semantic layer, or agentic memory.
An API-first multi-agent coding assistant with CodeAct-style tool calling, terminal support, and collaborative code execution.
Research project investigating LLM security by performing binary classification for prompt injection attack detection and analysis.
A lightweight library for LLM jailbreaking defense with multiple defense strategies to protect large language models from jailbreak attacks.
A persistent memory system for AI coding assistants, using MCP to share project knowledge across Claude Code, Cursor, Copilot, Windsurf, Cline, and similar tools.
MCP server implementation exposing document processing capabilities through the Model Context Protocol for AI agent document handling.
A macOS browser agent that completes web tasks through autonomous execution, chat-based clarification, and resumable local workflows.
Open-source AI security playground for LLM red teaming with hands-on labs covering the full OWASP LLM Top 10 with progressive defenses.
An open-source platform for automatically testing AI agent security. Identifies vulnerabilities such as prompt injection, secret leakage, and system instruction exposure.
A general-purpose Java agent built with Spring Boot, Spring AI, RAG, tool calling, and MCP, supporting multi-turn dialogue and persistent memory.
A self-hosted AI chat platform with a web UI and terminal CLI, supporting any model, web search, browser-agent automation, persistent memory, and analytics.
Zero-code LLM security and observability proxy with real-time prompt injection detection, PII scanning, and security monitoring.
The first locally-hosted, open-source LLM security proxy written completely in Rust for high-performance AI safety protection.
AI agent tooling for Python data science workflows providing agents with data analysis and visualization capabilities.
Enterprise AI Agent Platform with distributed memory storage and SSH sandbox execution. Serve unlimited agents with minimal cloud infrastructure.
A research project exploring how models understand web interfaces, decompose action steps, and complete complex online tasks through browser agent capabilities.
Easy-to-use Python package for LLM prompt injection detection and prompt input sanitization, with multiple detection methods and custom rules.
An observability platform for workflows, pipelines, and AI agents, providing metrics, logs, and traces for automation systems.
Open-source security gateway for LLM APIs with prompt injection detection, PII redaction, dangerous response filtering, and more.
Lightweight prompt injection detection for LLM applications providing simple and efficient input safety validation.
Interactive sandboxes for AI agent evaluations and reinforcement learning on third-party APIs like Slack, LinkedIn, and more.
The open-source runtime for AI agents featuring sandboxed execution with built-in tools and human-in-the-loop approval.
A sandboxed execution environment for AI agents via WASM, providing lightweight and high-performance code isolation for safely executing untrusted agent code.
Firefox/Chromium extension for AI security researchers that streamlines LLM jailbreak testing and vulnerability discovery across multiple providers.
Jailed Docker environments with network isolation for AI agents to execute code safely in isolated containers.
Open benchmark for AI agent security tools, evaluating prompt injection, data exfiltration, tool abuse, and provenance tracking.
An htop-style monitor for AI agents that tracks token usage, costs, and tool calls across Claude Code and Codex in real time.
AI Agent Tools library for the Graphlit Platform providing knowledge retrieval and content processing capabilities for Python agents.
An open-source AISI toolkit for sandboxing agentic evaluations, helping researchers isolate models, tools, and execution environments safely.
Dynamically convert OpenAPI specs into AI agent tools for automatic API-to-tool transformation.
Real virtual desktops for AI agents. MCP-native, self-hosted, and fully isolated. Ideal for secure execution of GUI-interacting agents.
Security scanner for AI agent tool definitions that detects security vulnerabilities and configuration risks in agent tool interfaces.
The SchemaPin protocol for cryptographically signing and verifying AI agent tool schemas to prevent tampering and supply chain attacks.
Taskara is an orchestration platform for long-running agent tasks and multi-step automation, emphasizing persistence, scheduling, and execution control.
AI Red Teaming Arsenal with a curated collection of prompt lists for diverse AI security testing and adversarial evaluation.
An integrated platform for AI agent tool management and security with tool registration, access control, and audit trails.
Working code examples to defend against Agentic AI threats including prompt injection detection, Claude Code security configuration, and agent access control.
MCP server for MicroSandbox that gives AI agents secure sandboxed execution environments for running untrusted code.
An open-source chatbot platform providing a chat frontend, bot configuration, and multi-channel integration for building user-facing AI dialogue systems.
A chat workbench for debugging and experimenting with multiple LLMs, providing session management, parameter configuration, and an interactive UI.