OpenClaw
OpenClaw is an open-source personal AI assistant platform supporting 25+ messaging channels (WhatsApp, Telegram, Slack, etc.) with multi-LLM integration and personal knowledge management.
OpenClaw is an open-source personal AI assistant platform supporting 25+ messaging channels (WhatsApp, Telegram, Slack, etc.) with multi-LLM integration and personal knowledge management.
n8n is a powerful workflow automation platform with native AI agent nodes, enabling multi-step agent workflow orchestration and hundreds of external service integrations.
AutoGPT is an autonomous AI agent that can complete user-defined tasks end-to-end. It plans and executes steps on its own and is considered a milestone in agent autonomy.
The agent harness performance optimization system with skills, instincts, memory, security and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
An agentic skills framework and software development methodology that provides reusable skill modules and engineered workflows for AI coding agents.
Langflow is a visual AI agent and workflow builder platform with drag-and-drop design, multi-LLM integration, and tool composition to simplify agent development.
OpenCode is an open-source terminal coding agent that supports multiple LLM providers, offering AI-powered code generation and editing in the terminal.
The open source coding agent. An AI-powered programming assistant that runs in your terminal. Supports code generation, refactoring, debugging, testing and full development workflow with intelligent code completion and suggestions.
Dify is an open-source LLM application development platform with a visual agent orchestration interface, supporting workflows, knowledge bases, and multiple models.
Full system prompts, internal tools and AI models from 40+ popular AI tools including Cursor, Devin, Windsurf, Manus, Lovable, and more.
LangChain is a framework for building applications powered by language models. It provides core capabilities such as chaining, memory management, and agent orchestration, making it a go-to choice for AI agent development.
Open WebUI is a feature-rich, user-friendly self-hosted AI platform supporting Ollama and OpenAI-compatible APIs, with RAG, agents, and MCP capabilities.
Official Anthropic repository for Agent Skills, providing ready-to-use Claude agent skill examples and templates.
Claude Code is an agentic coding tool by Anthropic that lives in your terminal, understands your codebase, and helps you code faster through natural language commands.
Firecrawl is the Web Data API for AI, turning web pages into clean, structured, LLM-friendly data with crawl, scrape, and search capabilities.
100+ AI Agent and RAG apps you can actually run — clone, customize, and ship. A great reference for quickly building LLM-powered applications.
Gemini CLI is a terminal-based AI agent tool from Google that supports code generation, file operations, and multi-turn conversations with a free usage tier.
An autonomous AI agent framework from NousResearch that supports multiple LLM backends and grows with user needs.
browser-use enables browser automation for agents, allowing LLMs to understand pages and perform complex web interactions.
NextChat is a lightweight, cross-platform AI assistant client supporting GPT-4, Claude, Gemini and more, with Web, desktop, and mobile experiences.
A curated collection of hundreds of community-verified MCP server implementations spanning databases, search engines, dev tools, browser automation, and more, helping developers quickly discover and integrate MCP services for their use cases.
MCP Servers provides a large collection of reusable Model Context Protocol server implementations, giving agents standardized tool capabilities.
A leading open-source RAG engine that fuses cutting-edge retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.
A high-throughput and memory-efficient inference and serving engine for LLMs, featuring PagedAttention, continuous batching, and optimized KV cache management for production deployments.
Codex CLI is OpenAI's open-source coding-agent command-line tool for code understanding, refactoring, generation, and terminal collaboration in developer workflows.
The ultimate space for work and life to find, build and collaborate with agent teammates that grow with you, enabling multi-agent collaboration and team design.
Lobe Chat is an open-source ChatGPT-style chat application with a plugin system and multi-model support, suitable as an agent conversation interface.
AI research automation agent by Andrej Karpathy that automatically runs nanochat training research experiments on a single GPU.
Comprehensive guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Daytona provides secure development-environment infrastructure for coding agents and automation workflows, serving as a runtime base for remote execution tasks.
OpenHands is an open-source AI software engineering agent platform that can automatically execute development tasks, modify code, and support collaborative iteration.
The Multi-Agent Framework for building the first AI Software Company, enabling natural language programming with multi-role collaboration for automated requirement analysis, design, coding, and testing.
MetaGPT is a multi-agent framework that assigns different roles to GPTs to form a collaborative entity for complex tasks. Given one line of requirement, it returns PRD, design, tasks, and code.
A financial data platform for analysts, quants and AI agents, providing comprehensive financial data access across stocks, crypto, economics and more.
Pathway is a Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG applications.
Open Interpreter is a natural language interface for computers that lets LLMs run code locally to perform file operations, data analysis, and system management tasks.
An open-source long-horizon SuperAgent harness by ByteDance that researches, codes, and creates with sandboxes, memories, tools, skills, subagents and message gateway for complex tasks.
A Claude Code plugin that automatically captures coding session context, compresses it with AI, and injects relevant context back into future sessions for persistent memory.
Cline is an autonomous coding agent in your IDE that can create/edit files, execute commands, use the browser, and more with your permission every step of the way.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.
Ready-to-run cloud templates for RAG, AI pipelines and enterprise search with live data, always in sync with Sharepoint, Google Drive, S3, Kafka and more.
AnythingLLM is an all-in-one AI productivity app with a self-hosted chat UI, RAG knowledge base, AI agents, and multi-model management, privacy-first with zero configuration.
Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.
Microsoft AutoGen is a multi-agent conversation framework that lets you create multiple agents to collaborate through dialogue and solve complex tasks.
12 Lessons to Get Started Building AI Agents by Microsoft. Hands-on curriculum covering core agent concepts, tool use, and multi-agent collaboration.
GPT Engineer is an AI tool that generates entire codebases based on natural language descriptions. Just describe what you want to build, the AI asks for clarification, and then builds it.
Platform to experiment with AI Software Engineer — specify software in natural language, watch AI write and execute code, then iterate improvements
A curated list of awesome Claude Skills, resources and tools for customizing Claude AI workflows.
A nano claude code-like agent harness built from scratch, demonstrating how to build AI coding assistants from zero to one.
Embedchain is a universal memory layer for AI agents, enabling quick integration of diverse data sources into LLMs for context-aware AI applications.
Mem0 is a long-term memory layer for AI agents, supporting cross-session memory management and personalized context retrieval.
Context7 is Upstash's context-engineering toolkit for agents, helping applications manage long context windows, retrieval injection, and history compression.
An open-source agent harness platform providing the best agent toolkit, supporting multiple AI coding agents.
Flowise is a low-code builder for LLM apps that lets you create agent workflows and RAG applications with drag-and-drop interfaces.
AI-driven public opinion and trend monitor with multi-platform aggregation, RSS subscriptions, smart keyword filtering, AI-powered news analysis and briefings, supporting MCP integration and push notifications via WeChat, Feishu, DingTalk, Telegram and more.
TradingAgents is a multi-agent trading framework built with LangGraph that mirrors real-world trading firm dynamics with specialized LLM-powered agents for fundamental analysis, sentiment analysis, risk management, and more.
Build agents that monitor and act on your behalf. Create automated agents for Twitter, weather monitoring, web scraping, and many other scenarios.
CrewAI is a framework for orchestrating role-based AI agents. It lets you create teams of agents with distinct goals that collaborate on complex tasks.
LlamaIndex is a data framework that provides the data connection layer for LLM applications, with strong RAG capabilities across diverse data sources and vector databases.
MemPalace is an open-source AI memory system providing a persistent long-term memory layer for AI agents, with ChromaDB vector storage and MCP protocol integration.
The original local LLM interface supporting text generation, vision, tool-calling, and training with both a web UI and API. Runs 100% offline and private.
A cross-platform desktop All-in-One assistant tool for managing Claude Code, Codex, OpenCode, OpenClaw and Gemini CLI agents in one place.
From vibe coding to agentic engineering — a practice guide helping developers master Claude Code best practices and advanced techniques.
An AI-driven low-code platform with zero-code and code-generation modes, featuring built-in AI chat, knowledge base, workflow orchestration and MCP plugin system.
Open-source AI engine to run any model — LLMs, vision, voice, image, video — on any hardware without GPU. Provides OpenAI-compatible API for fully local, privacy-first AI inference.
Milvus is a high-performance open-source vector database built for AI applications. It supports storage, indexing, and similarity search of large-scale vector data, ideal for RAG, recommendation systems, and more.
Cherry Studio is an AI productivity studio with smart chat, autonomous agents, and 300+ assistants, providing unified access to frontier LLMs.
LiteLLM provides a unified interface and proxy gateway for LLM calls, simplifying multi-model switching, routing, and cost control.
AI pair programming in your terminal. Collaborate with LLMs to edit code, manage Git, and refactor across multiple files with deep developer workflow integration.
Aider is an AI coding assistant that lets you pair program with LLMs in your terminal. It supports Git integration and can automatically commit code changes, perfect for developer pair programming.
CowAgent (formerly chatgpt-on-wechat) is a powerful AI assistant framework built on LLMs with autonomous planning, tool use, long-term memory, multi-agent collaboration, and multi-channel integration for WeChat, Feishu, DingTalk, and more.
The cloud-native API and AI Gateway providing LLM request routing, rate limiting, load balancing and observability for AI agent applications.
Open-source extensible AI coding agent that goes beyond code suggestions — install, execute, edit, and test with any LLM.
Goose is an open-source autonomous coding agent by Block that automates code writing, file operations, and terminal commands in the development environment.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer, supporting multiple local LLMs with a full desktop chat UI and API server.
OpenSpec is a spec-driven development (SDD) platform that guides AI coding assistants to generate code through specification definitions, improving development efficiency and code quality.
Fabric is an open-source framework for augmenting humans using AI, providing a modular system of crowdsourced AI prompts for solving specific problems anywhere.
An accessible multi-agent sentiment analysis assistant that breaks filter bubbles, reveals true public opinion, and predicts trends — built from scratch without external frameworks.
nanobot is an ultra-lightweight personal AI agent that supports multiple LLM backends for quickly deploying a private intelligent assistant.
Agno is a high-performance agent framework for building multimodal AI agents with memory, knowledge, and tool-use capabilities, supporting multiple LLM providers.
Phidata is a framework for building AI agents with memory, knowledge, and tool integration to make agents more capable and useful.
Chatbox is a powerful cross-platform AI client supporting OpenAI, Claude, Gemini, and other LLMs with desktop and mobile apps.
Opinionated RAG framework for integrating GenAI into your apps. Works with any LLM, any vectorstore, any files — so you can focus on your product instead of building RAG pipelines.
A generative speech model for daily dialogue, providing AI agents with natural and fluent voice synthesis with fine-grained prosody control.
MindsDB is a query engine for AI analytics that enables building self-reasoning agents across live data, connecting diverse data sources with AI models.
A comprehensive tutorial on AI agent principles and practice, systematically covering core concepts, framework usage and hands-on projects.
A local knowledge base RAG and Agent application platform built on Langchain with support for ChatGLM, Qwen, Llama and other LLMs, offering conversation, knowledge base management, and agent capabilities.
Open-source low-code platform for building internal tools, dashboards, business applications, workflows and AI agents with visual drag-and-drop development.
An adaptive web scraping framework that intelligently handles anti-bot measures, from single requests to full-scale crawls, designed for AI agent data collection.
Pi Mono is a comprehensive AI agent toolkit including a coding agent CLI, unified LLM API, TUI and web UI libraries, Slack bot, and vLLM pod management for end-to-end agent development.
AgentGPT is a platform for assembling, configuring, and deploying autonomous AI Agents in your browser, allowing users to create goal-driven agents that execute tasks autonomously.
MCP server providing Chrome DevTools capabilities to coding agents, enabling web debugging, performance analysis, and DOM manipulation automation.
LibreChat is an enhanced open-source ChatGPT clone featuring Agents, MCP tools, multi-model support, code interpreter, AI search, and more.
A Python library by Google for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization, designed for data annotation and knowledge extraction workflows.
AI-powered job search system built on Claude Code with 14 skill modes, Go dashboard, PDF generation and batch processing.
A modern open-source VPS control panel with native AI agent support, enabling Ollama model deployment, AI agent management and full server stack control.
Khoj is a self-hostable AI second brain that answers questions from the web or your docs, builds custom agents, schedules automations, and performs deep research.
An AI-powered answering engine with multi-model integration, web search and local knowledge base, providing a Perplexity-like search experience.
Intelligent automation and multi-agent orchestration for Claude Code. Supports automated workflows, task coordination, and intelligent agent system building.
DSPy is a declarative LLM programming framework focused on optimizable prompts and program structure, suitable for complex agent workflows.
An installable library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI and more, with installer CLI, bundles and workflows.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications. Rich notebook examples for learning AI engineering practices.
LightRAG is a simple and fast Retrieval-Augmented Generation framework using graph-enhanced retrieval, published at EMNLP 2025.
Tabby is a self-hosted AI coding assistant supporting code completion, code generation, and enterprise-grade deployment, compatible with major IDEs.
Chatbot UI is an open-source AI chat interface supporting OpenAI, Claude, Gemini and more, with a modern conversation UI and flexible deployment options.
AI-powered PDF scientific paper translation with preserved formats, supporting Google/DeepL/Ollama/OpenAI services via CLI/GUI/MCP/Docker/Zotero.
ChatDev 2.0 enables full-lifecycle software development through LLM-powered multi-agent collaboration, simulating role-based teamwork in a virtual software company.
Continue is an open-source AI code assistant extension for VS Code and JetBrains IDE. It can autocomplete code, refactor, and explain code, helping developers improve programming efficiency.
A modular graph-based Retrieval-Augmented Generation system by Microsoft that uses LLMs to extract structured knowledge graphs from text, enabling global and local community summarization queries.
The leading agent orchestration platform for Claude. Deploy intelligent multi-agent swarms, coordinate autonomous workflows, and build conversational AI systems. Features enterprise-grade architecture, distributed swarm intelligence, RAG integration, and native Claude Code/Codex integration.
One API is an LLM API management and redistribution system that unifies OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, and more under a single API. Supports key management, redistribution, and one-click Docker deployment.
An event-driven agentic orchestration platform providing a durable and highly resilient execution engine for applications and AI agents.
CLI-Anything aims to make all software agent-native by transforming applications into unified CLI interfaces, enabling AI agents to naturally interact with and operate any software through a centralized CLI Hub.
Playwright MCP is a Microsoft MCP server exposing Playwright browser automation capabilities to AI agents, supporting web interaction, screenshots, and structured data extraction.
Qdrant is a high-performance vector database widely used as the retrieval layer for RAG and agent memory search scenarios.
LLM-powered stock analysis system for A/H/US markets with multi-source quotes, real-time news, LLM decision dashboard and multi-channel push notifications.
CopilotKit is an open-source framework for building AI agent frontends, supporting Generative UI and the AG-UI Protocol to help developers quickly integrate agent capabilities into apps.
Fast, small, and fully autonomous AI personal assistant infrastructure built with Rust. Deploy anywhere, swap anything, on any OS and platform.
An AI Agent assistant that integrates multiple IM platforms, LLMs, plugins and AI features, supporting QQ, Telegram, Discord and more.
An AI agent that automates the job application process, analyzing job requirements and tailoring applications for personalized mass submission.
Teams-first multi-agent orchestration for Claude Code. Designed for team collaboration with support for multi-agent coordination, task distribution, and result integration to enhance team AI development efficiency.
An open-source browser automation CLI for AI agents by Vercel, built with Rust for high performance and programmability.
LangGraph is an agent workflow orchestration framework from the LangChain team, using graph structures to model agent state and transitions.
ByteDance's open-source multimodal AI agent stack connecting cutting-edge AI models with agent infrastructure for GUI automation and computer control.
AI coding assistant skill that turns any folder of code, docs, papers, images, or videos into a queryable knowledge graph. Works with Claude Code, Codex, Cursor, Gemini CLI, GitHub Copilot CLI, and more.
GitHub's official MCP Server providing standardized access to GitHub APIs for AI agents, supporting repository management, issue handling, and PR operations.
Void is an open-source AI code editor built on VS Code architecture, supporting Claude, GPT, and other models, delivering a Cursor-style intelligent coding experience.
A curated collection of 500 AI agent use cases across industries including healthcare, finance, education, and retail. Showcases practical applications with open-source project links.
The Monorepo Platform that amplifies both developers and AI agents, optimizing builds, scaling CI and automatically fixing failed PRs.
The Zero-Server Code Intelligence Engine — a client-side knowledge graph creator running entirely in your browser with a built-in Graph RAG Agent for code exploration.
An open-source low-code platform for building AI agents, automations and business applications, model agnostic with drag-and-drop visual development.
Sim is a platform to build, deploy, and orchestrate AI agents with a visual low-code workflow editor, supporting OpenAI, Anthropic, DeepSeek and more for enterprise agent orchestration.
Composio is a tools and SaaS integration layer for agents, helping applications connect quickly to services like Gmail, Slack, and GitHub for multi-tool workflows.
FastGPT is a knowledge-based platform built on LLMs, offering out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration for easily developing and deploying complex question-answering systems.
Microsoft Semantic Kernel is a lightweight SDK for combining large language models with conventional programming languages to build AI agent applications.
Chroma is an open-source AI-native embedding database designed for building LLM applications. It provides simple APIs to store embeddings and perform similarity search, making it ideal for RAG applications.
A lightweight AI assistant platform running securely in containers. Connects to WhatsApp, Telegram, Slack, Discord, Gmail with memory, scheduled jobs, and built on Anthropic's Agents SDK.
Open source AI platform with enterprise-grade AI chat, advanced RAG and AI search capabilities that works with every LLM.
New API is a unified AI model hub for aggregation and distribution, supporting cross-conversion of various LLMs into OpenAI, Claude, or Gemini-compatible formats. A centralized gateway for personal and enterprise model management.
A curated list of AI autonomous agents. A comprehensive collection of open-source agent projects for discovering and understanding the agent ecosystem.
A comprehensive showcase of advanced Retrieval-Augmented Generation (RAG) techniques with detailed notebook tutorials and code examples, covering foundational to cutting-edge RAG implementations.
smolagents is a lightweight agent framework from Hugging Face for quickly building tool-using LLM agents.
An autonomous agent that conducts deep research on any data using any LLM providers. Automatically collects information, analyzes data, generates research reports with multi-source data integration.
GPT Researcher is an autonomous research agent that can gather, organize, and analyze information to produce detailed research reports.
An AI prompt optimizer that helps users write better prompts and achieve improved AI results.
The agentic development environment built for coding with multiple AI agents, providing a next-generation terminal experience.
Fully local Manus AI alternative that autonomously browses the web, writes code, and interacts via voice, with no API costs
LLM Frontend for Power Users with multi-model support, rich role-playing features, extensible plugin system, and local deployment.
MLflow is the open-source AI engineering platform for debugging, evaluating, monitoring, and optimizing AI agents and LLM applications, with model and data access management.
Vercel's official collection of agent skills, providing practical skill modules and tools for AI coding agents.
Kotaemon is an open-source RAG-based tool for chatting with your documents, featuring a clean chat interface and support for multiple LLM and embedding model backends.
Get 10X more out of Claude Code, Codex or any coding agent. Manage agent tasks through kanban boards, track progress, and optimize workflows.
Langfuse is an open-source observability platform for LLM applications, supporting tracing, evaluation, prompt versioning, and cost analytics.
Graphiti is a temporal knowledge-graph engine for agent memory, helping systems continuously accumulate long-term context.
Open-AutoGLM is an open phone agent model and framework enabling AI to autonomously operate smartphone interfaces, unlocking the AI Phone experience for everyone.
A unified CLI for Google Workspace covering Drive, Gmail, Calendar, Sheets, Docs and more, with built-in AI agent skills for automation.
Haystack is an enterprise-grade framework for RAG and search applications, covering document processing, retrieval, generation, and evaluation end to end.
Agent skills for Obsidian. Teach your agent to use Markdown, Bases, JSON Canvas, and use the CLI.
FastMCP is a fast, Pythonic library for building MCP servers and clients with over 1 million daily downloads, making it easy to create Model Context Protocol tools.
A multi-agent LLM-powered Chinese financial trading framework, enhanced Chinese version of TradingAgents with multi-source market data, real-time news, and LLM decision-making.
Open-source multi-agent framework from Alibaba, enabling the construction of observable and interpretable agents with rich distributed capabilities.
Repomix packs your entire repository into a single AI-friendly file, perfect for feeding your codebase to LLMs like Claude, ChatGPT, and DeepSeek for analysis, review, or code generation.
Vercel AI SDK is a web and full-stack AI toolkit with streaming, tool calling, and agent patterns for integrating agent capabilities into JS/TS applications quickly.
An open-source AI coding agent that lives in your terminal, built by Qwen team with support for code generation, editing, debugging and multi-file operations.
Chat with your SQL database using natural language. Accurate Text-to-SQL Generation via LLMs using Agentic RAG.
A2A (Agent-to-Agent) Protocol is an open protocol by Google enabling interoperability and collaborative communication between AI agents built across different frameworks and vendors.
Roo Code is an autonomous coding agent extension for VS Code and JetBrains that can create/edit files and run terminal commands directly in your editor.
Open-source agentic coding CLI by the Charm team, supporting multiple LLM backends for autonomous coding in the terminal
Mastra is a TypeScript-first agent platform that combines workflows, memory, RAG, evaluation, and deployment for scalable full-stack AI agent applications.
Powerful MCP toolkit for coding that provides semantic retrieval and editing capabilities, serving as an IDE for AI agents
Crawl4AI is a web crawling toolkit for LLM and agent systems, offering structured extraction, site traversal, cleanup, and crawl controls for external knowledge acquisition.
A web scraping and browser automation library for Node.js to build reliable crawlers, supporting Puppeteer, Playwright, Cheerio, and raw HTTP. Extract data for AI, LLMs, RAG, or GPTs with proxy rotation and both headful and headless modes.
MCP Python SDK is the official Python implementation for building MCP servers and agent-side integrations with a standardized tool protocol.
OpenViking is an open-source context database from Volcengine that unifies management of agent memory, resources, and skills through a filesystem paradigm, enabling hierarchical context delivery and self-evolution.
An AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket and the web, then synthesizes a grounded summary.
Open-source coding agent CLI supporting OpenAI, Gemini, DeepSeek, Ollama, Codex, GitHub Models, and 200+ models via OpenAI-compatible APIs.
A workflow orchestration framework for building resilient data pipelines and AI workflows in Python, with task scheduling, state management, and failure recovery from local to distributed deployments.
Letta (formerly MemGPT) is an open-source framework for building stateful AI agents with advanced reasoning and transparent long-term memory. It allows you to visually test, debug, and observe agents.
Stagehand is Browserbase's browser-control framework for agents, focused on structured web actions, repeatable automation, and developer-friendly browser task orchestration.
A free, local, open-source 24/7 cowork app supporting multiple coding agents like Gemini CLI, Claude Code, and Codex with unified management and collaboration features.
Jina AI Serve is a cloud-native framework for building multimodal AI applications, supporting RAG pipelines, agent systems, and multimodal search.
Marketing skills for Claude Code and AI agents, covering CRO, copywriting, SEO, analytics and growth engineering.
Activepieces is an open-source AI workflow automation platform with 400+ MCP servers for AI agents, enabling no-code business process orchestration.
A powerful GUI app and Toolkit for Claude Code — create custom agents, manage interactive Claude Code sessions, run secure background agents, and more.
An autonomous agent for deep financial research. Automatically analyzes financial reports, market data, company filings, generates investment recommendations and risk assessment reports.
OpenAI Swarm is a lightweight multi-agent collaboration framework focused on simplicity and controllability, ideal for learning and prototyping.
GenAI Agents is a comprehensive collection of 50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
Skyvern is an agent platform for browser task automation, using page understanding and action planning to complete complex web workflows such as forms and back-office tasks.
Agent harness built with LangChain and LangGraph. Equipped with a planning tool, filesystem backend, and ability to spawn subagents for complex agentic tasks.
A memory upgrade for coding agents. Provides persistent contextual memory for Claude Code, Codex, and other coding agents to improve long-task consistency.
Open-source vector similarity search extension for PostgreSQL, enabling native vector storage and ANN retrieval in relational databases, a foundational component for building agent memory and RAG systems.
MaxKB is an open-source knowledge base Q&A and agent building platform powered by LLMs, with vector retrieval, workflow orchestration, and multi-model support out of the box.
Open-source AI agent development platform from Coze, providing visual tools to simplify agent creation, debugging, and deployment with one-click publishing to multiple channels.
A simple, open format for guiding coding agents. Define agent behavior, rules, and skills through structured AGENTS.md files to help AI coding assistants better understand project requirements.
Promptfoo is an evaluation and regression testing tool for LLM apps and agents, useful for comparing prompts, tool-call results, and model outputs over time.
Dyad is a local AI app builder that lets users create and iterate on web applications through natural language conversations, supporting multiple LLM providers.
A Claude Code plugin that shows what's happening — context usage, active tools, running agents, and todo progress for enhanced agent workflow visibility.
OWL (Optimized Workforce Learning) is a multi-agent collaboration framework for real-world task automation, decomposing and executing complex tasks through agent interaction.
An autonomous company operating system powered by AI agents, providing intelligent workflow automation for research, data analysis, customer communication and other business processes.
Open-source agentic software engineer and Devin alternative with planning, reasoning, web browsing, and multi-model support
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
bolt.diy is an open-source platform to prompt, run, edit, and deploy full-stack web applications using any LLM you want, providing a visual development environment for AI-powered app creation.
Google Agent Development Kit (ADK) is Google's agent development framework for building complex AI agent systems with tool integration and multimodal processing capabilities.
SWE-agent takes a GitHub issue and automatically generates fixes using your LLM of choice, also applicable to cybersecurity auditing and competitive coding. NeurIPS 2024 paper.
Claude Code skill implementing Manus-style persistent markdown planning — the structured workflow pattern for agent task management.
Opik is an open-source LLM observability platform providing agent tracing, evaluation testing, and prompt experiment management to help developers monitor and optimize AI agent systems.
End-to-end, code-first tutorials for building production-grade GenAI agents. From prototype to enterprise deployment.
AI-powered research assistant that performs iterative deep research on any topic by combining search engines, web scraping, and LLMs
A set of ready-to-use Agent Skills for research, science, engineering, analysis, finance and writing across multiple coding agents.
Open-source deep research agent from Alibaba Tongyi Lab, using multi-stage iterative information retrieval and reasoning to conduct deep analysis, synthesis, and summarization of complex topics with web search and document analysis.
DB-GPT is an open-source agentic AI data assistant framework integrating multi-agent collaboration, RAG, and AWEL workflow engine, purpose-built for AI+Data applications.
KiloCode is an all-in-one open-source coding agent platform for VS Code and JetBrains, integrating 200+ models with autonomous coding, debugging, and iteration capabilities.
OpenAI's framework for evaluating LLMs and LLM systems, providing an open-source registry of benchmarks and tools for systematic model assessment.
An autonomous agent framework for everyone, built in TypeScript with multi-platform deployment support and a rich plugin ecosystem for conversational AI agents and social bots.
The interaction control harness for customer-facing AI agents, optimized for building controlled, consistent and predictable customer interactions with LLMs.
Parlant is an agent platform for support and conversational workflows, focused on dialog state, tool invocation, and controlled interaction design.
A private AI platform for agents, assistants, and enterprise search with built-in agent builder, deep research, document analysis, and multi-model support.
Use your Neovim like using Cursor AI IDE. AI-powered code generation, editing, and chat deeply integrated into the Neovim ecosystem.
Give your AI agent eyes to see the entire internet. Read and search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu with one CLI and zero API fees.
SuperAGI is a dev-first open-source autonomous AI agent framework for building, managing, and running useful autonomous agents quickly and reliably.
Page Agent is a JavaScript in-page GUI agent by Alibaba that controls web interfaces with natural language, enabling automated form filling, page navigation, and element interaction.
An autonomous AI agent loop that runs repeatedly until all PRD items are complete, automating the entire development cycle.
Agent Zero is a general-purpose AI agent framework supporting autonomous task planning, tool use, and code execution for building self-directed AI assistants.
Agent Skills is a curated collection of production-grade engineering skills for AI coding agents, maintained by Addy Osmani, providing battle-tested best practices and operational conventions.
Agent Lightning is Microsoft's open-source training framework for AI agents, using reinforcement learning to enhance agent capabilities.
CAMEL is an open-source framework for multi-agent collaboration, supporting role-play, task decomposition, and coordinated execution.
PydanticAI builds agents on top of type systems, emphasizing verifiable data structures, tool calling, and production-grade reliability.
ChatALL lets you concurrently chat with ChatGPT, Bing, Bard, Claude, ChatGLM, and many more LLMs to discover the best answers through side-by-side comparison.
PUA is a highly proactive AI agent skill that motivates agents to continuously improve and deliver high-quality results within 30 days, using a performance-driven persona approach.
A universal CLI Hub and AI-native runtime that transforms any website, Electron app, or local binary into a standardized command-line interface built for AI agents.
A knowledge engine for AI agent memory that builds knowledge graphs and memory layers in 6 lines of code, supporting graph databases, vector stores, and more for knowledge extraction and retrieval.
RagaAI Catalyst is an observability, monitoring, and evaluation framework for Agent AI, supporting agent/LLM/tool tracing, multi-agent debugging, and self-hosted dashboard analytics.
Agent framework built on Qwen LLM, featuring function calling, MCP tool integration, code interpreter, RAG, and browser extension support.
All-in-one RAG framework supporting text, images, tables, equations and more document formats for retrieval-augmented generation with unified knowledge QA.
Weaviate is an open-source vector database that stores objects and vectors, allowing for combining vector search with structured filtering. It has built-in vectorization modules and supports multimodal data search.
Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click. Features multi-role AI teachers, intelligent Q&A, and personalized learning paths to redefine online education.
A web interface for running AI agents in the browser, providing a visual experience for browser automation operations.
Multica is the open-source managed agents platform that turns coding agents into real teammates with task assignment, progress tracking, and compound skill accumulation.
An orchestration platform for developing, producing, and observing data assets and AI workflows, with built-in asset definitions, scheduling, and monitoring.
ChuanhuChatGPT is a lightweight GUI for ChatGPT API and many LLMs, supporting agents, file-based QA, web search, and GPT finetuning with a neat UI.
A multi-agent orchestration system inspired by ancient governance structures, featuring 9 specialized AI agents with a real-time dashboard, model configuration, and full audit trails for complex multi-agent collaboration scenarios.
Open source AI coding agent designed for large projects and real world tasks, providing terminal-based code generation with multi-step planning and file management.
An open-source tool by OpenAI that turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.
DeepCode is an open agentic coding platform supporting Paper2Code, Text2Web, and Text2Backend, leveraging agent technology for automated software development workflows.
A comprehensive collection of Agent Skills for context engineering, multi-agent architectures, and production agent systems. Use when building, optimizing, or debugging agent systems.
MemVid is a long-term memory layer for AI agents that uses video encoding for lightweight single-file storage, replacing complex RAG pipelines with instant retrieval.
OpenAI Agents SDK is OpenAI's official agent development toolkit, supporting the building of multi-step workflow AI agents with core features like tool calling and state management.
Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language and get accurate SQL, charts, and BI insights. Supports 12+ data sources and any LLM.
llmware is a unified enterprise RAG framework for deploying small specialized models, featuring knowledge graphs, document parsing, vector indexing, and agent toolchains for building private, compliant AI applications.
DeepEval is an open-source evaluation framework for LLM applications. It provides rich evaluation metrics and tools, supporting unit testing and integration testing to help developers build reliable LLM applications.
MCP Toolbox is an open-source MCP server for databases by Google, enabling agent access to PostgreSQL, MySQL, BigQuery, Spanner, and more.
Botpress is an open-source conversational AI platform with a visual flow editor, knowledge base integration, multi-channel deployment, and GPT/LLM agent building capabilities for enterprise chatbot development.
Trigger.dev is an open-source platform for background jobs and workflow automation, well suited for long-running asynchronous agent execution in production.
Unstructured provides document parsing and cleaning capabilities, commonly used in RAG ingestion and preprocessing pipelines.
MCP server that provides Figma layout information to AI coding agents like Cursor, enabling precise design-to-code conversion.
Multi-agent workspace manager that supports agent team collaboration, task scheduling, and resource allocation. Provides a unified workspace view for efficient multi-agent coordination.
QAnything is an open-source local knowledge base Q&A system by NetEase Youdao, supporting any file format with offline RAG capabilities for building private knowledge Q&A.
CUA provides open-source infrastructure for Computer-Use Agents, including sandboxes, SDKs, and benchmarks to train and evaluate AI agents that control full desktops (macOS, Linux, Windows).
A CLI tool for code structural search, lint, and rewriting based on AST. Written in Rust, supports 20+ languages, providing precise code pattern matching for AI coding agents.
Ragas is a framework for evaluating RAG (Retrieval Augmented Generation) systems. It provides various evaluation metrics including faithfulness, answer relevance, context precision, helping developers optimize RAG application performance.
A memory system for 24/7 proactive agents with MCP protocol integration, providing long-term memory management, skill storage, and proactive reasoning capabilities for continuously running AI agents.
Agent-native memory infrastructure that turns agent execution and conversation into structured, persistent state with an LLM-agnostic memory layer, MCP integration, and Python/TypeScript dual SDK support.
An introductory guide to context engineering - the systematic approach to building high-quality context for AI coding assistants, centered on Claude Code but applicable to any AI coding tool.
AG-UI is the open-source implementation of the Agent-User Interaction Protocol, defining a standardized interaction protocol between AI agents and frontend applications, initiated by the CopilotKit team.
Instructor is a Python library providing structured outputs for LLMs using Pydantic models, enabling AI agents to receive reliable typed responses — a key building block for agent tool-use.
NanoBrowser is an open-source Chrome extension for AI-powered multi-agent browser automation, supporting web task workflows with your own LLM API key.
An open-source AI coworker with persistent memory, supporting multi-turn conversations and context retention for knowledge management and collaborative task completion.
AI-powered vision-driven UI automation that lets you describe actions in natural language instead of writing selectors, supporting browser and mobile platforms
An automated penetration testing agentic framework powered by large language models for security testing and vulnerability discovery.
All-in-one AI framework for semantic search, LLM orchestration, and language model workflows with agent support, RAG, and vector database
Run any open-source LLMs such as DeepSeek and Llama as OpenAI-compatible API endpoints in the cloud. Supports fine-tuning, quantization, and distributed inference for production-grade LLM deployment.
A framework for few-shot evaluation of language models by EleutherAI, providing standardized evaluation pipelines supporting hundreds of benchmark tasks and widely adopted as a core LLM evaluation tool in the community.
MCP TypeScript SDK is the official TypeScript implementation for building MCP servers and clients, standardizing protocol integrations across JS/TS agent ecosystems.
A powerful AI coding agent. Built for the terminal. Supports code generation, refactoring, debugging with intelligent suggestions and automated workflows.
Chainlit is an open-source UI and development framework for LLM and agent chat applications, enabling fast delivery of interactive assistants.
Library to expose FastAPI endpoints as Model Context Protocol tools with authentication support, enabling AI agents to call existing APIs directly
E2B provides secure cloud sandboxes for AI agents, supporting code execution, file operations, and isolated compute as an execution layer for coding and automation workflows.
LangChain4j is a Java library that simplifies LLM integration through a unified API, supporting popular models and vector databases with built-in RAG, tool calling, MCP, and agent capabilities that integrate seamlessly with enterprise Java frameworks.
LLM is Simon Willison's open-source CLI and plugin framework for working with multiple models through one interface, with embeddings, templates, tool extensions, and lightweight agent workflows.
Open-source BGE series embedding models and retrieval tools from BAAI, providing state-of-the-art text embeddings and rerankers for Chinese and English, widely used in RAG systems and agent retrieval pipelines.
Industry-first professional AI Agent platform for controllable film and video production, covering the entire pipeline from shorts to live-action.
PAL MCP Server unifies Claude Code, GeminiCLI, and CodexCLI with multiple LLM providers (Gemini, OpenAI, OpenRouter, Azure, Grok, Ollama, and custom models) into a single collaborative MCP service.
Pipecat is an open-source framework for voice and multimodal conversational AI, enabling real-time voice assistants, video bots, and multimodal agents with integrated TTS, STT, and LLM services.
Portkey AI Gateway is a blazing fast AI gateway with integrated guardrails, routing to 200+ LLMs with 50+ AI guardrails through a single fast and friendly API.
Open-source LLM DevOps platform providing one-stop AI application development with GenAI workflow, RAG, Agent, model management, evaluation, and enterprise system administration.
A Chrome extension-based MCP server that exposes browser functionality to AI assistants, enabling complex browser automation, content analysis, and semantic search.
TensorZero is an open-source inference gateway and optimization platform for LLM apps and agent systems, focused on high-performance serving, experimentation, routing, and production observability.
Promptflow is a development and evaluation toolkit for LLM applications and agent workflows, with visual orchestration and debugging.
Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic access to NotebookLM features including capabilities the web UI doesn't expose, via Python, CLI, and AI agents like Claude Code, Codex, and OpenClaw.
LlamaGPT is a self-hosted, offline ChatGPT-like chatbot powered by Llama 2. 100% private with no data leaving your device, with Code Llama support and one-click deployment via Umbrel.
The original open-source AI PR reviewer. Automatically analyzes pull requests and generates code review feedback, improvement suggestions, and PR descriptions across GitHub, GitLab, and Bitbucket.
Open-source agentic framework that uses computers like a human, capable of completing complex GUI tasks with autonomous learning and experience accumulation.
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device. Published at MLsys 2026.
Eino is an open-source Go framework from ByteDance for building LLM applications, offering type-safe orchestration, streaming, tool calling, and RAG pipelines for high-performance AI agent applications.
HuggingChat UI is the open-source chat interface by HuggingFace powering the HuggingChat service, supporting conversations with various open-source LLMs.
The open-source Agentic browser that transforms your browser into an AI-powered operating system. Alternative to ChatGPT Atlas, Perplexity Comet, and Dia.
HumanLayer provides a human-in-the-loop layer for AI coding agents, enabling them to seek human approval and guidance when solving hard problems in complex codebases.
PocketFlow is a minimalist 100-line LLM framework that lets Agents build Agents, enabling complex AI agent workflows through a clean abstraction layer.
TEN Framework is an open-source framework for building conversational voice AI agents with real-time multi-modal interaction support.
ValueCell is a community-driven multi-agent platform for financial applications, enabling collaborative financial analysis, trading strategies, and market research through multi-agent orchestration.
A complete search engine and RAG pipeline in your browser, server, or edge network. Supports full-text, vector, and hybrid search in less than 2kb. Perfect for building AI-powered search experiences anywhere.
OpenHarness is an open agent harness platform with a built-in personal agent called Ohmo, providing an integrated solution for agent development, testing, and deployment.
A collection of projects showcasing RAG, agents, workflows, and other AI use cases with practical examples and tutorials.
GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal. Supports code generation, command suggestions, error fixing and more.
LiveKit Agents is LiveKit's real-time voice and multimodal agent framework for phone, assistant, and interactive use cases that need low-latency experiences.
Hive is a production-ready multi-agent execution harness providing state management, failure recovery, observability, and human-in-the-loop control with auto-generated multi-agent topologies for complex business workflows.
OpenSandbox is an open-source, secure, fast, and extensible sandbox runtime for AI agents, developed by Alibaba.
An open-source embedded retrieval library for multimodal AI with zero server configuration, using the Lance columnar format for efficient vector search and filtering, ideal for agent memory and RAG applications.
Open-source web UI for Claude Code, Cursor CLI, and Codex enabling remote management of AI coding sessions and projects from mobile and web.
Universal skills loader for AI coding agents. One-command installation of skill packages. Extends agent capabilities with code review, test generation, documentation writing and more.
MCP Use is a Model Context Protocol orchestration project that helps agents connect to MCP servers, unify tool invocation, and improve portability across toolchains.
Code editor for the AI agents era that runs an army of Claude Code, Codex, and other coding agent instances in parallel
An open-source asynchronous coding agent by LangChain built on LangGraph, autonomously handling software engineering tasks including code generation, debugging, and file editing.
Microsoft's comprehensive multi-language framework for building, orchestrating, and deploying AI agents and multi-agent workflows with support for Python and .NET.
Hindsight is an agent memory system that learns autonomously, supporting memory retention, recall, and reflection to give AI agents persistent experiential memory.
TypeScript/React component library for building AI chat interfaces with customizable, production-ready UI components supporting multiple AI providers.
MCP Inspector is a debugging and inspection tool for the Model Context Protocol ecosystem, useful for validating MCP server behavior and troubleshooting integrations.
A lightweight, lightning-fast, in-process vector database by Alibaba with C++ core, Node.js and Python bindings, designed for RAG, agent memory, and vector search use cases.
Enterprise-grade agentic workflow platform from iFlytek, offering commercial-friendly SuperAgent building capabilities with complex workflow orchestration and multi-agent coordination.
AI-powered autonomous web browsing framework that enables agents to click, type, navigate, and extract data like a human, with support for OpenAI, Anthropic, and Google models.
OpenRLHF is a high-performance agentic RL framework based on Ray and vLLM, offering PPO, DAPO, and REINFORCE++ algorithms for large-scale training of agents and vision-language models.
Phoenix is an open-source observability and evaluation tool for LLM and agent applications, supporting online tracing and offline diagnosis.
Spring AI Alibaba is an Agentic AI Framework for Java developers, built on the Spring ecosystem to provide multi-agent collaboration, workflow orchestration, and RAG capabilities.
A Data Agent Ready Warehouse unifying Analytics, Search, AI, and Python Sandbox in one system. Runs on your S3 with built-in vector search, full-text search, and Python execution for AI-powered data analysis.
ART (Agent Reinforcement Trainer) trains multi-step agents for real-world tasks using GRPO reinforcement learning, enabling on-the-job training for models like Qwen, Llama, and more.
Fully-automated and zero-code LLM agent framework that enables users to build and deploy custom AI agents through natural language without writing code.
Your favorite Terminal Coding Agent, now in Rust. Provides high performance and memory safety. Supports code generation, file editing, command execution and complete development workflow.
LangChainGo is the Go implementation of LangChain, providing the easiest way to write LLM-based programs in Go with chains, agents, and tool integrations.
AI Data Runtime for Agents. Provides serverless Postgres with a multimodal datalake, enabling scalable retrieval and training. Unifies vector storage, dataset management, and streaming data loading for AI agent workflows.
Next Generation Multi-tenant AI One-Stop Solution with built-in admin and billing. Enterprise-grade unified LLM gateway supporting 200+ models and 35+ providers.
Crucix is a personal intelligence agent that watches the world from multiple data sources and pings you when something changes, helping you stay on top of information in real time.
Official MCP server collection from AWS, providing AI agents with integration to core AWS services including Lambda, S3, DynamoDB, and Bedrock.
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs for building logical reasoning and factual Q&A solutions for professional domain knowledge bases, effectively overcoming the limitations of traditional RAG vector similarity models.
An MCP bridge connecting AI assistants to the Unity Editor, enabling LLMs like Claude and Cursor to manage assets, control scenes, edit scripts, and automate game development tasks through natural language.
Go implementation of the Model Context Protocol SDK enabling seamless integration between LLM applications and external data sources and tools
A private and local AI personal knowledge management app. All data and processing stay on-device with built-in RAG, semantic search, and knowledge graph features for managing personal knowledge bases with full privacy.
An autonomous LLM agent framework for complex task solving with automatic task decomposition, tool usage, and multi-step reasoning from the OpenBMB team
An application framework for AI engineering from the Spring team, providing unified LLM integration, vector storage, function calling, RAG, and agent development for Java and Spring ecosystems with support for OpenAI, Anthropic, Ollama, and more.
UFO is a Windows GUI automation agent by Microsoft that understands screen interfaces and executes complex OS tasks through natural language commands.
MCP server for Ghidra reverse engineering platform, enabling AI agents to autonomously perform binary analysis and vulnerability discovery.
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services. Supports multiple programming languages and platforms with unified Agent API interface.
MemOS is a Memory Operating System for LLMs and AI agents that unifies store, retrieve, and manage for long-term memory, with built-in KB, multi-modal, and tool memory support.
An amazing UI for OpenAI's ChatGPT with enhanced conversation management, prompt templates, and model parameter tuning across Web, Windows, MacOS, and Linux.
mcp-agent is LastMile AI's toolkit for building agents around Model Context Protocol integrations, making it easier to connect MCP tools into multi-step workflows.
The first GitHub Copilot, Codeium, and ChatGPT Xcode Source Editor Extension, bringing AI code completion and chat directly into Apple development workflows.
VoltAgent is an agent platform for the modern TypeScript ecosystem, focused on workflows, tool orchestration, and application integrations for production web agents.
A 24/7 online AI agent team that automates information collection, data analysis and content generation for continuous operations.
HexStrike AI is an advanced MCP server that lets AI agents autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, and security research.
The official Lark/Feishu CLI tool maintained by the larksuite team, covering Messenger, Docs, Base, Sheets, Calendar, Mail, Tasks, Meetings and more with 200+ commands and 20+ AI Agent Skills for both humans and AI agents.
A deep research agent framework optimized for complex research and prediction tasks, with MiroThinker-1.7 and MiroThinker-H1 models achieving 74.0 and 88.2 on BrowseComp benchmark, supporting multi-step reasoning and information retrieval.
BAML is an AI framework that adds engineering rigor to prompt engineering, offering type-safe prompt definitions, automatic testing, version management, and multi-model support across Python, TypeScript, Ruby, Java, C#, Rust, and Go.
GitMCP is a free remote MCP server that enables AI agents to understand and access any GitHub project repository, eliminating code hallucinations.
Agent framework designed for fintech and enterprise scenarios, providing task orchestration, tool integration, and production-grade reliability with multi-LLM backend support.
A production-ready Agentic RAG system with RESTful API, featuring multimodal document ingestion, hybrid search, knowledge graph construction, and agent-driven retrieval-augmented generation workflows.
A next-generation Android RPA agent framework that enables intelligent device control through agent-driven automation, with smart UI element recognition and automated operations for mobile testing and intelligent assistants.
Sweep is an AI coding assistant for JetBrains IDEs that automatically resolves GitHub issues and submits code changes, automating software development workflows.
Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider, using Stream's edge network for ultra-low latency realtime interactions.
An open-source RAG chatbot powered by Weaviate vector database, supporting multiple data import methods, LLM backends, and embedding models for out-of-the-box retrieval-augmented generation.
Mintlify is a developer documentation and AI-search platform that gives agent toolchains, SDKs, and APIs a structured knowledge surface for both humans and assistants.
An open-source, code-first Go toolkit by Google for building, evaluating, and deploying sophisticated AI agents with flexible tool integration, multi-turn conversation management, and streaming responses.
Polyglot document intelligence framework with a Rust core, extracting text, metadata, and structured data from PDFs, Office documents, images and 91+ formats via MCP server, CLI, and REST API.
Flexible and powerful framework for managing multiple AI agents and handling complex conversations across providers like OpenAI, Anthropic, and AWS Bedrock.
Agent Squad is an open-source multi-agent orchestration framework from AWS for managing multiple AI agents and handling complex conversations.
NVIDIA's open-source LLM vulnerability scanner that automatically detects security issues in language models including safety vulnerabilities, hallucination tendencies, jailbreak risks, and prompt injection attacks.
Yao is a single-binary runtime to build and run autonomous agents — no Python, no Node.js, just define the role. Provides lightweight, high-performance agent development framework.
Context Mode is a context window optimization tool for AI coding agents that sandboxes tool output for 98% context reduction across 12 major platforms.
Evidently is an open-source ML and LLM observability framework with 100+ metrics for evaluating, testing, and monitoring any AI-powered system or data pipeline.
Refly is the first open-source agent skills builder. Define skills through vibe workflows and run them on Claude Code, Cursor, Codex and more. Skills are infrastructure, not prompts.
Manage multiple AI terminal agents like Claude Code, Codex, OpenCode, and Amp in a unified terminal interface
OpenLLMetry is an open-source observability tool for LLM applications based on OpenTelemetry, providing tracing, metrics, and monitoring capabilities.
PraisonAI is a low-code multi-agent framework with handoffs, guardrails, memory, RAG, and 100+ LLM providers, deployable to Telegram, Discord, and WhatsApp.
Build modular and scalable LLM Applications in Rust. Provides agent orchestration, tool-use, RAG pipelines, and other core capabilities for high-performance AI agent systems.
big-AGI is a feature-rich AI suite providing multi-model parallel chats, AI personas, text-to-image, voice synthesis, code highlighting and execution, PDF import, and more. Deploy on-prem or in the cloud.
OpenCompass is a comprehensive LLM evaluation platform supporting a wide range of models including Llama, Mistral, GPT-4, Qwen, GLM, and Claude across 100+ benchmark datasets.
Steel Browser is an open-source browser sandbox purpose-built for AI agents and applications. It provides a full browser API with session management, proxy integration, and built-in anti-detection, enabling web automation without infrastructure headaches.
An LLM-based multi-agent framework for web search engines, similar to Perplexity.ai Pro and SearchGPT, enabling intelligent web search.
Open-source AI agent platform for financial analysis using LLMs, featuring intelligent research, market forecasting, and automated financial report generation.
The official community-driven registry service for Model Context Protocol servers, providing discovery, publishing, and version management for the MCP ecosystem.
Guardrails AI adds programmable guardrails to large language models, ensuring reliability and safety through input/output validation, structured data extraction, and custom validators.
AppAgent is an LLM-based multimodal agent framework designed to operate smartphone apps like a human, supporting touch interaction and autonomous exploration.
TalkToFigma is an MCP integration tool that enables AI agents (Cursor, Claude Code) to communicate with Figma for reading designs and modifying them programmatically.
Julep is a serverless AI workflow deployment platform for building and scaling AI agent applications, described as Firebase for AI agents.
stagewise is a purpose-built browser for developers with a coding agent integrated right in, enabling direct code interaction from the web interface.
LLM-driven extraction of unstructured data, built for API deployments and ETL pipeline workflows. Automates document parsing, PDF extraction, and intelligent data processing with LLM-powered intelligence.
Superagent protects AI applications against prompt injections, data leaks, and harmful outputs, embedding safety directly into your app.
IntentKit is an open-source, self-hosted cloud agent cluster that manages a collaborative team of AI agents for complex task completion.
An LLM playground you can run on your laptop. Compare models side-by-side for prompt testing and model evaluation in a local environment.
BrowserMCP is a browser extension-based MCP server that allows AI applications like Claude and Cursor to directly control and automate your browser.
An AI-native proxy and data plane for agentic apps with built-in orchestration, safety, observability, and smart LLM routing so developers can focus on agent core logic.
LaVague is a Large Action Model (LAM) framework for developing AI web agents, combining RAG techniques for natural-language-driven browser automation.
An agentic orchestrator for parallel coding agents that plans tasks, spawns agents, and autonomously handles CI fixes, merge conflicts, and code reviews for complex development workflows.
Swarms is an enterprise-grade production-ready multi-agent orchestration framework for deploying and scaling collaborative AI agent swarms.
TaskWeaver is Microsoft's open-source code-interpreter-style agent framework, suitable for data analysis and complex task automation.
Official Firecrawl MCP Server that adds powerful web scraping and search capabilities to Cursor, Claude, and other LLM clients.
GSD-2 is a powerful meta-prompting, context engineering, and spec-driven development system that enables agents to work autonomously for long periods without losing track of the big picture.
NVIDIA NeMo Guardrails is an open-source toolkit for adding programmable guardrails to LLM-based conversational systems, supporting topic control, safety enforcement, and dialog guidance.
DevOpsGPT is a multi-agent system for AI-driven software development that combines LLMs with DevOps tools to convert natural language requirements into working software, supporting any development language and extending existing codebases.
Code search MCP for Claude Code and coding agents. Makes entire codebases available as context for AI coding assistants using vector-based semantic code search for precise understanding of large projects.
Rediscover your social memories with local, AI-powered analysis. Import chat histories from multiple platforms and analyze them with AI agents for insights and visualization.
Zero-Config Code Flow for Claude Code and Codex, providing one-click project initialization and context management.
DesktopCommanderMCP is an MCP server that gives AI assistants like Claude terminal control, file system search, and diff file editing capabilities.
A Claude Skill that gives your AI coding agent the ability to use a web browser for browser automation.
Atomic Agents is a modular AI agent building framework with an atomic design philosophy, providing composable components including tools, pipelines, and memory management for constructing agent systems.
AI-powered PPT generation tool that creates natively editable PPTX from any document, producing real PowerPoint shapes instead of images.
Curated collection of system prompts for top AI tools. Perfect for AI agent builders and prompt engineers. Including: ChatGPT, Claude, Perplexity, Manus, Claude-Code, Loveable, v0, Grok, same new, windsurf, notion, and MetaAI.
Open Multi-Agent is a lightweight TypeScript multi-agent framework that auto-decomposes tasks and executes them in parallel with a single runTeam() call. Only 3 dependencies, deploys anywhere Node.js runs.
MCP integration platform that lets AI agents use tools reliably at any scale, providing MCP servers, clients, and integration solutions for production agent workflows.
PySpur is a visual agent workflow editor that supports drag-and-drop construction of AI agent pipelines with built-in evaluations and human-in-the-loop support.
Strands Agents SDK is an AWS open-source agent framework using a model-driven approach to build AI agents with built-in tool use, conversation memory, and multi-agent collaboration.
Cross-platform chatbot framework made with love. Supports Discord, Telegram, QQ and more through a highly extensible plugin architecture.
Ottomator Agents is a collection of runnable agent examples and automation patterns covering research, browser actions, tool use, and multi-step flows for practical learning.
WhatsApp MCP Server provides AI assistants with WhatsApp messaging capabilities, enabling LLMs like Claude to interact with WhatsApp directly via the MCP protocol.
Helicone is an open-source proxy and observability platform for LLM applications, offering request tracing, caching, and cost analytics.
A low-code MCP framework for building complex and innovative RAG pipelines. Combines visual pipeline design with MCP protocol integration for end-to-end RAG — from data ingestion and chunking to retrieval and generation.
AgentOps is an observability platform for AI agents, providing monitoring, debugging, and evaluation to help developers optimize agent performance.
Playwright Model Context Protocol server for automating browsers and APIs in Claude Desktop, Cline, Cursor IDE and other AI coding tools
Next-generation AI Agent optimization platform providing full-lifecycle management capabilities from development, debugging, evaluation to monitoring with prompt management, agent evaluation, and LLM observability.
OpenSpace is a platform that makes your agents smarter, lower-cost, and self-evolving, optimizing agent architectures and reasoning workflows for efficient autonomous evolution.
Multi-model AI agent desktop client that connects to any AI provider, extends with MCP and skills, and supports remote control from your phone. Built with Electron and Next.js.
Claude Coder is an autonomous coding agent as a VSCode extension. It transforms mockups to code, auto-fixes lint errors, writes tests, and performs complex multi-file edits with an agent mode for autonomous task execution.
An open-source evaluation and testing library for LLM agents providing automated model scanning, bias detection, performance benchmarking, and compliance checks.
MCP server and CLI by Sentry providing AI agents with build, test, and development tools for iOS and macOS projects.
AI Chat Browser: Fast, full webapp access to ChatGPT, Claude, Bard, Bing, Llama and more. Quick switching and parallel usage across models.
A cross-platform desktop AI assistant and MCP client compatible with major LLM providers, featuring local knowledge base support and MCP server integration for a unified chat and tool-use experience.
Sparrow is a structured data extraction tool that supports instruction calling with ML, LLM, and Vision LLM for extracting structured information from documents, suitable for document parsing in RAG pipelines.
Windows MCP is an MCP server for the Windows desktop, providing AI agents with computer-use capabilities for desktop automation and system operations.
The RL bridge for LLM-based agent applications, providing a simple and flexible reinforcement learning framework to optimize agent performance.
ROMA (Recursive-Open-Meta-Agent) is a meta-agent framework for building high-performance multi-agent systems with recursive task decomposition and coordination.
AgentVerse is a multi-agent deployment framework by Tsinghua OpenBMB, offering task-solving and simulation paradigms for collaborative multi-LLM-agent systems.
An AI-powered custom node for ComfyUI that enhances workflow automation through natural language interaction, with intelligent node recommendations and parameter configuration.
MCP server for Atlassian tools (Confluence, Jira) that enables AI agents to directly read and interact with Jira issues, Confluence pages, and enterprise collaboration data.
Kodezi Chronos is a debugging-first language model achieving state-of-the-art performance on SWE-bench, capable of autonomously handling software debugging and code repair tasks.
A multi-tenant agent harness platform integrating LightRAG knowledge base and knowledge graphs, built with LangChain, Vue, and FastAPI, supporting DeepAgents, MinerU PDF parsing, Neo4j graph database, and MCP protocol.
Argilla is a collaboration platform for AI engineers and domain experts to build high-quality datasets, collect human feedback, and evaluate models.
Build production-ready agentic workflows with natural language, supporting browser automation, computer use, and RAG workflows
RouteLLM is a framework for serving and evaluating LLM routers, enabling cost reduction without compromising quality through intelligent request routing across multiple model tiers.
Cloudflare Agents is Cloudflare's platform for building agents on the edge runtime, combining Workers, durable state, and tool execution for low-latency production services.
Open-source all-in-one AI productivity platform combining a generalist AI agent, workflow engine, instant messaging, and online documents
ACI.dev is an open-source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function calling or a unified MCP server.
21st Magic MCP is a frontend-focused MCP server providing v0-like AI component generation capabilities inside Cursor, WindSurf, Cline, and other IDEs.
An interactive visualization tool for large embeddings by Apple. Explore, cross-filter, and search embeddings and metadata to understand and debug embedding models, vector retrieval, and RAG system behavior.
SWE-bench is a benchmark for evaluating language models on real-world GitHub issue resolution, featuring genuine problems from popular Python repositories, now a core standard for measuring AI coding agent capabilities.
An open-source AI presentation generator and API that creates professional slides from text, as an alternative to Gamma, Beautiful AI and Decktopus.
AutoRAG is an open-source RAG evaluation and optimization framework using AutoML-style automation to help developers automatically find the best RAG pipeline configurations and benchmark them.
A blazing fast inference solution for text embeddings models built in Rust, serving as core infrastructure for building RAG systems and vector retrieval pipelines with high throughput and low latency.
Build local voice agents with open-source models. An end-to-end speech-to-speech pipeline from HuggingFace for fully local voice AI agent deployment.
An LLM-based intelligent agent as a digital lifeform that values warmth, authenticity and genuine connection, with long-term memory and personalized conversation.
An MCP server and CLI that turns the browser into an API, allowing AI agents to control Chrome with existing login sessions for web operations, data scraping, and automation tasks without re-authentication.
Model Context Protocol server for mobile automation and scraping on iOS, Android, emulators, simulators and real devices
A CLI for Git worktree management, designed for parallel AI agent workflows. Run multiple AI coding agents simultaneously across branches.
The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs for cross-platform automation.
Deep Research enables deep research using any LLM provider, offering SSE API and MCP server support with OpenAI, Gemini, DeepSeek, Ollama, and more.
ByteRover CLI provides persistent structured memory for autonomous coding agents. It features context tree management, git-like version control, and cloud sync, compatible with Cursor, Claude Code, Windsurf, and 22+ coding agents via MCP integration.
Youtu Agent is a lightweight agent framework by Tencent that delivers out-of-the-box support for open-source LLMs, simplifying agent development and deployment.
An autonomous novel writing AI agent where multiple agents write, audit and revise novels with human review gates for quality control.
Claude Code Router is a model routing tool for coding-agent scenarios, unifying requests across providers to optimize cost, latency, and task-specific routing strategies.
An open-source enterprise-level AI knowledge base and MCP management platform with integrated knowledge retrieval, model management, and agent chat for enterprise AI applications.
Infinity is an AI-native database providing incredibly fast hybrid search of dense vectors, sparse vectors, tensors, and full-text, designed for LLM applications and RAG systems.
Zep is an AI agent memory management platform providing long-term memory, context management, and conversation history understanding through knowledge graph technology.
The AI-native Multi-Agent development platform built on Kotlin Multiplatform, covering all 7 phases of SDLC. Supports automated code generation, testing, deployment, documentation and full development workflow.
The easiest way to use Agentic RAG in any enterprise. Provides out-of-the-box retrieval-augmented generation capabilities with Docker-based deployment for simplified enterprise RAG application building and management.
Cognita is a modular RAG framework for production environments by TrueFoundry, supporting flexible document parsing, vector storage, and retrieval pipeline orchestration for scalable knowledge QA systems.
AG2 (formerly AutoGen) is an open-source AgentOS providing a multi-agent conversation framework with flexible agent orchestration, tool integration, and distributed collaboration for building complex multi-agent systems.
The official Go SDK for the Model Context Protocol, maintained in collaboration with Google, enabling developers to build MCP servers and clients in the Go ecosystem.
The UI design language and React library for Conversational UI by Alibaba, providing complete chat interface components for building customer service and conversational applications.
A zero-code platform for auto-generating production-grade AI agents using Harness Engineering principles with unified tools, skills, memory, and orchestration with built-in constraints and feedback loops.
Official GitHub CLI extension for Agentic Workflows, enabling definition and execution of AI agent workflows within the GitHub ecosystem for automated code reviews, issue handling, and more.
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container, providing a secure isolated execution environment for agents.
gptme is a terminal-based AI agent equipped with local tools for writing code, using the terminal, and browsing the web to build persistent autonomous agents.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent with long-term memory and task tracking.
The official Exa MCP server providing AI coding assistants and chat tools with powerful web search and crawling capabilities, including semantic search, precise content extraction, and deep crawling for real-time web information.
The official Notion MCP server enabling AI assistants to directly read and manipulate pages, databases, and content in Notion workspaces, with full API support for search, creation, and editing.
Turn any AI agent into a living microservice that is interoperable, observable and composable, enabling standardized agent communication and orchestration.
Agency Swarm is a reliable multi-agent orchestration framework built on OpenAI API, providing structured multi-agent collaboration and communication.
The official C# SDK for the Model Context Protocol, maintained in collaboration with Microsoft, enabling developers to build MCP servers and clients in the .NET ecosystem.
Full toolkit for running an AI agent service built with LangGraph, FastAPI, and Streamlit, providing a complete reference architecture for agent service deployment.
An open-source Collaborative Multi-Agent OS for transparent, human-in-the-loop task coordination via Matrix rooms. Features real-time task tracking, agent status monitoring, and collaborative decision-making.
Self-hosted AI agent orchestration platform for dispatching tasks, running multi-agent workflows, monitoring spend, and governing operations
A simple, secure MCP-to-OpenAPI proxy server that converts MCP tools into OpenAI-compatible API endpoints for seamless integration with any AI application.
The most powerful AI agent and AI chat software on Android. Supports local LLM execution, terminal operations, file management and more.
Universal memory layer for AI Agents providing scalable, extensible, and interoperable memory storage and retrieval to streamline agent state management for autonomous systems.
A spec-driven development workflow MCP server for AI-assisted software development, featuring a real-time web dashboard and VSCode extension for monitoring and managing project progress in AI coding workflows.
Meta's set of tools to assess and improve LLM security, including safety benchmarks, prompt injection detection, and output auditing to help evaluate and enhance the safety of large language models.
EverOS is a platform for building, evaluating, and integrating long-term memory for self-evolving agents, enabling AI agents to continuously accumulate experience and optimize themselves.
A JVM framework by JetBrains for building predictable, fault-tolerant, enterprise-ready AI agents across all platforms — from backend services to Android, iOS, and in-browser environments, with built-in MCP and multi-provider LLM support.
An open-source graph-vector database built from scratch in Rust, combining graph database and vector retrieval capabilities to provide AI agents with unified storage for both knowledge graphs and semantic search.
USearch is a fast open-source search and clustering engine for vectors and arbitrary objects, with bindings in C++, Python, JavaScript, Rust, Java, Swift, C#, Go, and Wolfram for large-scale vector retrieval.
Agenta is an open-source LLMOps platform providing prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
An open-source, vision-first browser agent that drives web automation through visual understanding, supporting complex web interaction tasks for QA testing and workflow automation.
A visualization MCP server by AntV with 25+ chart types, enabling AI assistants to generate line charts, bar charts, pie charts, maps, and more through MCP for data analysis and reporting.
Langroid is a Python multi-agent programming framework that leverages an intuitive Agent-Task-Tool abstraction to help developers build LLM-powered multi-agent applications.
Local persistent memory store for LLM applications including Claude Desktop, GitHub Copilot, Codex, and more. Provides durable context memory capabilities for AI agents.
A high-performance graph database built on GraphBLAS, optimized for LLM and GraphRAG scenarios with real-time knowledge graph construction and querying for graph-structured AI agent retrieval.
CozoDB is a transactional, relational-graph-vector database that uses Datalog for queries. Designed as the hippocampus for AI, it unifies graph traversal, vector search, and relational queries.
Open-source agentic development environment (YC W26) that runs multiple coding agents in parallel with any LLM provider
The lightweight ingestion library for fast, efficient and robust RAG pipelines. Supports multiple chunking strategies and embedding models to significantly improve retrieval-augmented generation results.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Mini SWE-Agent is a minimalist AI agent in just 100 lines of code that solves GitHub issues or assists developers in the command line, demonstrating core coding agent capabilities with minimal implementation.
An AI-driven local automation assistant like Manus, a computer use agent that uses natural language to make computers work autonomously.
A comprehensive single-package Retrieval-Augmented Generation platform built on Langflow, Docling, and OpenSearch, providing a complete pipeline from document parsing to vector retrieval and generation with multi-model and multi-vector-database support.
An embedded property graph database built for speed with built-in vector search and full-text search, implementing Cypher query language for knowledge graph construction and AI agent structured knowledge retrieval.
A task-aware agent-driven prompt optimization framework from Microsoft Research that iteratively refines prompts for better LLM performance.
LazyLLM is a lightweight multi-agent LLM application framework offering the easiest way to build multi-agent LLM apps, with built-in RAG, knowledge graph, fine-tuning, and integration with LangChain and LlamaIndex ecosystems.
Motia is a TypeScript platform that models APIs, background jobs, agents, and workflows together for teams that want one structure for business logic and automation.
An enhanced MCP server for interactive user feedback and command execution in AI-assisted development, with dual Web UI and desktop app support, intelligent environment detection, and cross-platform compatibility.
Real-time transport layer for Java AI agents supporting WebSocket, SSE, gRPC, and WebTransport/HTTP3, with native MCP, A2A, and AG-UI protocol support for building event-driven AI agent communication architectures.
Development environments for coding agents. Enable multiple agents to work safely and independently with your preferred stack. Provides isolated development environments to avoid conflicts and improve collaboration.
An agentic LLM-powered data processing and ETL system. Enables complex data transformations using natural language-defined pipelines, turning unstructured data into structured, analyzable outputs with LLM intelligence.
Model Context Protocol server for Excel file manipulation, enabling AI agents to read, create and modify spreadsheets
Context management for Claude Code with hooks for state maintenance via ledgers and handoffs. Enables MCP execution without context pollution and agent orchestration with isolated context windows for long-running conversations.
A self-evolving agent framework that grows a skill tree from a 3.3K-line seed, achieving full system control with 6x less token consumption.
A universal local knowledge base solution based on vector databases and GPT, providing one-stop document processing with vectorization, semantic search, and intelligent Q&A for building private knowledge bases.
OpenOperator is an open-source agent project for computer and browser control, focused on GUI automation, task execution, and human-in-the-loop workflows.
An AI Gateway, registry, and proxy by IBM that sits in front of any MCP, A2A, or REST/gRPC APIs, exposing a unified endpoint with centralized discovery, guardrails, and management.
OpenAgentsControl is an AI agent framework for plan-first development workflows with approval-based execution. Supports TypeScript, Python, Go, and Rust with automatic testing, code review, and validation.
Refact is a Rust-based AI coding agent that handles engineering tasks end-to-end, integrating into developer workflows with code completion, chat, agent actions, and self-hosted deployment.
A multi-agent personal assistant that captures real-time on-screen activities and consolidates them into structured memories, building a knowledge base that adapts to your digital experiences.
Tencent's full-stack AI red teaming platform integrating OpenClaw security scanning, agent scanning, skills scanning, MCP scanning, AI infrastructure scanning, and LLM jailbreak evaluation.
A meta-learning agent framework that learns and evolves through conversation, enabling agents to autonomously acquire new skills and optimize strategies.
Arrow is the first UI framework for the agentic era, tiny and performant with built-in WASM sandboxes for safe code execution, purpose-built for building AI agent interfaces.
Agentuity is a production-oriented agent platform focused on runtime, tool execution, and orchestration for teams building deployable agent services.
Expect tests your agent's code in a real browser, providing a visual browser testing environment to verify that AI agent-generated code works as expected.
Enterprise-grade multi-tenant AI agent development platform from China Unicom, featuring RAG, workflow orchestration, and MCP tool integration
NeurIPS 2024 RAG framework inspired by human long-term memory, combining knowledge graphs with personalized PageRank for continuous knowledge integration in LLMs.
The official Java SDK for Model Context Protocol servers and clients, maintained in collaboration with Spring AI, enabling MCP tool calling and context management in Java applications.
DeepResearchAgent is a hierarchical multi-agent system designed for deep research tasks and general-purpose problem solving, using a top-level planning agent to coordinate specialized sub-agents for automated task decomposition and efficient cross-domain execution.
A comprehensive benchmark to evaluate LLMs as agents (ICLR 2024), covering operating systems, databases, knowledge graphs, digital card games and more.
An agent framework for the JVM built in Kotlin, providing a complete toolchain for developing, orchestrating, and deploying AI agents in the Java/Kotlin ecosystem.
Dagu is a local-first declarative workflow engine that is file-based, self-contained, and air-gapped ready. A single binary that scales from laptop to distributed cluster with a persistent Workflow Operator.
TruLens is an open-source tool for evaluating and tracking LLM apps. It provides specialized evaluation for RAG applications including context relevance, groundedness, and answer relevance.
An open-source, developer-first LLMOps platform for streamlined prompt design, version management, real-time observability, monitoring, and team collaboration across LLM applications.
Bee Agent Framework is a production-ready AI agent development framework supporting both Python and TypeScript, offering multi-modal agent building, tool integration, and observability capabilities for rapid production deployment.
MTEB (Massive Text Embedding Benchmark) is a comprehensive benchmark framework for evaluating text embeddings across classification, retrieval, clustering, reranking, and more, helping select optimal embedding models for RAG systems.
Platform for LLM evaluations and AI agent testing, providing comprehensive tracing, evaluation, and quality monitoring to help teams build reliable AI applications.
Solace Agent Mesh is an event-driven multi-agent AI framework for building and orchestrating multi-agent systems with MCP integration and complex multi-step workflows.
Dynamic AI agent automation platform with multi-provider orchestration, adaptive memory, smart features, and a versatile plugin system
Deepgram Agent API is a real-time interface layer for voice agents, combining speech recognition, TTS, and dialog control for phone, assistant, and voice workflow applications.
Zed Agentic is Zed's open-source project for in-editor agent collaboration, focused on code understanding, editing suggestions, and enhanced developer workflows.
Collection of Apple-native tools for the Model Context Protocol, giving AI agents access to macOS system features like Notes, Calendar, Reminders and more
AutoCodeRover is a project structure-aware autonomous software engineer agent that achieves automated program repair and issue resolution by understanding the overall codebase architecture.
A fast TypeScript framework for building MCP servers with a clean, developer-friendly API for creating Model Context Protocol tools and services.
An AI multi-agent framework for .NET with multi-LLM backend integration, providing agent management, tool calling, and conversation state management for enterprise agent development.
PromptTools provides open-source tools for prompt testing and experimentation, supporting multiple LLMs (OpenAI, LLaMA) and vector databases (Chroma, Weaviate, LanceDB) to help developers systematically evaluate and optimize RAG systems.
Open Lovable is an open-source experimental project for conversational app generation, combining agent-style interaction, generative UI, and rapid prototyping.
OpenRouter Agents is OpenRouter's platform capability for multi-model agent use cases, focused on routing, tool calling, and unified access layers.
DIMOS is an agentic operating system for physical space, enabling natural language control of humanoids, quadrupeds, drones, and other hardware platforms, with multi-agent systems that seamlessly integrate cameras, lidar, and actuators.
An open-source autonomous agent powered by Grok, capable of executing tasks, browsing the web, and generating code directly from your terminal.
AgentStation is an open-source platform focused on agent runtime orchestration, tool execution, and developer workflows for unifying multiple automation capabilities.
Block Open is infrastructure for the open agent ecosystem, focused on runtimes, tool connectivity, and task orchestration for teams that want standardized agent platforms.
OpenPipe Artifacts is a data and artifact management tool for agent and LLM applications, helping teams track prompts, outputs, experiments, and evaluation records.
Open-source AI native terminal for cloud and infrastructure management, enabling you to deploy, troubleshoot, and automate services using natural language and intelligent agents.
The security toolkit for LLM interactions, providing prompt injection detection, PII anonymization, content safety auditing, and more to secure production LLM deployments.
Building a self-evolving ecosystem of AI agents with automatic optimization, role evolution, and multi-agent collaboration from single agent to complex systems.
An AI Agent builder and runtime by Docker Engineering, bringing container-native isolation, portability, and standardization to AI agent lifecycle management from development through production deployment.
DO Browser is a browser-task agent tool focused on page understanding, action planning, and automation, serving as a lighter alternative to browser-use or Stagehand.
Gradio Agents is Gradio's interaction-layer toolkit for agent interfaces, helping developers build demoable and testable agent UIs for prototyping and human-in-the-loop workflows.
MCP Gateway is a gateway layer for Model Context Protocol integrations, providing unified access, permission boundaries, and routing control between agents and tool services.
LMNR is an open-source observability platform for LLM and agent applications, focused on tracing, quality analysis, and production diagnostics.
Browser automation tool for AI agents and humans, providing high-performance web interaction capabilities built in Go
HELM (Holistic Evaluation of Language Models) is Stanford CRFM's open-source framework for holistic, reproducible, and transparent evaluation of foundation models including LLMs and multimodal models.
Oxylabs AI Studio Python SDK provides an all-in-one AI-powered web scraping toolkit integrating an AI scraper, crawler, browser agent, search engine, and sitemap tool for structured data extraction driven by natural language instructions.
Web app for interacting with any LangGraph agent (Python and TypeScript) via a chat interface
An improved implementation of the Ralph Wiggum technique for autonomous AI agent orchestration, built in Rust for reliable multi-agent task coordination and scheduling.
Camofox Browser is a headless browser automation server powered by Camoufox, a Firefox fork with C++-level fingerprint spoofing. It bypasses Google, Cloudflare, and most bot detection, providing token-efficient accessibility snapshots and stable element references for AI agents.
OpenAI Agents JS is the JavaScript version of the OpenAI Agents SDK, bringing tool calling, state orchestration, and runtime interfaces to JS/TS web stacks.
HyperAgent is an agent platform focused on task decomposition, toolchain coordination, and complex execution-flow control for advanced automation systems.
Coval is an evaluation tool for voice and conversational agents, helping teams test response quality, interaction stability, and real dialog behavior.
LangDB is a data and operations tool for LLM and agent applications, helping teams manage prompts, traces, and experiment versions as a lightweight operational layer.
GitAgent is a framework-agnostic, git-native standard for defining AI agents where identity, rules, memory, tools, and skills are version-controlled files in a Git repository, enabling reproducible and collaborative agent development.
Aide is a VSCode extension for AI-powered coding assistance, featuring one-click comments, code conversions, UI-to-code generation, and AI batch file processing.
Ruler applies the same rules to all coding agents. Unify your coding rules and configurations across Claude Code, Cursor, Copilot, and more.
All-in-one platform for search, recommendations, RAG, and analytics offered via API. Built in Rust with vector search, full-text search, and semantic reranking for enterprise-grade AI retrieval applications.
Engram is a persistent memory system for AI coding agents. Agent-agnostic Go binary with SQLite + FTS5, MCP server, HTTP API, CLI, and TUI interfaces.
Agent-oriented programming framework for building LLM applications in Java. Provides agent abstractions, tool calling, multi-agent collaboration, and other core capabilities for enterprise Java ecosystem integration.
Multi-agent orchestration workflow platform supporting Claude Code, Codex, Gemini, OpenCode and more. Provides unified orchestration interface for cross-platform agent collaboration.
FastRTC is a developer tool for real-time multimodal and voice applications, useful as a communication layer for low-latency agent conversations and interactive audio/video workflows.
Mem0 TS is the TypeScript version of Mem0, offering long-term memory management, preference extraction, and context compression for agent applications built in JS/TS stacks.
LLMTracer is a tracing tool for agent and LLM applications, helping developers capture call paths, tool execution, and state transitions for debugging and incident analysis.
Model Context Protocol server for converting web pages, PDFs, Office documents and other formats to Markdown for AI agent consumption
The Open Source Memory Layer For Autonomous Agents. Provides long-term memory, knowledge storage, context management with support for memory retrieval, associative reasoning, and knowledge graph construction.
Model Context Protocol server for searching and analyzing arXiv papers, enabling AI agents to retrieve and deeply analyze academic research
Griptape is a modular framework for building and deploying AI agents, supporting toolchains, memory, and multi-model integration.
Blaxel AI SDK is a production-focused toolkit for agent systems, emphasizing tool definitions, execution control, tracing, and service integrations for enterprise apps.
Contextal is a context management and retrieval-enhancement tool for multi-turn agents, long conversations, and complex knowledge injection workflows.
Gweaver is an experimental platform for multi-agent collaboration and task weaving, useful for exploring decomposition, coordination, and role-based execution.
Open-source local realtime voice AI system supporting fully offline real-time voice conversations, suitable for building private voice assistants and voice interaction applications.
Framework enabling AI agents to use real Android and iOS apps just like a human, supporting autonomous operation and interaction with mobile interfaces.
A library by Hugging Face for easily evaluating machine learning models and datasets, providing a wide range of metrics and evaluation methods.
Ouroboros is a spec-driven multi-agent framework that shifts from traditional prompting to specification-driven development, supporting multi-agent collaboration, MCP tool integration, and automated workflow orchestration for building high-quality agent systems.
HuggingFace's all-in-one toolkit for evaluating LLMs across multiple backends, deeply integrated with the HuggingFace ecosystem and providing flexible evaluation metrics and benchmark configuration.
OpenLIT is an open-source AI engineering platform providing OpenTelemetry-native LLM observability, GPU monitoring, guardrails, evaluations, prompt management, and playground, integrating with 50+ LLM providers and agent frameworks.
Versatile, UI-agnostic OpenAI-compatible plugin framework for adding custom pipelines like content filtering, RAG enhancement, and tool calling to any AI chat interface.
A full-stack AI infrastructure tool for data, model, and pipeline orchestration. Streamlines building versatile AI-first applications with a visual pipeline editor for end-to-end workflows from data ingestion to model inference.
No-code multi-agent framework to build LLM agents, workflows, and applications with your own data, supporting diverse data source integrations
Python and JS/TS SDK for running AI-generated code in secure cloud sandboxes with Jupyter-style code interpretation
Open-source Computer-Use-Agent that automates GUI interactions through natural language instructions, enabling intelligent desktop automation.
A CNCF Sandbox SRE Agent that automatically analyzes infrastructure logs and metrics to assist with incident diagnosis and system operations.
AI chat assistant for Obsidian with contextual awareness, smart writing assistance, and one-click edits. Features vault-aware conversations, semantic search, and local model support.
VectorAdmin is the universal tool suite for vector database management. Manage Pinecone, Chroma, Qdrant, Weaviate and more vector databases with an intuitive web interface for data import, querying, and maintenance.
An LLM-based multi-agent framework that lets developers easily build multi-agent applications with core abstractions for agent roles, tools, knowledge management, and collaboration patterns.
An open-source library by NVIDIA for efficiently connecting and optimizing teams of AI agents with orchestration, tool calling, and workflow management.
LLM Agent framework within ComfyUI integrating MCP server, TTS, OCR, GraphRAG, and other AI tool nodes for visual workflow building
AI chat client implementing the Model Context Protocol (MCP) with multi-model support and cross-platform desktop experience
Visual workflow builder for AI agents powered by Firecrawl - drag-and-drop web scraping pipelines with real-time execution. Build agent workflows without coding.
Official spec and SDK of MCP Apps protocol - the standard for UIs embedded in AI chatbots, served by MCP servers, enabling interactive user interfaces directly from MCP tools.
A graph-native context development platform for storing, enriching, and retrieving structured knowledge with semantic search and portable context cores, supporting RDF, SPARQL, and other standards for AI agent knowledge management.
Notte is a framework for building web agents and deploying serverless browser automation functions, providing reliable browser infrastructure and web-aware agent capabilities.
A framework for large language model evaluations developed by the UK AI Safety Institute (AISI), providing comprehensive model capability assessment tools with support for safety and alignment testing.
YoMo is a serverless AI Agent framework built on geo-distributed edge AI infrastructure, using low-latency stream processing for real-time agent orchestration and MCP tool integration, ideal for edge computing agent deployments.
Asynchronous coordination layer for AI coding agents providing identities, inboxes, searchable threads, and advisory file leases over FastMCP, Git, and SQLite.
AutoChain is a lightweight, extensible, and testable LLM Agent framework by Forethought, providing clean abstractions for agent building with automatic tool selection, conversation history management, and automated testing workflows.
An open-source chat UI for Ollama providing a clean, intuitive interface for local LLM conversations with model selection and conversation management.
OxyGent is an open-source multi-agent collaboration framework from JD.com, supporting flexible agent role definition, task decomposition, and collaborative orchestration for enterprise AI agent applications.
Open Agent Platform is LangChain's open-source deployment platform for agents, focused on multi-agent execution, long-running tasks, observability, and production orchestration.
An open-source LLM vulnerability scanner and AI red teaming kit for automated security fuzzing of LLM applications, detecting jailbreaks, prompt injection, and adversarial attacks.
Neuron AI is a PHP agentic framework for building production-ready AI applications, enabling developers to connect LLMs, vector databases, and memory systems to create agents that interact with data.
LLM Compiler for Parallel Function Calling (ICML 2024), significantly improving agent tool calling efficiency and speed through parallel execution.
Sourcery is an instant AI code review tool that automatically detects code issues, suggests refactoring, and improves code quality, integrating into developer workflows for real-time code review.
Elegant lightweight AI chat client with multi-workspace, plugin system, cross-platform sync, Artifacts, and MCP support — local first
CLI to control iOS and Android devices for AI agents, enabling coding agents to directly interact with mobile devices for testing and automation.
ShowUI is an open-source, end-to-end Vision-Language-Action model for GUI agents and computer use, capable of understanding screenshots and executing precise interface interactions.
An MCP client for Neovim that seamlessly integrates MCP servers into your editing workflow with an intuitive interface for managing, testing, and using MCP servers with your favorite chat plugins.
Powerful, self-hostable AI agent platform designed for maximum privacy and flexibility. A complete drop-in replacement for OpenAI Responses APIs running locally on consumer-grade hardware.
Desktop AI assistant with multi-model support (GPT-5, Claude, Gemini, Ollama, etc.), featuring chat, vision, voice, RAG, image generation, agents, and MCP plugins
An enterprise-grade platform for running and managing MCP servers with containerized deployment, security isolation, network policies, resource limits, and unified management of large-scale MCP server fleets via Kubernetes or Docker.
An open-source multi-agent simulation platform where multiple LLM-powered agents collaborate on complex tasks in shared environments, with customizable roles, memory systems, and environment interaction for studying multi-agent collaboration.
JVector is the most advanced embedded vector search engine, built in pure Java by DataStax. It provides high-performance ANN search for RAG and AI applications on the JVM.
Shannon is a production-oriented multi-agent orchestration framework built in Go, focusing on efficient and reliable agent coordination and task scheduling for enterprise-grade multi-agent systems.
A multi-modal vector database that supports upserts and vector queries using unified MySQL-compatible SQL on structured and unstructured data, meeting high concurrency and ultra-low latency requirements.
Vald is a highly scalable distributed vector search engine built on cloud-native architecture, designed for high-performance approximate nearest neighbor search across massive vector datasets.
Official sample application for OpenAI Computer Using Agent (CUA). Learn how to use CUA via the API on multiple computer environments.
Open-source persistent memory service for AI agents, supporting LangGraph, CrewAI, and AutoGen with REST API, knowledge graph, and autonomous memory consolidation.
An autonomous agent framework for Elixir built for distributed, autonomous behavior and dynamic workflows, leveraging BEAM VM concurrency and fault tolerance for production-grade agent systems with high availability.
An open-source MCP client that provides unified access to Model Context Protocol tools, enabling integration of any MCP server into AI applications with simplified tool calling.
High-performance code intelligence MCP server that indexes codebases into a persistent knowledge graph, supporting 66 languages with sub-millisecond queries and 99% fewer tokens.
A ChatGPT web client supporting multiple users, languages, and database connections for persistent storage. Provides Docker images and quick deployment scripts.
Sandbox your local AI agents so they can only read and write what they need. File system permission control for secure local agent execution.
OpenAdapt is an open-source agent tool for desktop automation and computer-use scenarios, capturing user interactions, replaying tasks, and enabling GUI automation workflows.
An open-source Cultivation World Simulator using Agentic Workflow to create a dynamic, emerging Xianxia world. Showcases multi-agent collaboration in complex scenarios.
LangGraph for Java — a library for developing AI agentic architectures in the Java ecosystem, designed to work seamlessly with both LangChain4j and Spring AI, supporting stateful graph-based workflows and complex agent orchestration.
Agentic AI framework for enterprise workflow automation. Uses LLM-powered pipelines for code reviews, DevOps, and other enterprise tasks.
A GenAI application development framework that simplifies agent interaction with structured data and chained-calls syntax, using event-driven flow for complex logic.
ChatArena is a multi-agent language game environment for LLMs, designed to develop and evaluate communication and collaboration capabilities of AI agents across diverse game scenarios.
An AI Agent workforce platform that assigns every team member an AI agent squad for multi-agent collaboration, task orchestration, and compound skill building to scale team capacity beyond headcount.
An open-source background agents coding system that autonomously executes coding tasks in the background, including code reviews, test generation, and feature implementation.
Superlinked Inference Engine is an open-source inference server and production cluster for embeddings, reranking, and extraction, providing high-performance data processing pipelines for RAG systems.
Self-hosted, always-on AI agent platform running in containers. Create multiple bots with long-term memory and connect them to Telegram, Discord, Feishu, Matrix, and more.
Framework for running agent evaluations and creating RL environments to measure and improve agent performance
A simple yet powerful agent framework for personal assistants, designed to enable intelligent interaction, multi-agent collaboration, and seamless tool integration with built-in memory and tree-of-thought reasoning.
MCP Language Server gives MCP-enabled clients access to semantic code tools like go-to-definition, find-references, rename, and diagnostics, providing AI agents with precise code navigation capabilities.
The first full-stack open-source self-evolving general AI agent, offering a fully local alternative to agentic platforms like Manus and Genspark AI with autonomous thinking, task planning, tool usage, and knowledge accumulation.
Open chat interface for all your models — a unified, modern frontend for connecting to various AI providers
Google ADK Java is Google's Java toolkit for building, evaluating, and deploying sophisticated AI agents, filling the agent framework gap in the Java ecosystem.
ColiVara is a suite of services for storing, searching, and retrieving documents based on visual embeddings. It uses vision models instead of chunking and text-processing, achieving state-of-the-art retrieval on both text and visual documents without OCR.
Realtime Voice AI on Arduino ESP32 with 100+ Voice AI Models for AI Toys, Companions, and Devices. Supports OpenAI Realtime, Gemini, Grok, and Eleven Labs.
An LLM prompt injection detector that combines heuristics, vector similarity, and language model-based detection to identify and block malicious prompt injection attacks.
Mirascope is a lightweight LLM development library that takes a type-safe, Pythonic approach to building LLM applications, emphasizing simplicity over framework constraints.
SWE-Lancer is an OpenAI benchmark dataset evaluating frontier language models on freelance software engineering tasks, covering real scenarios from simple bug fixes to complex feature development.
OctoTools is an agentic framework with extensible tools for complex reasoning, featuring a tool card system for flexible composition of diverse reasoning capabilities.
WebArena is a realistic benchmark environment for evaluating autonomous web agents. It provides Gym-like interactive website simulations covering e-commerce, forums, CMS, and more, enabling end-to-end task evaluation as a standard framework for web agent research.
A framework for building, running, and scaling AI agents as APIs and microservices, with built-in observability, auditability, and identity-aware access control from day one.
Adala is an autonomous data labeling agent framework that uses AI agents to automate data annotation, classification, and quality checks, significantly improving data processing efficiency.
Fully-featured web interface for Ollama LLMs built with Next.js. Supports local model conversations, multi-model switching, and browser-side persistent storage.
KaibanJS is a JavaScript-native multi-agent framework with a Kanban-inspired approach for managing agent collaboration, supporting task assignment, role definition, and parallel execution for rapid multi-agent system development.
CrewAI Tools provides reusable integrations for the CrewAI ecosystem, including search, scraping, database access, and code execution to extend multi-agent workflows quickly.
Playwright for Windows desktop automation, enabling AI agents to control desktop applications through natural language
A hyper-fast local vector database for use with LLM Agents, providing lightweight vector storage and similarity search capabilities for embedding as instant memory and knowledge retrieval components in agent applications.
LangMem is LangChain's memory layer for agents, helping developers add long-term memory, replay summaries, and context management to improve multi-turn performance.
An open source autonomous agent built in Rust that lives on your machines 24/7 and keeps your apps running on autopilot.
The Powerful Conversational AI JavaScript Library with UI for any LLM. Supports LangChain, HuggingFace, Vercel AI, and more. Works with React, Next.js, and plain JavaScript.
OpenReview is an open-source, self-hosted AI code review bot powered by Vercel that automatically analyzes pull requests and provides code review suggestions.
The first open-source Artificial Narrow Intelligence generalist agent that fully operates GUIs using only natural language. Uses Visualization-of-Thought and Chain-of-Thought reasoning for spatial perception and HID simulation.
A custom AI agent platform that lets teams build and deploy AI assistants by composing multiple agents, connecting them to internal knowledge bases and tools for trusted AI-powered collaboration in enterprise workflows.
A suite of tools for connecting AI to the web with a query language and Playwright integrations for precise, scalable web element interaction and data extraction.
Run coding agents in sandboxes. Control them over HTTP. Supports Claude Code, Codex, OpenCode, and Amp with isolated execution environments.
Phantom is an AI co-worker with its own computer, featuring self-evolving capabilities, persistent memory, and MCP server support, autonomously completing complex tasks like a virtual colleague.
HyperAgent is a Playwright-based AI browser automation framework offering high-level APIs like page.ai(), page.perform(), and page.extract(). It features built-in MCP client support and action caching, enabling AI agents to browse, interact, and extract data using natural language.
A high-performance vector database designed to handle up to 1 billion vectors on a single node, delivering significant performance gains through optimized indexing and execution. Also available as a cloud service.
TrustRAG is a RAG framework focused on reliable input and trusted output, providing complete RAG pipeline components including document parsing, chunking, retrieval, and reranking with multiple retrieval strategies and evaluation methods.
A Markdown-first memory system and standalone library for any AI agent. Provides memory storage and retrieval with vector search and semantic matching to help agents manage long-term context.
BaseAI is a serverless AI agent framework for web developers, enabling local-first agentic pipes, tools, and memory with one-command serverless deployment for rapid AI agent application delivery.
Agent-MCP is a multi-agent framework built on the Model Context Protocol (MCP) that enables coordinated, efficient AI collaboration with multiple specialized agents working in parallel on different aspects of a project.
BrowserWing turns browser actions into MCP commands or Claude Skills, allowing AI agents to control browsers efficiently and reliably with reduced dependency on heavy LLM interactions.
A self-organizing multi-agent collaboration platform where multiple AI agents work as an autonomous team, handling planning, executing, reviewing, and patrolling tasks with zero human intervention.
Langtrace is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations, and metrics for popular LLMs, agent frameworks, and vector databases.
KubeAI is a Kubernetes-native AI inference operator that makes it easy to serve ML models in production, supporting LLMs, VLMs, embeddings, and speech-to-text with autoscaling.
KubeAI is an AI runtime and inference platform for Kubernetes, useful as scalable model-serving and deployment infrastructure for agent systems in cluster environments.
Browserable is a self-hostable browser automation tool purpose-built for AI agents. It provides secure Docker-based browser environments with a JavaScript SDK, achieving 90.4% accuracy on the Web Voyager benchmark for autonomous web navigation.
A curated list of papers and resources for multi-modal Graphical User Interface agents, systematically covering computer use, mobile interaction and more.
AI browser automation assistant as a Chrome extension, privacy-first with MCP support, alternative to Claude Chrome and Manus Browser Operator
CVS Health's open-source uncertainty quantification library for language models, providing UQ-based hallucination detection with confidence scoring and mitigation tools to identify and reduce unreliable LLM outputs.
A curated list of awesome LLM and AI Agent Skills, resources and tools for customising AI Agent workflows. Works with Claude Code, Codex, Gemini CLI and custom agents.
Official Neo4j GraphRAG Python SDK providing an integrated toolkit for knowledge graph construction, vector retrieval, and graph querying, supporting agent-driven graph retrieval-augmented generation workflows.
Open-sourced computer use agents that can operate on cross-platform environments including Windows, macOS, Ubuntu, and Android. ICLR 2026 Oral paper project.
trpc-agent-go is a powerful Go framework for building intelligent agent systems with LLM integration, tool calling, multi-step reasoning, and workflow orchestration, designed for enterprise-grade agent systems in microservice architectures.
EmbedAnything is a highly performant, modular, and memory-safe embedding inference and indexing framework built in Rust, providing production-ready RAG ingestion and indexing pipelines for local and cloud deployment.
Microsoft's AI Agent Governance Toolkit providing policy enforcement, zero-trust identity, execution sandboxing, and reliability engineering for autonomous AI agents. Covers 10/10 OWASP Agentic Top 10.
A toolkit by Weights & Biases for developing AI-powered applications, providing LLM call tracing, evaluation experiment management, and versioning from prototype to production.
A low-code tool to rapidly build and coordinate multi-agent teams for complex task execution.
VectorDBBench is a benchmarking tool for vector databases, providing standardized performance testing and comparative analysis for popular vector databases including Milvus, Qdrant, Chroma, Weaviate, and more.
A better chatbot platform powered by Agent, MCP, and Workflows. Supports multi-model integration, visual workflow orchestration, and low-code configuration.
Production-grade multi-agent orchestration platform with JSON-defined agents, multi-tier memory, and built-in observability, battle-tested on 200+ enterprise AI agents with full enterprise deployment support.
Kodus AI is an open-source AI code review tool with full control over model choice and costs, automatically analyzing pull requests and delivering high-quality code review feedback.
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone. Provides out-of-box RAG solution with support for knowledge base building, semantic search, and context management.
AI Agent Evaluator and Red Team Platform. Provides systematic security evaluation and adversarial testing tools to discover and fix vulnerabilities in agent systems.
Agentica is a TypeScript AI function calling framework enhanced by compiler skills, using type-safe schemas to auto-generate function calls and boost LLM tool-use capabilities for reliable AI agent backends.
A MemAgent framework that can extrapolate to 3.5M context tokens, along with a training framework for RL training of any agent workflow.
An open-source AI Voice Agent that integrates with Asterisk/FreePBX using Audiosocket/RTP technology for low-latency AI-powered phone interactions.
The first open-source testing agent that enables UI, API, security, accessibility, and visual validations without writing code or maintaining tests
An agentic memory system for LLM agents inspired by human memory mechanisms, enabling dynamic memory generation, retrieval, and consolidation with automatic memory evolution and self-organization.
A security scanner for LLM agentic workflows. Automatically detects security vulnerabilities, prompt injection risks, and permission violations in agent pipelines before deployment.
An automatic prompt optimization framework by Salesforce AI Research that leverages LLMs to search for and refine prompts for improved model performance.
OpenTelemetry instrumentation for AI observability, providing standardized tracing, metrics collection, and span definitions for LLM inference processes to help developers monitor and debug AI agent systems.
Middleware providing an OpenAI-compatible API endpoint that bridges MCP tools to any client or framework supporting the OpenAI API format
chromem-go is an embeddable vector database for Go with a Chroma-like interface and zero third-party dependencies. It supports in-memory storage with optional persistence, ideal for lightweight RAG applications.
Layra is an enterprise-ready solution combining visual RAG with multi-step agent workflow orchestration, providing out-of-the-box document parsing, knowledge base construction, and intelligent Q&A capabilities.
A PostgreSQL vector database extension for building AI applications, adding high-performance vector search capabilities to PostgreSQL with support for generating and indexing embeddings directly in the database.
Augment SWE-bench Agent is the number one open-source SWE-bench Verified implementation, demonstrating how to build high-performance software engineering agents to automatically resolve GitHub issues.
Open-source AI agent desktop app for Windows and macOS with one-click install of Claude Code, MCP tools, and Skills, featuring sandbox isolation, multi-model support, and Feishu/Slack integration.
Lightweight AI agent framework with built-in memory, tool calling, and tree-of-thought reasoning, supporting multi-agent collaboration and self-learning, compatible with OpenAI, DeepSeek, Qwen, and other major LLMs with MCP/SSE protocol integration.
LangSmith SDK is LangChain's observability toolkit for LLM apps and agents, covering tracing, evaluation, dataset management, and debugging for production workflows.
A cross-platform, ultra-efficient SQLite extension that brings vector search capabilities to embedded databases, ideal for local-first RAG applications and agent memory storage.
Inngest Agent Kit is a TypeScript toolkit for agent development that combines step orchestration, tool calling, streaming execution, and event-driven workflows for production tasks.
A general memory system for AI agents powered by deep research, providing a flexible memory architecture that supports unified management and retrieval of multiple memory types including short-term, long-term, and episodic memory.
An AI agent framework focused on agent collaboration, featuring a clean API design and documentation-driven development approach, supporting task decomposition, coordination, and result aggregation across multiple agents.
TanStack Store is a lightweight state-management tool that works well for agent UIs, workflow frontends, and real-time consoles that need to manage agent state and event flows.
Golf MCP is a production-ready MCP Server framework with built-in auth, observability, debugger, telemetry, and runtime for building secure AI agent infrastructure.
Skales is a local AI desktop agent for Windows, macOS, and Linux. It features an agent skills system (SKILL.md), autonomous coding (Codework), multi-agent team collaboration, and desktop automation with 15+ AI providers, requiring no Docker or terminal.
Arrakis is a fully customizable and self-hosted sandboxing solution written in Go, designed specifically for AI agent code execution scenarios, providing a secure isolated runtime environment.
An open-source tool from Meta for LLM prompt optimization. Automates the process of continuously improving and refining LLM prompts.
A flexible multi-interface AI agent framework supporting reasoning, tool use, memory, deep research, blockchain interaction, and MCP protocol, capable of building agent applications ranging from simple conversations to complex research tasks.
Security gateway for AI coding agents providing security protection, workspace isolation, and multiplexing, supporting Claude, Copilot, Cline, and other IDE extensions to prevent sensitive data leaks and malicious prompt injections.
CodeFuse-muAgent is an innovative agent framework driven by a knowledge graph engine, integrating EKG (Enterprise Knowledge Graph) technology for multi-agent collaboration, RAG-enhanced retrieval, and tool learning.
Blades is a Go-based multimodal AI agent framework from the Kratos team, supporting vision, voice, and text interactions with built-in agent orchestration, tool calling, and memory management.
A private agent fleet platform with spec coding, where each agent gets its own GPU-accelerated desktop for running Claude, Codex, Gemini and open models.
A prompt management and debugging platform for LLMs, providing prompt logging, request tracking, replay capabilities, and debugging tools to help teams systematically manage LLM interactions and optimize prompts.
SWE-AF is an autonomous software engineering fleet platform using a multi-agent factory architecture. It orchestrates planner, coder, reviewer, and verifier agents to automate the full software engineering lifecycle from issue analysis to code fix, scoring 95/100 on benchmarks.
An end-to-end RL training framework by NVIDIA for orchestrating tools and agentic workflows. Optimizes multi-step agent decision-making and tool-use policies.
An MCP server powered by Mem0 for long-term agent memory, supporting user preference memory, context-aware retrieval, and cross-session memory persistence, also useful as a Python MCP server development template.
An MCP server for SearXNG, providing AI agents with privacy-friendly meta search engine capabilities.
Dapr Agents is a framework for building autonomous, resilient, and observable AI agents with built-in workflow orchestration, security, statefulness, and telemetry for production-grade agent deployments.
Git LRC is a free, unlimited AI code review tool that runs automatically on every commit, helping developers catch and fix code issues early in the development workflow.
Pipelex is a declarative language and devtool for building composable AI workflows, enabling definition, debugging, and execution of complex LLM pipelines and agent workflows.
An agent framework for llama.cpp that supports structured function calls and JSON output, enabling easy interaction with local LLMs without fine-tuning.
AI-powered software engineering multi-agent system with researcher and developer agents that automate code implementation through intelligent planning and execution.
Scaling data for SWE-agents (NeurIPS 2025 D&B Spotlight). A toolkit for automatically generating large-scale training datasets for software engineering agents.
Conversational voice AI agents platform for building natural language phone interactions with multilingual speech synthesis and real-time dialogue management.
Vectra is a local vector database for Node.js with features similar to Pinecone but built using local files. It supports semantic search and document embeddings with no external service dependencies, ideal for RAG application development in Node.js environments.
MCP Sequential Thinking server that recommends the most effective MCP tools at each reasoning stage, enhancing AI agent tool selection
A multi-agent framework written in Rust for building, deploying, and coordinating multiple intelligent agents, designed for high performance and memory safety in latency-sensitive production systems.
Agent SDK Go is a powerful Go framework for building production-ready AI agents, providing core features including tool use, conversation management, and multi-model support.
AgentLabs is a toolkit for agent development and testing, focused on experimentation, replay, and workflow support to improve iteration speed.
Implementing cognitive architecture and psychological memory concepts into Agentic LLM Systems. Explores short-term, long-term, and working memory engineering for AI agents.
Eidolon is the first AI Agent Server, providing a pluggable Agent SDK and enterprise-ready runtime supporting multi-agent orchestration, tool integration, and production deployment.
Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.
The official Docker MCP registry for centralized discovery, distribution, and management of MCP servers, providing standardized tool access for AI agents.
Quantalogic is a ReAct-based coding agent framework supporting multiple LLM backends, with tool use, reasoning chain management, and an extensible plugin system.
MCPAdapt is an adapter library that unlocks 650+ MCP server tools for use in popular agentic frameworks like LangChain, LlamaIndex, and more.
A multilingual benchmark for issue resolving. Extends SWE-bench to multiple programming languages for evaluating AI agent capabilities across diverse codebases.
LangChain AWS is LangChain's AWS integration library, supporting building AI agents using AWS Bedrock, Lambda, and other services with seamless AWS cloud integration.
Official Redis agent memory server providing fast and flexible persistent memory for AI agents and applications, with context management and session memory support.
Taskara is an orchestration platform for long-running agent tasks and multi-step automation, emphasizing persistence, scheduling, and execution control.