RAG Tools

Tools for retrieval-augmented generation

170 projects

Firecrawl

Firecrawl is the Web Data API for AI, turning web pages into clean, structured, LLM-friendly data with crawl, scrape, and search capabilities.

web-scraping · mcp · rag +2

LangChain

140.6k · Python

Active

LangChain is the open-source agent engineering platform that unifies model IO, tool calling, RAG, memory and observability under one composable framework.

agent-framework · rag · orchestration +2

llama.cpp

118.8k · C++

Active

llama.cpp is a lightweight C/C++ inference engine that runs a wide range of open-source large language models efficiently on consumer hardware.

llm-inference · llama · gguf +2

Awesome LLM Apps

116.2k · Python

Active

100+ AI Agent and RAG apps you can actually run — clone, customize, and ship. A great reference for quickly building LLM-powered applications.

agent · rag · llm +1

Supabase Vector

105.0k · TypeScript

Active

Supabase's built-in pgvector search, turning Postgres into a RAG database.

vector-db · postgres · rag +1

vLLM

84.9k · Python

Active

A high-throughput and memory-efficient inference and serving engine for LLMs, featuring PagedAttention, continuous batching, and optimized KV cache management for production deployments.

llm · python · framework +2

RAGFlow

84.0k · Go

Active

A leading open-source RAG engine that fuses cutting-edge retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.

rag · document-understanding · knowledge-base +3

Prompt Engineering Guide

76.1k · MDX

Stale

Comprehensive guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

prompt-engineering · rag · agent +2

MinerU

72.6k · Python

Active

Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.

data-processing · rag · python +2

Hello Agents

63.0k · Python

Active

A comprehensive tutorial on AI agent principles and practice, systematically covering core concepts, framework usage and hands-on projects.

agent · python · framework +1

Pathway

62.8k · Python

Active

Pathway is a Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG applications.

etl · streaming · rag +2

Docling

62.4k · Python

Active

Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.

document-parsing · pdf · rag +1

TrendRadar

60.1k · Python

Active

AI-driven public opinion and trend monitor with multi-platform aggregation, RSS subscriptions, smart keyword filtering, AI-powered news analysis and briefings, supporting MCP integration and push notifications via WeChat, Feishu, DingTalk, Telegram and more.

automation · llm · python +3

Embedchain

59.8k · Python

Active

Embedchain is a universal memory layer for AI agents, enabling quick integration of diverse data sources into LLMs for context-aware AI applications.

memory · rag · embeddings +2

Mem0

59.8k · Python

Active

Mem0 is a long-term memory layer for AI agents, supporting cross-session memory management and personalized context retrieval.

memory · rag · personalization +1

Pathway LLM App

59.2k · Jupyter Notebook

Active

Ready-to-run cloud templates for RAG, AI pipelines and enterprise search with live data, always in sync with Sharepoint, Google Drive, S3, Kafka and more.

rag · python · data-processing +1

Context7

58.4k · TypeScript

Active

Context7 is Upstash's context-engineering toolkit for agents, helping applications manage long context windows, retrieval injection, and history compression.

context · memory · retrieval +1

codegraph

56.4k · TypeScript

Active

CodeGraph is a context graph for coding agents, mapping how a codebase is wired together so LLM-driven tools can navigate dependencies and produce more accurate edits.

code-intelligence · mcp · rag +2

Daily Stock Analysis

52.5k · Python

Active

LLM-powered stock analysis system for A/H/US markets with multi-source quotes, real-time news, LLM decision dashboard and multi-channel push notifications.

agent · python · llm +2

LlamaIndex

50.5k · Python

Active

Leading data framework for LLM applications, with unified RAG, Agent, and Workflow capabilities.

rag · llm · agent-framework +2

LlamaIndex

50.5k · Python

Active

Data framework for LLM apps specializing in RAG and agent data integration.

rag · llamaindex · agent-tools +1

LlamaIndex

50.5k · Python

Active

LlamaIndex is a data framework for building LLM applications. It provides data connectors, indexing, query engines, and agent workflow orchestration — a core tool in the RAG ecosystem.

rag · data-framework · indexing +2

LlamaIndex

50.5k · Python

Active

LlamaIndex is a data framework that provides the data connection layer for LLM applications, with strong RAG capabilities across diverse data sources and vector databases.

rag · llm · indexing +1

LocalAI

47.2k · Go

Active

Open-source AI engine to run any model (LLMs, vision, voice, image, video) on any hardware, no GPU required. Provides an OpenAI-compatible API for fully local, privacy-first AI inference.

llm · api · local +3

Showing 24 of 170 projects

Related Articles

Memory · Memory Systems · Long-term Memory

Agent Memory Architecture: Working, Long-term, and Shared Memory Trade-offs

A systematic comparison of the three categories of agent memory -- working, long-term, and shared -- covering storage media, lifecycle, retrieval methods, typical frameworks, and design patterns, with attention to the engineering of agent personalization and multi-agent collaboration.

AI Agent · Memory Systems · Vector Retrieval

Designing Agent Memory Systems: From Short-Term Context to Persistent Knowledge

A deep dive into the four-layer agent memory architecture, with practical code for vector retrieval and memory compression to help you build scalable long-term memory systems.

small-language-models · edge-inference · fine-tuning

Agent Small-Model Finetuning and Edge Inference

Exploring how small language models are fine-tuned and deployed for agent workloads at the edge, balancing latency, cost, and accuracy for production AI agents.

Fault Tolerance · Tool Calling · Retries

Agent Tool-Call Fault Tolerance: Timeouts, Retries, Fallbacks, Idempotency

A systematic guide to seven tool-call fault tolerance patterns: timeout hierarchy, exponential backoff with jitter, circuit breakers, fallback provider chains, recoverable error classification, structured validation, and idempotency keys -- keeping agents stable in unstable real-world environments.

Letta · MemGPT · AI Agent

Building Stateful AI Agents: A Deep Dive into Letta (MemGPT)

Learn how to build stateful AI agents with long-term memory using Letta (formerly MemGPT), solving the LLM context window limitation.

Context Engineering · Long Context · RAG

Context Engineering: Context Decay and Recovery in Long-Conversation Agents

Long-conversation agents fail at context management, not model capability. A systematic comparison of sliding window, retrieval injection, and layered compression strategies with practical decay diagnosis and recovery patterns.
