Best RAG Tools Top 20

The 20 most popular open-source RAG tool projects, ranked by GitHub stars.

Firecrawl

Firecrawl is the Web Data API for AI, turning web pages into clean, structured, LLM-friendly data with crawl, scrape, and search capabilities.

web-scraping, mcp, rag, data-extraction

LangChain

140.6k Stars

LangChain is the open-source agent engineering platform that unifies model IO, tool calling, RAG, memory and observability under one composable framework.

agent-framework, rag, orchestration, llm

llama.cpp

118.8k Stars

llama.cpp is a lightweight C/C++ inference engine that runs a wide range of open-source large language models efficiently on consumer hardware.

llm-inference, llama, gguf, cpp

Awesome LLM Apps

116.2k Stars

100+ AI Agent and RAG apps you can actually run — clone, customize, and ship. A great reference for quickly building LLM-powered applications.

agent, rag, llm, python

Supabase Vector

105.0k Stars

Supabase's built-in pgvector search, turning Postgres into a RAG database.

vector-db, postgres, rag, supabase

vLLM

84.9k Stars

A high-throughput and memory-efficient inference and serving engine for LLMs, featuring PagedAttention, continuous batching, and optimized KV cache management for production deployments.

llm, python, framework, api

RAGFlow

84.0k Stars

A leading open-source RAG engine that fuses cutting-edge retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.

rag, document-understanding, knowledge-base, retrieval

Prompt Engineering Guide

76.1k Stars

Comprehensive guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

prompt-engineering, rag, agent, llm

MinerU

72.6k Stars

Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.

data-processing, rag, python, llm

Hello Agents

63.0k Stars

A comprehensive tutorial on AI agent principles and practice, systematically covering core concepts, framework usage and hands-on projects.

agent, python, framework, rag

Pathway

62.8k Stars

Pathway is a Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG applications.

etl, streaming, rag, real-time

Docling

62.4k Stars

Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.

document-parsing, pdf, rag, python

TrendRadar

60.1k Stars

AI-driven public opinion and trend monitor with multi-platform aggregation, RSS subscriptions, smart keyword filtering, AI-powered news analysis and briefings, supporting MCP integration and push notifications via WeChat, Feishu, DingTalk, Telegram and more.

automation, llm, python, tools

Embedchain

59.8k Stars

Embedchain is a universal memory layer for AI agents, enabling quick integration of diverse data sources into LLMs for context-aware AI applications.

memory, rag, embeddings, agent-tools

Mem0

59.8k Stars

Mem0 is a long-term memory layer for AI agents, supporting cross-session memory management and personalized context retrieval.

memory, rag, personalization, agent

Pathway LLM App

59.2k Stars

Ready-to-run cloud templates for RAG, AI pipelines and enterprise search with live data, always in sync with Sharepoint, Google Drive, S3, Kafka and more.

rag, python, data-processing, framework

Context7

58.4k Stars

Context7 is Upstash's MCP server that supplies up-to-date, version-specific library documentation and code examples to LLMs and AI coding tools, so retrieved context reflects current APIs.

context, memory, retrieval, typescript

codegraph

56.4k Stars

CodeGraph is a context graph for coding agents, mapping how a codebase is wired together so LLM-driven tools can navigate dependencies and produce more accurate edits.

code-intelligence, mcp, rag, local-first

Daily Stock Analysis

52.5k Stars

LLM-powered stock analysis system for A/H/US markets with multi-source quotes, real-time news, LLM decision dashboard and multi-channel push notifications.

agent, python, llm, data-processing

LlamaIndex

50.5k Stars

Leading data framework for LLM applications, with unified RAG, Agent, and Workflow capabilities.

rag, llm, agent-framework, data

memory, memory systems, long-term memory

Agent Memory Architecture: Working, Long-term, and Shared Memory Trade-offs

A systematic comparison of the three categories of agent memory (working, long-term, and shared), covering storage media, lifecycle, retrieval methods, typical frameworks, and design patterns, with a focus on the engineering behind agent personalization and multi-agent collaboration.

AI agent, memory systems, vector retrieval

Designing Agent Memory Systems: From Short-Term Context to Persistent Knowledge

A deep dive into the four-layer agent memory architecture, with practical code for vector retrieval and memory compression to help you build scalable long-term memory systems.

small-language-models, edge-inference, fine-tuning

Agent Small-Model Fine-Tuning and Edge Inference

Explores how small language models are fine-tuned and deployed for agent workloads at the edge, balancing latency, cost, and accuracy for production AI agents.