Best RAG Tools Top 20
Top 20 most popular open-source RAG Tools projects, ranked by GitHub Stars.
Firecrawl
142.2k StarsFirecrawl is the Web Data API for AI, turning web pages into clean, structured, LLM-friendly data with crawl, scrape, and search capabilities.
LangChain
140.6k StarsLangChain is the open-source agent engineering platform that unifies model IO, tool calling, RAG, memory and observability under one composable framework.
llama.cpp
118.8k Starsllama.cpp is a lightweight C/C++ inference engine that runs a wide range of open-source large language models efficiently on consumer hardware.
Awesome LLM Apps
116.2k Stars100+ AI Agent and RAG apps you can actually run — clone, customize, and ship. A great reference for quickly building LLM-powered applications.
Supabase Vector
105.0k StarsSupabase's built-in pgvector search, turning Postgres into a RAG database.
vLLM
84.9k StarsA high-throughput and memory-efficient inference and serving engine for LLMs, featuring PagedAttention, continuous batching, and optimized KV cache management for production deployments.
RAGFlow
84.0k StarsA leading open-source RAG engine that fuses cutting-edge retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.
Prompt Engineering Guide
76.1k StarsComprehensive guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
MinerU
72.6k StarsTransforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.
Hello Agents
63.0k StarsA comprehensive tutorial on AI agent principles and practice, systematically covering core concepts, framework usage and hands-on projects.
Pathway
62.8k StarsPathway is a Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG applications.
Docling
62.4k StarsDocling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.
TrendRadar
60.1k StarsAI-driven public opinion and trend monitor with multi-platform aggregation, RSS subscriptions, smart keyword filtering, AI-powered news analysis and briefings, supporting MCP integration and push notifications via WeChat, Feishu, DingTalk, Telegram and more.
Embedchain
59.8k StarsEmbedchain is a universal memory layer for AI agents, enabling quick integration of diverse data sources into LLMs for context-aware AI applications.
Mem0
59.8k StarsMem0 is a long-term memory layer for AI agents, supporting cross-session memory management and personalized context retrieval.
Pathway LLM App
59.2k StarsReady-to-run cloud templates for RAG, AI pipelines and enterprise search with live data, always in sync with Sharepoint, Google Drive, S3, Kafka and more.
Context7
58.4k StarsContext7 is Upstash's context-engineering toolkit for agents, helping applications manage long context windows, retrieval injection, and history compression.
codegraph
56.4k StarsCodeGraph is a context graph for coding agents, mapping how a codebase is wired together so LLM-driven tools can navigate dependencies and produce more accurate edits.
Daily Stock Analysis
52.5k StarsLLM-powered stock analysis system for A/H/US markets with multi-source quotes, real-time news, LLM decision dashboard and multi-channel push notifications.
LlamaIndex
50.5k StarsLeading data framework for LLM applications, with unified RAG, Agent, and Workflow capabilities.
Related Articles
Agent Memory Architecture: Working, Long-term, and Shared Memory Trade-offs
A systematic comparison of the three categories of agent memory -- working, long-term, and shared -- covering storage media, lifecycle, retrieval methods, typical frameworks, and design patterns, fully addressing agent personalization and multi-agent collaboration engineering.
Designing Agent Memory Systems: From Short-Term Context to Persistent Knowledge
A deep dive into the four-layer agent memory architecture, with practical code for vector retrieval and memory compression to help you build scalable long-term memory systems.
Agent Small-Model Finetuning and Edge Inference
Exploring how small language models are fine-tuned and deployed for agent workloads at the edge, balancing latency, cost, and accuracy for production AI agents.