📚

Best RAG Tools Top 20

Top 20 most popular open-source RAG Tools projects, ranked by GitHub Stars.

1

Firecrawl

142.2k Stars

Firecrawl is the Web Data API for AI, turning web pages into clean, structured, LLM-friendly data with crawl, scrape, and search capabilities.

web-scrapingmcpragdata-extraction
2

LangChain

140.6k Stars

LangChain is the open-source agent engineering platform that unifies model IO, tool calling, RAG, memory and observability under one composable framework.

agent-frameworkragorchestrationllm
3

llama.cpp

118.8k Stars

llama.cpp is a lightweight C/C++ inference engine that runs a wide range of open-source large language models efficiently on consumer hardware.

llm-inferencellamaggufcpp
4

Awesome LLM Apps

116.2k Stars

100+ AI Agent and RAG apps you can actually run — clone, customize, and ship. A great reference for quickly building LLM-powered applications.

agentragllmpython
5

Supabase Vector

105.0k Stars

Supabase's built-in pgvector search, turning Postgres into a RAG database.

vector-dbpostgresragsupabase
6

vLLM

84.9k Stars

A high-throughput and memory-efficient inference and serving engine for LLMs, featuring PagedAttention, continuous batching, and optimized KV cache management for production deployments.

llmpythonframeworkapi
7

RAGFlow

84.0k Stars

A leading open-source RAG engine that fuses cutting-edge retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.

ragdocument-understandingknowledge-baseretrieval
8

Prompt Engineering Guide

76.1k Stars

Comprehensive guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

prompt-engineeringragagentllm
9

MinerU

72.6k Stars

Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.

data-processingragpythonllm
10

Hello Agents

63.0k Stars

A comprehensive tutorial on AI agent principles and practice, systematically covering core concepts, framework usage and hands-on projects.

agentpythonframeworkrag
11

Pathway

62.8k Stars

Pathway is a Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG applications.

etlstreamingragreal-time
12

Docling

62.4k Stars

Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.

document-parsingpdfragpython
13

TrendRadar

60.1k Stars

AI-driven public opinion and trend monitor with multi-platform aggregation, RSS subscriptions, smart keyword filtering, AI-powered news analysis and briefings, supporting MCP integration and push notifications via WeChat, Feishu, DingTalk, Telegram and more.

automationllmpythontools
14

Embedchain

59.8k Stars

Embedchain is a universal memory layer for AI agents, enabling quick integration of diverse data sources into LLMs for context-aware AI applications.

memoryragembeddingsagent-tools
15

Mem0

59.8k Stars

Mem0 is a long-term memory layer for AI agents, supporting cross-session memory management and personalized context retrieval.

memoryragpersonalizationagent
16

Pathway LLM App

59.2k Stars

Ready-to-run cloud templates for RAG, AI pipelines and enterprise search with live data, always in sync with Sharepoint, Google Drive, S3, Kafka and more.

ragpythondata-processingframework
17

Context7

58.4k Stars

Context7 is Upstash's context-engineering toolkit for agents, helping applications manage long context windows, retrieval injection, and history compression.

contextmemoryretrievaltypescript
18

codegraph

56.4k Stars

CodeGraph is a context graph for coding agents, mapping how a codebase is wired together so LLM-driven tools can navigate dependencies and produce more accurate edits.

code-intelligencemcpraglocal-first
19

Daily Stock Analysis

52.5k Stars

LLM-powered stock analysis system for A/H/US markets with multi-source quotes, real-time news, LLM decision dashboard and multi-channel push notifications.

agentpythonllmdata-processing
20

LlamaIndex

50.5k Stars

Leading data framework for LLM applications, with unified RAG, Agent, and Workflow capabilities.

ragllmagent-frameworkdata

Related Articles