📚

Best RAG Tools Top 20

Top 20 most popular open-source RAG Tools projects, ranked by GitHub Stars.

1

Firecrawl

118.5k Stars

Firecrawl is the Web Data API for AI, turning web pages into clean, structured, LLM-friendly data with crawl, scrape, and search capabilities.

web-scrapingmcpragdata-extraction
2

Awesome LLM Apps

109.8k Stars

100+ AI Agent and RAG apps you can actually run — clone, customize, and ship. A great reference for quickly building LLM-powered applications.

agentragllmpython
3

RAGFlow

80.3k Stars

A leading open-source RAG engine that fuses cutting-edge retrieval-augmented generation with agent capabilities to create a superior context layer for LLMs.

ragdocument-understandingknowledge-baseretrieval
4

vLLM

79.7k Stars

A high-throughput and memory-efficient inference and serving engine for LLMs, featuring PagedAttention, continuous batching, and optimized KV cache management for production deployments.

llmpythonframeworkapi
5

Prompt Engineering Guide

74.4k Stars

Comprehensive guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

prompt-engineeringragagentllm
6

Pathway

63.3k Stars

Pathway is a Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG applications.

etlstreamingragreal-time
7

MinerU

62.7k Stars

Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.

data-processingragpythonllm
8

Pathway LLM App

59.8k Stars

Ready-to-run cloud templates for RAG, AI pipelines and enterprise search with live data, always in sync with Sharepoint, Google Drive, S3, Kafka and more.

ragpythondata-processingframework
9

Docling

59.6k Stars

Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.

document-parsingpdfragpython
10

TrendRadar

57.3k Stars

AI-driven public opinion and trend monitor with multi-platform aggregation, RSS subscriptions, smart keyword filtering, AI-powered news analysis and briefings, supporting MCP integration and push notifications via WeChat, Feishu, DingTalk, Telegram and more.

automationllmpythontools
11

Embedchain

55.4k Stars

Embedchain is a universal memory layer for AI agents, enabling quick integration of diverse data sources into LLMs for context-aware AI applications.

memoryragembeddingsagent-tools
12

Mem0

55.4k Stars

Mem0 is a long-term memory layer for AI agents, supporting cross-session memory management and personalized context retrieval.

memoryragpersonalizationagent
13

Context7

55.0k Stars

Context7 is Upstash's context-engineering toolkit for agents, helping applications manage long context windows, retrieval injection, and history compression.

contextmemoryretrievaltypescript
14

LlamaIndex

49.3k Stars

LlamaIndex is a data framework that provides the data connection layer for LLM applications, with strong RAG capabilities across diverse data sources and vector databases.

ragllmindexingpython
15

Hello Agents

47.6k Stars

A comprehensive tutorial on AI agent principles and practice, systematically covering core concepts, framework usage and hands-on projects.

agentpythonframeworkrag
16

LocalAI

46.2k Stars

Open-source AI engine to run any model — LLMs, vision, voice, image, video — on any hardware without GPU. Provides OpenAI-compatible API for fully local, privacy-first AI inference.

llmapilocalrag
17

Milvus

44.2k Stars

Milvus is a high-performance open-source vector database built for AI applications. It supports storage, indexing, and similarity search of large-scale vector data, ideal for RAG, recommendation systems, and more.

vector-databaseragsimilarity-searchgo
18

Quivr

39.1k Stars

Opinionated RAG framework for integrating GenAI into your apps. Works with any LLM, any vectorstore, any files — so you can focus on your product instead of building RAG pipelines.

ragpythonvector-databasellm
19

Langchain-Chatchat

38.0k Stars

A local knowledge base RAG and Agent application platform built on Langchain with support for ChatGLM, Qwen, Llama and other LLMs, offering conversation, knowledge base management, and agent capabilities.

ragllmpythonchatbot
20

LangExtract

36.4k Stars

A Python library by Google for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization, designed for data annotation and knowledge extraction workflows.

data-processingllmpythontools

Related Articles