Transforms complex documents such as PDFs into LLM-ready Markdown/JSON for agentic workflows, supporting layout analysis, formula recognition, and table extraction.

data-processing, rag, python +2

RAG Techniques

26.9k · Jupyter Notebook

Active

A comprehensive showcase of advanced Retrieval-Augmented Generation (RAG) techniques with detailed notebook tutorials and code examples, covering foundational to cutting-edge RAG implementations.

rag, python, prompt-engineering +1

PyMuPDF

Description

Tags

Categories

Related Projects

PDFMathTranslate

Docstrange

MinerU

RAG Techniques