Docling
ActiveDescription
Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.
Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
A collection of projects showcasing RAG, agents, workflows, and other AI use cases with practical examples and tutorials.
A MemAgent framework that can extrapolate to 3.5M context tokens, along with a training framework for RL training of any agent workflow.
LightRAG is a simple and fast Retrieval-Augmented Generation framework using graph-enhanced retrieval, published at EMNLP 2025.