AgentList
HomeProjectsArticlesAbout
Explore Projects
HomeProjectsArticlesAbout
Explore Projects
Home / Projects / Docling

Docling

Active
GitHub Python MIT

Description

Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.

Tags

document-parsing pdf rag python

Categories

📚 RAG Tools
Visit GitHub Visit Website

Project Metrics

Stars 58.1k
Forks 4.0k
Watchers 58.1k
Issues 868
Created July 9, 2024
Last commit April 17, 2026

Deployment

Local

Related Projects

RAGatouille

3.9k · Python
Stale

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

ragpythonembedding +1

Awesome AI Apps

10.2k · Python
Active

A collection of projects showcasing RAG, agents, workflows, and other AI use cases with practical examples and tutorials.

agentragworkflow +1

MemAgent

1.0k · Python
Stale

A MemAgent framework that can extrapolate to 3.5M context tokens, along with a training framework for RL training of any agent workflow.

memoryagentrag +2

LightRAG

33.6k · Python
Active

LightRAG is a simple and fast Retrieval-Augmented Generation framework using graph-enhanced retrieval, published at EMNLP 2025.

raggraphretrieval +2
AgentList

Curated directory of open-source AI agent projects

Quick Links

  • Project List
  • Featured Articles
  • Browse Categories

Contact

  • About
  • Privacy Policy
  • Contact Us

© 2026 AgentList. All rights reserved.

Made with for the open source community