AgentList
HomeProjectsArticlesAbout
Explore Projects
HomeProjectsArticlesAbout
Explore Projects
Projects Docling

Docling

Active
GitHub Python MIT

Description

Docling is an open-source document processing tool by IBM that converts PDF, Word, PPT, HTML and more into structured data for AI, purpose-built for GenAI and RAG pipelines.

Tags

document-parsing pdf rag python

Categories

📚 RAG Tools
Visit GitHub Visit Website

Project Metrics

Stars 60.9k
Forks 4.2k
Watchers 60.9k
Issues 929
Created July 9, 2024
Last commit June 2, 2026

Deployment

Local

Related Projects

RAGatouille

3.9k · Python
Stale

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

ragpythonembedding +1

Awesome AI Apps

12.6k · Python
Active

A collection of projects showcasing RAG, agents, workflows, and other AI use cases with practical examples and tutorials.

agentragworkflow +1

MemAgent

1.1k · Python
Active

A MemAgent framework that can extrapolate to 3.5M context tokens, along with a training framework for RL training of any agent workflow.

memoryagentrag +2

LightRAG

36.1k · Python
Active

LightRAG is a simple and fast Retrieval-Augmented Generation framework using graph-enhanced retrieval, published at EMNLP 2025.

raggraphretrieval +2
AgentList

The most comprehensive directory of open-source AI Agent projects. Discover and compare top Agent frameworks like LangChain, CrewAI, and more.

Quick Links

  • Project List
  • Featured Articles
  • Browse Categories

Contact

  • About
  • Privacy Policy
  • Contact Us

© 2026 AgentList. All rights reserved.

Made with for the open source community