AgentList

document.ai

Stale
GitHub Python AGPL-3.0

Description

A general-purpose local knowledge base solution built on vector databases and GPT. It handles document processing end to end (vectorization, semantic search, and question answering) for building private knowledge bases.
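
The vectorize-then-search flow named in the description can be sketched as follows. This is a minimal illustration, not document.ai's actual code: the `embed`, `cosine`, and `search` functions and the sample documents are hypothetical, and a bag-of-words term count stands in for a real embedding model and vector database. The final step, passing the retrieved passages to GPT to generate an answer, is omitted.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words term-count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query: str, documents: list[str], top_k: int = 2) -> list[tuple[float, str]]:
    """Return the top_k (score, document) pairs ranked by similarity to the query."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, embed(doc)), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical private knowledge base entries.
documents = [
    "Refunds are issued within 14 days of purchase.",
    "The office is closed on public holidays.",
    "Support tickets are answered within one business day.",
]

# Retrieve the passage most relevant to the question.
results = search("How many days do refunds take?", documents, top_k=1)
```

In a real deployment the term-count vectors would presumably be replaced by dense embeddings stored in a vector database, so that queries match on meaning rather than on shared words.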

Tags

rag vector-database python llm tools

Categories

📚 RAG Tools

Project Metrics

Stars 3.7k
Forks 326
Watchers 3.7k
Issues 1
Created March 10, 2023
Last commit May 12, 2023

Deployment

Local

Related Projects

Quivr

39.1k · Python
Stale

Opinionated RAG framework for integrating GenAI into your apps. Works with any LLM, any vectorstore, any files — so you can focus on your product instead of building RAG pipelines.

rag python vector-database +3

PromptTools

3.0k · Python
Normal

PromptTools provides open-source tools for prompt testing and experimentation, supporting multiple LLMs (OpenAI, LLaMA) and vector databases (Chroma, Weaviate, LanceDB) to help developers systematically evaluate and optimize RAG systems.

prompt-testing rag evaluation +3

txtai

12.4k · Python
Active

All-in-one AI framework for semantic search, LLM orchestration, and language model workflows, with agent support, RAG, and a vector database.

semantic-search rag embeddings +4

MinerU

60.3k · Python
Active

Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.

data-processing rag python +2