AgentList

MinerU

Active
GitHub Python NOASSERTION

Overview

Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.

Tags

data-processing rag python llm tools

Category

📚 RAG Tools

Project metrics

Stars 60.3k
Forks 5.0k
Watchers 60.3k
Issues 79
Created February 29, 2024
Last commit April 17, 2026

Deployment

Local deployment

Related projects

PDFMathTranslate

33.2k · Python
Active

AI-powered PDF scientific paper translation with preserved formats, supporting Google/DeepL/Ollama/OpenAI services via CLI/GUI/MCP/Docker/Zotero.

rag python tools +2

Quivr

39.1k · Python
Inactive

Opinionated RAG framework for integrating GenAI into your apps. Works with any LLM, any vectorstore, any files — so you can focus on your product instead of building RAG pipelines.

rag python vector-database +3

Unstract

6.5k · Python
Active

LLM-driven extraction of unstructured data, built for API deployments and ETL pipeline workflows. Automates document parsing, PDF extraction, and intelligent data processing with LLM-powered intelligence.

data-processing rag python +3

document.ai

3.7k · Python
Inactive

A universal local knowledge base solution based on vector databases and GPT, providing one-stop document processing with vectorization, semantic search, and intelligent Q&A for building private knowledge bases.

rag vector-database python +2

© 2026 AgentList. All rights reserved.
