AgentList

Zerox

Stale
GitHub · TypeScript · MIT

Description

An OCR and document extraction tool that uses vision models to convert PDFs and images into structured text.

Tags

typescript rag tools data-processing llm

Categories

📚 RAG Tools

Project Metrics

Stars 12.2k
Forks 846
Watchers 12.2k
Issues 88
Created July 21, 2024
Last commit May 20, 2025

Deployment

Local

Related Projects

Crawlee

23.6k · TypeScript
Active

A web scraping and browser automation library for Node.js to build reliable crawlers, supporting Puppeteer, Playwright, Cheerio, and raw HTTP. Extract data for AI, LLMs, RAG, or GPTs with proxy rotation and both headful and headless modes.

typescript javascript data-processing +3

MinerU

66.2k · Python
Active

Transforms complex documents like PDFs into LLM-ready markdown/JSON for Agentic workflows, supporting layout analysis, formula recognition, and table extraction.

data-processing rag python +2

Scira

11.7k · TypeScript
Normal

A minimalistic AI-powered search engine that helps you find information on the internet and cites it too. Powered by Vercel AI SDK.

typescript llm rag +3

Vane

35.1k · TypeScript
Normal

An AI-powered answering engine with multi-model integration, web search and local knowledge base, providing a Perplexity-like search experience.

rag typescript llm +2

The most comprehensive directory of open-source AI Agent projects. Discover and compare top Agent frameworks like LangChain, CrewAI, and more.

© 2026 AgentList. All rights reserved.