Zerox
Active
Description
OCR and document extraction tool using vision models, efficiently converting PDFs and images into structured text.
A web scraping and browser automation library for Node.js to build reliable crawlers, supporting Puppeteer, Playwright, Cheerio, and raw HTTP. Extract data for AI, LLMs, RAG, or GPTs with proxy rotation and both headful and headless modes.
Transforms complex documents such as PDFs into LLM-ready Markdown/JSON for agentic workflows, with support for layout analysis, formula recognition, and table extraction.
An AI-powered answering engine with multi-model integration, web search and local knowledge base, providing a Perplexity-like search experience.
AI-powered translation of scientific PDF papers that preserves the original formatting, supporting Google, DeepL, Ollama, and OpenAI services via CLI, GUI, MCP, Docker, or Zotero.