Parsr
活跃简介
Transforms PDF, documents and images into enriched structured data with table recognition, reading order restoration, and Markdown output.
Transforms PDF, documents and images into enriched structured data with table recognition, reading order restoration, and Markdown output.
A web scraping and browser automation library for Node.js to build reliable crawlers, supporting Puppeteer, Playwright, Cheerio, and raw HTTP. Extract data for AI, LLMs, RAG, or GPTs with proxy rotation and both headful and headless modes.
AI-powered PDF scientific paper translation with preserved formats, supporting Google/DeepL/Ollama/OpenAI services via CLI/GUI/MCP/Docker/Zotero.
LLM-driven extraction of unstructured data, built for API deployments and ETL pipeline workflows. Automates document parsing, PDF extraction, and intelligent data processing with LLM-powered intelligence.
OCR and document extraction tool using vision models, efficiently converting PDFs and images into structured text.