Unstructured
Status: Active
Unstructured provides document parsing and cleaning capabilities, commonly used in RAG ingestion and preprocessing pipelines.
Sparrow is a structured data extraction tool that supports instruction calling with ML, LLM, and Vision LLM backends to extract structured information from documents. It is suitable for document parsing in RAG pipelines.
Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease of use, backed by research.
A collection of projects showcasing RAG, agents, workflows, and other AI use cases with practical examples and tutorials.
A MemAgent framework that can extrapolate to 3.5M context tokens, along with a training framework for RL training of any agent workflow.