OpenMetadata

Active
GitHub TypeScript Apache-2.0

Description

OpenMetadata is a unified metadata platform for data and AI, providing data asset discovery, lineage, governance, and agent context retrieval capabilities.

Key Features

  • Unified metadata model: Tables, columns, metrics, dashboards, pipelines, and ML models are managed in a single model
  • Automatic lineage: End-to-end data lineage across databases, BI tools, and orchestration platforms
  • Data quality SLA: Built-in data quality testing framework with rules, SLA alerts, and failure tracking
  • Data asset discovery: Full-text and column-level search so agents can find the right tables and columns as context
  • Collaboration and governance: Glossary, tags, classifications, RBAC, and audit logs
  • Agent / LLM integration: Exposes APIs and metadata embeddings that supply data catalog context to agents
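
The discovery and agent-integration features above can be illustrated with a minimal Python sketch. It assumes the REST search endpoint /api/v1/search/query with a table_search_index index, and an Elasticsearch-style response whose hits carry fullyQualifiedName, description, and columns fields; verify these against your server version. The helper names build_search_url and hits_to_context are hypothetical and not part of any SDK.

```python
from urllib.parse import urlencode


def build_search_url(base_url, query, index="table_search_index", size=5):
    """Build a search URL (endpoint path and index name are assumptions)."""
    params = urlencode({"q": query, "index": index, "from": 0, "size": size})
    return f"{base_url.rstrip('/')}/api/v1/search/query?{params}"


def hits_to_context(search_response):
    """Flatten an assumed Elasticsearch-style response into prompt-ready lines."""
    lines = []
    for hit in search_response.get("hits", {}).get("hits", []):
        src = hit.get("_source", {})
        name = src.get("fullyQualifiedName", "<unknown>")
        desc = src.get("description") or "no description"
        cols = ", ".join(c.get("name", "?") for c in src.get("columns", []))
        lines.append(f"{name}: {desc} [columns: {cols}]")
    return "\n".join(lines)


# Hypothetical response, shaped like the assumed search payload.
sample = {
    "hits": {
        "hits": [
            {
                "_source": {
                    "fullyQualifiedName": "svc.db.schema.orders",
                    "description": "Customer orders",
                    "columns": [{"name": "id"}, {"name": "total"}],
                }
            }
        ]
    }
}

url = build_search_url("http://localhost:8585/", "orders")
context = hits_to_context(sample)
```

An agent would fetch the URL with an authenticated HTTP client and pass the resulting context string to the LLM alongside the user question.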

Use Cases

💡 Provide a unified data asset retrieval layer for RAG agents so LLMs can find the right business tables and column definitions
💡 Track cross-team data lineage to locate the source of data errors in agent decision chains
💡 Govern LLM training data and feature stores for compliance and traceability
💡 Centralize metrics and dashboards so agents reference consistent metric definitions when answering business questions
💡 Use metadata APIs to let AI workflows access structured data context and descriptions
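
For the lineage use case, a rough sketch of locating the original upstream sources of a suspect table is given here. It assumes a lineage payload with entity, nodes, and upstreamEdges keys, where each edge holds fromEntity and toEntity IDs; this shape is an assumption about the lineage API response and may differ between versions. The function upstream_roots and the sample data are hypothetical.

```python
def upstream_roots(lineage):
    """Return the names of upstream nodes that have no further parents."""
    entity = lineage["entity"]
    nodes = {n["id"]: n for n in lineage.get("nodes", [])}
    nodes[entity["id"]] = entity

    # Map each node ID to the IDs of its direct upstream parents.
    parents = {}
    for edge in lineage.get("upstreamEdges", []):
        parents.setdefault(edge["toEntity"], []).append(edge["fromEntity"])

    roots, seen, stack = [], set(), [entity["id"]]
    while stack:
        current = stack.pop()
        if current in seen:
            continue
        seen.add(current)
        ups = parents.get(current, [])
        if not ups and current != entity["id"]:
            # No parents: this node is an original source.
            roots.append(nodes.get(current, {}).get("fullyQualifiedName", current))
        stack.extend(ups)
    return sorted(roots)


# Hypothetical lineage: raw.a -> staging.b -> mart.c, plus raw.d -> mart.c
sample_lineage = {
    "entity": {"id": "c", "fullyQualifiedName": "svc.db.mart.c"},
    "nodes": [
        {"id": "a", "fullyQualifiedName": "svc.db.raw.a"},
        {"id": "b", "fullyQualifiedName": "svc.db.staging.b"},
        {"id": "d", "fullyQualifiedName": "svc.db.raw.d"},
    ],
    "upstreamEdges": [
        {"fromEntity": "a", "toEntity": "b"},
        {"fromEntity": "b", "toEntity": "c"},
        {"fromEntity": "d", "toEntity": "c"},
    ],
}

roots = upstream_roots(sample_lineage)
```

When an agent's answer traces back to mart.c, the root list narrows the investigation to the raw tables that feed it.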

Quick Start

git clone https://github.com/open-metadata/OpenMetadata
cd OpenMetadata
docker compose -f docker/docker-compose-quickstart/docker-compose.yml up -d
# Visit http://localhost:8585 and log in with admin@open-metadata.org / admin
# Install the Python SDK used to ingest metadata entities
pip install openmetadata-ingestion
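
Once the quickstart server is running, metadata can also be read over plain HTTP. This is a minimal sketch using only the Python standard library rather than the SDK. It assumes the endpoint /api/v1/tables/name/{fqn}, a fields query parameter, and bearer-token authentication with a JWT copied from the UI. The function table_request, the token value, and the table name are placeholders.

```python
from urllib.parse import quote
from urllib.request import Request


def table_request(base_url, fqn, jwt_token, fields="columns,tags"):
    """Build (but do not send) a GET request for one table by fully qualified name."""
    url = (
        f"{base_url.rstrip('/')}/api/v1/tables/name/{quote(fqn, safe='')}"
        f"?fields={quote(fields, safe=',')}"
    )
    headers = {
        "Authorization": f"Bearer {jwt_token}",  # token value is a placeholder
        "Accept": "application/json",
    }
    return Request(url, headers=headers)


req = table_request(
    "http://localhost:8585",
    "sample_data.ecommerce_db.shopify.dim_address",  # illustrative table name
    "YOUR_JWT_TOKEN",
)
```

Passing req to urllib.request.urlopen sends the call; that step needs the running server and a valid token, so it is left out of the sketch.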

Related Projects