Data Governance & Metadata
Modern Metadata Platform
★ 4.6
Metadata Service for Data Lineage
★ 4.3
pip install acryl-datahubpip install marquez-clientpip install acryl-datahubpip install marquez-clientPython data engineers use DataHub's Python SDK and ingestion framework to crawl metadata from databases, dbt projects, and Airflow — writing YAML recipe files that the `datahub` CLI ingests on a schedule. Custom Python emitters push metadata about internal pipeline assets that built-in connectors don't cover.
Python data engineers integrate Marquez with Airflow using the `openlineage-airflow` package, which automatically emits lineage events for each task — capturing which datasets a task reads and writes without any code changes. Engineers query the Marquez API to build impact analysis tools that identify downstream jobs affected by an upstream schema change.
Data Governance & Metadata
Amundsen vs Apache Atlas
Data Governance & Metadata
Apache Atlas vs CKAN
Data Governance & Metadata
Apache Atlas vs Marquez
Data Governance & Metadata
Apache Atlas vs DataHub
Data Governance & Metadata
Apache Atlas vs Collibra
Data Governance & Metadata
Apache Atlas vs Apache Gravitino
Individual Tool Pages