Data Governance & Metadata
Data Discovery & Metadata Engine
★ 4.5
Metadata Service for Data Lineage
★ 4.3
pip install amundsen-commonpip install marquez-clientpip install amundsen-commonpip install marquez-clientPython data engineers use Amundsen's databuilder library to write custom extractor jobs that pull metadata from internal databases and push it to Amundsen's index. Engineers also use the Amundsen API to programmatically tag datasets with ownership, freshness SLAs, and quality tier labels that the search UI surfaces to data consumers.
Python data engineers integrate Marquez with Airflow using the `openlineage-airflow` package, which automatically emits lineage events for each task — capturing which datasets a task reads and writes without any code changes. Engineers query the Marquez API to build impact analysis tools that identify downstream jobs affected by an upstream schema change.
Data Governance & Metadata
Amundsen vs Apache Atlas
Data Governance & Metadata
Apache Atlas vs CKAN
Data Governance & Metadata
Apache Atlas vs Marquez
Data Governance & Metadata
Apache Atlas vs DataHub
Data Governance & Metadata
Apache Atlas vs Collibra
Data Governance & Metadata
Apache Atlas vs Apache Gravitino
Individual Tool Pages