Data Governance & Metadata
Unified Metadata Management
★ 4.0
Metadata Service for Data Lineage
★ 4.3
pip install apache-gravitinopip install marquez-clientpip install apache-gravitinopip install marquez-clientPython data engineers use Gravitino's REST API to register and discover table schemas centrally when working across multiple compute engines — registering an Iceberg table in Gravitino makes it discoverable to Spark, Trino, and Flink without duplicating schema definitions. Python scripts automate schema registration after new pipeline outputs are created.
Python data engineers integrate Marquez with Airflow using the `openlineage-airflow` package, which automatically emits lineage events for each task — capturing which datasets a task reads and writes without any code changes. Engineers query the Marquez API to build impact analysis tools that identify downstream jobs affected by an upstream schema change.
Data Governance & Metadata
Amundsen vs Apache Atlas
Data Governance & Metadata
Apache Atlas vs CKAN
Data Governance & Metadata
Apache Atlas vs Marquez
Data Governance & Metadata
Apache Atlas vs DataHub
Data Governance & Metadata
Apache Atlas vs Collibra
Data Governance & Metadata
Apache Atlas vs Apache Gravitino
Individual Tool Pages