Data Governance & Metadata
Enterprise Data Governance
★ 4.2
Modern Metadata Platform
★ 4.6
pip install apache-atlaspip install acryl-datahubpip install apache-atlaspip install acryl-datahubPython data engineers integrate with Apache Atlas via its REST API to register custom data assets, query lineage graphs, and enforce data classification policies. Post-ingestion scripts tag newly created tables with PII labels, and lineage queries trace how specific columns flow from source systems through transformations to the final warehouse tables.
Python data engineers use DataHub's Python SDK and ingestion framework to crawl metadata from databases, dbt projects, and Airflow — writing YAML recipe files that the `datahub` CLI ingests on a schedule. Custom Python emitters push metadata about internal pipeline assets that built-in connectors don't cover.
Individual Tool Pages