Data Governance & Metadata
Data Discovery & Metadata Engine
★ 4.5
Modern Metadata Platform
★ 4.6
pip install amundsen-commonpip install acryl-datahubpip install amundsen-commonpip install acryl-datahubPython data engineers use Amundsen's databuilder library to write custom extractor jobs that pull metadata from internal databases and push it to Amundsen's index. Engineers also use the Amundsen API to programmatically tag datasets with ownership, freshness SLAs, and quality tier labels that the search UI surfaces to data consumers.
Python data engineers use DataHub's Python SDK and ingestion framework to crawl metadata from databases, dbt projects, and Airflow — writing YAML recipe files that the `datahub` CLI ingests on a schedule. Custom Python emitters push metadata about internal pipeline assets that built-in connectors don't cover.
Data Governance & Metadata
Amundsen vs Apache Atlas
Data Governance & Metadata
Apache Atlas vs CKAN
Data Governance & Metadata
Apache Atlas vs Marquez
Data Governance & Metadata
Apache Atlas vs DataHub
Data Governance & Metadata
Apache Atlas vs Collibra
Data Governance & Metadata
Apache Atlas vs Apache Gravitino
Individual Tool Pages