Data Governance & Metadata
Open Data Management System
★ 4.1
Modern Metadata Platform
★ 4.6
pip install ckanapipip install acryl-datahubpip install ckanapipip install acryl-datahubPython data engineers use the `ckanapi` library to programmatically harvest open datasets from CKAN portals — listing available datasets, downloading CSV or JSON resources, and ingesting them into internal pipelines. Government open data platforms (data.gov, data.gov.uk) run on CKAN, making it the standard entry point for public data ingestion workflows.
Python data engineers use DataHub's Python SDK and ingestion framework to crawl metadata from databases, dbt projects, and Airflow — writing YAML recipe files that the `datahub` CLI ingests on a schedule. Custom Python emitters push metadata about internal pipeline assets that built-in connectors don't cover.
Data Governance & Metadata
Amundsen vs Apache Atlas
Data Governance & Metadata
Apache Atlas vs CKAN
Data Governance & Metadata
Apache Atlas vs Marquez
Data Governance & Metadata
Apache Atlas vs DataHub
Data Governance & Metadata
Apache Atlas vs Collibra
Data Governance & Metadata
Apache Atlas vs Apache Gravitino
Individual Tool Pages