Data Governance & Metadata
Enterprise Data Governance
★ 4.2
Open Data Management System
★ 4.1
pip install apache-atlaspip install ckanapipip install apache-atlaspip install ckanapiPython data engineers integrate with Apache Atlas via its REST API to register custom data assets, query lineage graphs, and enforce data classification policies. Post-ingestion scripts tag newly created tables with PII labels, and lineage queries trace how specific columns flow from source systems through transformations to the final warehouse tables.
Python data engineers use the `ckanapi` library to programmatically harvest open datasets from CKAN portals — listing available datasets, downloading CSV or JSON resources, and ingesting them into internal pipelines. Government open data platforms (data.gov, data.gov.uk) run on CKAN, making it the standard entry point for public data ingestion workflows.
Data Governance & Metadata
Amundsen vs Apache Atlas
Data Governance & Metadata
Apache Atlas vs Marquez
Data Governance & Metadata
Apache Atlas vs DataHub
Data Governance & Metadata
Apache Atlas vs Collibra
Data Governance & Metadata
Apache Atlas vs Apache Gravitino
Data Governance & Metadata
Apache Atlas vs PACE
Individual Tool Pages