Data Lake Management
Data Lake Bronze Layer Gateway
★ 3.7
Git-Like Data Lake Versioning
★ 4.5
pip install flightpath
pip install lakefs
Python data engineers integrate FlightPath into Airflow-based pipelines to automatically capture data lineage for each DAG run. The lineage graph reveals which source tables feed each transformation and which downstream datasets depend on each pipeline output, so engineers can quickly assess the blast radius of a schema change before deploying it.
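To make the blast-radius idea concrete, here is a minimal sketch of a downstream traversal over a lineage graph. It does not use the FlightPath API; the `LINEAGE` mapping, the dataset names, and the `blast_radius` helper are all hypothetical and stand in for whatever graph a lineage tool exposes.

```python
from collections import deque

# Hypothetical lineage edges: each dataset maps to the datasets derived from it.
# A real lineage tool would supply this graph; the names here are made up.
LINEAGE = {
    "raw.orders": ["staging.orders"],
    "raw.customers": ["staging.customers"],
    "staging.orders": ["marts.revenue", "marts.order_facts"],
    "staging.customers": ["marts.order_facts"],
    "marts.order_facts": ["reports.weekly_kpis"],
}


def blast_radius(lineage, changed):
    """Return every dataset reachable downstream of `changed` (breadth-first)."""
    seen = set()
    queue = deque([changed])
    while queue:
        node = queue.popleft()
        for child in lineage.get(node, []):
            if child not in seen:
                seen.add(child)
                queue.append(child)
    return seen


if __name__ == "__main__":
    # Datasets that a schema change to staging.orders could affect.
    print(sorted(blast_radius(LINEAGE, "staging.orders")))
```

A breadth-first walk is enough here because the question is only which datasets are reachable, not in what order they must be rebuilt.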
Python data engineers use lakeFS to apply software engineering practices to data lake management. A pipeline writes to a lakeFS branch, data quality tests run against that branch, and the Python SDK merges the branch into main only if the tests pass. This keeps bad pipeline outputs from reaching production consumers, the same guarantee that Git branches provide for code changes.
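The write, test, and merge-on-success flow described above can be sketched as follows. This is an illustration of the pattern only: `InMemoryRepo` is a hypothetical in-memory stand-in, not the lakeFS SDK, and `run_gated_pipeline` and `no_negative_amounts` are assumed helper names. In a real setup the branch, write, and merge steps would go through the lakeFS client instead.

```python
class InMemoryRepo:
    """Hypothetical stand-in for a versioned data lake (NOT the lakeFS API)."""

    def __init__(self):
        self.branches = {"main": {}}

    def create_branch(self, name, source="main"):
        # A new branch starts as a snapshot of its source branch.
        self.branches[name] = dict(self.branches[source])

    def write(self, branch, path, rows):
        self.branches[branch][path] = rows

    def read(self, branch, path):
        return self.branches[branch][path]

    def merge(self, source, dest="main"):
        # Simplified merge: the source branch's objects overwrite the destination's.
        self.branches[dest].update(self.branches[source])


def no_negative_amounts(repo, branch):
    """Example data quality test: every order amount must be non-negative."""
    return all(row["amount"] >= 0 for row in repo.read(branch, "orders"))


def run_gated_pipeline(repo, branch, pipeline, quality_check):
    """Run `pipeline` on an isolated branch; merge into main only if the check passes."""
    repo.create_branch(branch, source="main")
    pipeline(repo, branch)
    if quality_check(repo, branch):
        repo.merge(branch, dest="main")
        return True
    return False  # the failed branch is left unmerged for inspection


if __name__ == "__main__":
    repo = InMemoryRepo()
    good = lambda r, b: r.write(b, "orders", [{"id": 1, "amount": 10}])
    bad = lambda r, b: r.write(b, "orders", [{"id": 2, "amount": -5}])
    print(run_gated_pipeline(repo, "etl-good", good, no_negative_amounts))
    print(run_gated_pipeline(repo, "etl-bad", bad, no_negative_amounts))
    print(repo.read("main", "orders"))
```

Leaving a failed branch unmerged rather than deleting it is a deliberate choice in this sketch: the rejected output stays available for debugging while main remains untouched.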
Individual Tool Pages