Data Lake Management
Data Lake Bronze Layer Gateway
★ 3.7
Data Lakehouse Platform
★ 3.9
pip install flightpathN/A — web applicationpip install flightpathN/A — web applicationPython data engineers integrate FlightPath into Airflow-based pipelines to automatically capture data lineage for each DAG run. The lineage graph reveals which source tables feed each transformation and which downstream datasets depend on each pipeline output — enabling engineers to quickly assess the blast radius of schema changes before deploying them.
Python data engineers use Ilum to submit and manage PySpark jobs without managing Spark cluster infrastructure directly. Ilum's REST API enables Python orchestration tools like Airflow to trigger Spark jobs programmatically as pipeline steps. It is used in organisations that need a self-hosted alternative to managed services like AWS EMR or Databricks, providing a control plane for Spark workloads running on Kubernetes or bare metal.
Individual Tool Pages