Orchestration Tools
Workflow Orchestration Platform
★ 4.8
Data Flow Automation
★ 4.5
Install Airflow: pip install apache-airflow
Install the NiFi Python client: pip install nipyapi
Python data engineers define pipelines as Directed Acyclic Graphs (DAGs) using Airflow's Python SDK. DAGs are written as .py files that instantiate Operators: PythonOperator for custom logic, BashOperator for shell commands, and provider-specific operators for Postgres, S3, BigQuery, and Snowflake. Airflow is a widely used orchestrator for scheduling ETL jobs, managing dependencies between tasks, and handling retries in production data pipelines.
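To make "dependencies between tasks" and "retries" concrete, here is a minimal plain-Python sketch of those two ideas. It deliberately does not use Airflow's API: it relies on the standard-library graphlib module (Python 3.9+), and the function run_pipeline and the task names extract, transform, and load are hypothetical names chosen for illustration. In an actual DAG file the same structure would be expressed with Operators rather than bare callables.

```python
from graphlib import TopologicalSorter


def run_pipeline(tasks, upstream, retries=1):
    """Run callables in dependency order, retrying each failed task.

    tasks:    {task_id: zero-argument callable}
    upstream: {task_id: set of task_ids that must finish first}
    retries:  extra attempts allowed per task after the first failure
    Returns the task_ids in the order they completed.
    """
    completed = []
    # static_order() raises graphlib.CycleError if the graph has a cycle,
    # which is the "acyclic" requirement of a DAG.
    for task_id in TopologicalSorter(upstream).static_order():
        for attempt in range(retries + 1):
            try:
                tasks[task_id]()
                break
            except Exception:
                if attempt == retries:
                    raise  # retries exhausted: fail the whole run
        completed.append(task_id)
    return completed


# Hypothetical three-step ETL; "load" fails once, then succeeds on retry.
calls = {"load": 0}


def load():
    calls["load"] += 1
    if calls["load"] == 1:
        raise RuntimeError("transient failure")


tasks = {"extract": lambda: None, "transform": lambda: None, "load": load}
upstream = {"extract": set(), "transform": {"extract"}, "load": {"transform"}}

order = run_pipeline(tasks, upstream, retries=1)
```

The sketch runs tasks sequentially in one process. A real orchestrator adds scheduling, parallel execution, and persisted task state on top of the same ordering and retry rules.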
Python data engineers integrate with NiFi by deploying ExecuteScript processors that run Python code for custom transformations within a visual flow. NiFi also exposes a REST API that Python scripts use to dynamically configure flows, monitor processor status, and retrieve provenance data for audit purposes.
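As a rough illustration of driving NiFi over its REST API from Python, the sketch below builds (but does not send) a request that changes a processor's run state via the processors/{id}/run-status endpoint. It uses only the standard library rather than nipyapi. The host nifi.example.com, the processor id abc-123, and the helper names are placeholders, and bearer-token authentication is an assumption about how the target instance is secured.

```python
import json
from urllib.request import Request


def processor_url(base_url, processor_id):
    """Return the REST URL for one processor under the /nifi-api root."""
    return f"{base_url.rstrip('/')}/processors/{processor_id}"


def run_status_request(base_url, processor_id, revision_version, state, token=None):
    """Build a PUT request that sets a processor to RUNNING or STOPPED.

    revision_version must match the processor's current revision, which a
    caller would normally read from a prior GET on processor_url(...).
    """
    body = {"revision": {"version": revision_version}, "state": state}
    headers = {"Content-Type": "application/json"}
    if token:
        # Assumption: the instance accepts a bearer token for authentication.
        headers["Authorization"] = f"Bearer {token}"
    return Request(
        url=processor_url(base_url, processor_id) + "/run-status",
        data=json.dumps(body).encode("utf-8"),
        headers=headers,
        method="PUT",
    )


# Placeholder host and processor id, for illustration only.
req = run_status_request(
    "https://nifi.example.com:8443/nifi-api/", "abc-123", 0, "STOPPED"
)
```

Passing req to urllib.request.urlopen would send it. Keeping request construction separate from sending makes the payload easy to inspect and to unit test without a running NiFi instance.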
Individual Tool Pages