Orchestration Tools
Workflow Orchestration Platform
★ 4.8
Data Orchestrator for ML & Analytics
★ 4.7
pip install apache-airflow
pip install dagster
Python data engineers define pipelines as Directed Acyclic Graphs (DAGs) using Airflow's Python SDK. DAGs are written as .py files that instantiate Operators: PythonOperator for custom logic, BashOperator for shell commands, and provider-specific operators for Postgres, S3, BigQuery, and Snowflake. Airflow is a widely used orchestrator for scheduling ETL jobs, managing dependencies between tasks, and handling retries in production data pipelines.
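As a rough illustration of the DAG file described above, here is a minimal sketch. It assumes Airflow 2.4 or later (for the `schedule` argument and the `airflow.operators.*` import paths); the DAG id `example_etl`, the task ids, the sample rows, and the helper names `transform`, `run_transform`, and `build_dag` are hypothetical and not taken from any real pipeline.

```python
from datetime import datetime, timedelta


def transform(rows):
    """Hypothetical transform step: keep only rows with a positive amount."""
    return [row for row in rows if row["amount"] > 0]


def run_transform():
    # Stand-in for real extract/load logic; the sample rows are made up.
    sample = [{"id": 1, "amount": 10}, {"id": 2, "amount": -3}]
    return transform(sample)


def build_dag():
    # Airflow is imported lazily so the plain functions above can be
    # unit-tested on a machine without Airflow installed.
    from airflow import DAG
    from airflow.operators.bash import BashOperator
    from airflow.operators.python import PythonOperator

    with DAG(
        dag_id="example_etl",
        start_date=datetime(2024, 1, 1),
        schedule="@daily",   # run once per day
        catchup=False,       # do not backfill runs before today
        # Retry each failed task twice, waiting five minutes between attempts.
        default_args={"retries": 2, "retry_delay": timedelta(minutes=5)},
    ) as dag:
        extract = BashOperator(task_id="extract", bash_command="echo extracting")
        clean = PythonOperator(task_id="transform", python_callable=run_transform)
        # The shell task must finish before the Python task starts.
        extract >> clean
    return dag
```

In an actual DAG file you would likely assign `dag = build_dag()` at module level so the scheduler can discover it; the factory function is used here only to keep the task logic testable on its own.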
Dagster's asset-based model lets Python engineers define their ETL outputs as `@asset` functions; Dagster infers the dependency graph from each function's parameter names. Engineers materialize assets on demand or on a schedule, observe run history and lineage in the Dagster UI (formerly Dagit), and use Dagster's type system to catch schema mismatches between pipeline steps during development.
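The asset model above can be sketched as follows. This is a minimal example under stated assumptions: the asset names `raw_orders` and `cleaned_orders`, the helper `clean_orders`, the factory `build_assets`, and the sample rows are all hypothetical. Only `dagster.asset` is used from the library.

```python
def clean_orders(rows):
    """Hypothetical cleaning step: drop rows whose amount is missing."""
    return [row for row in rows if row["amount"] is not None]


def build_assets():
    # Dagster is imported lazily so clean_orders can be unit-tested
    # without Dagster installed.
    from dagster import asset

    @asset
    def raw_orders():
        # Stand-in for a real extract; the rows are made up.
        return [{"id": 1, "amount": 10}, {"id": 2, "amount": None}]

    @asset
    def cleaned_orders(raw_orders):
        # The parameter name matches the upstream asset's name, which is
        # how Dagster infers that cleaned_orders depends on raw_orders.
        return clean_orders(raw_orders)

    return [raw_orders, cleaned_orders]
```

In a typical project the assets would probably sit at module level and be registered with Dagster directly; the factory is used here only for testability. Passing the returned list to `dagster.materialize` should run both assets in-process, in dependency order.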
Individual Tool Pages