ETL Frameworks
Transform Data in Your Warehouse
★ 4.9
Data Manipulation & Analysis Library
★ 4.9
pip install dbt-corepip install pandaspip install dbt-corepip install pandasData engineers use dbt to manage all transformation logic inside the warehouse — writing SELECT statements as `.sql` model files that dbt compiles and runs in the right order. Python engineers also write custom dbt tests and macros in Python, and use dbt's Python models feature to run pandas or Spark logic alongside SQL in the same project.
Pandas is the go-to tool for data wrangling in Python pipelines. Engineers use DataFrames to load raw data from CSVs or databases, clean and transform it (renaming columns, filtering rows, filling nulls), then write results to Parquet or a data warehouse. It is the standard intermediate layer between data ingestion and downstream processing.
Individual Tool Pages