ETL Frameworks
Lightweight ETL Framework
★ 4.2
Data Manipulation & Analysis Library
★ 4.9
pip install bonobopip install pandaspip install bonobopip install pandasBonobo is used for straightforward ETL scripts where simplicity matters. Engineers define a pipeline graph by connecting Python functions — one reads rows from a CSV, another cleans them, a third writes to a database — and Bonobo handles the plumbing, parallelism, and error propagation between nodes.
Pandas is the go-to tool for data wrangling in Python pipelines. Engineers use DataFrames to load raw data from CSVs or databases, clean and transform it (renaming columns, filtering rows, filling nulls), then write results to Parquet or a data warehouse. It is the standard intermediate layer between data ingestion and downstream processing.
Individual Tool Pages