ETL Frameworks
Python Data Loading Library
★ 4.5
Data Manipulation & Analysis Library
★ 4.9
pip install dltpip install pandaspip install dltpip install pandasPython data engineers use dlt to replace hand-written ingestion scripts. You decorate a Python generator function as a `@dlt.source`, define resources with `@dlt.resource`, and call `pipeline.run()` — dlt handles schema creation, type casting, incremental state, and writing to your destination warehouse automatically.
Pandas is the go-to tool for data wrangling in Python pipelines. Engineers use DataFrames to load raw data from CSVs or databases, clean and transform it (renaming columns, filtering rows, filling nulls), then write results to Parquet or a data warehouse. It is the standard intermediate layer between data ingestion and downstream processing.
Individual Tool Pages