Data Ingestion
AWS Data Utility Belt for Python
★ 4.3
Open Source Message Broker
★ 4.6
pip install awswrangler
pip install pika
AWS SDK for pandas (formerly AWS Data Wrangler, still installed and imported as `awswrangler`) is a widely used tool for AWS-native Python data pipelines. Engineers replace `boto3` + `pandas` boilerplate with single calls: `wr.s3.read_parquet('s3://bucket/prefix/')` reads all Parquet files under the prefix into one DataFrame, and `wr.s3.to_parquet(df, 's3://bucket/output/', dataset=True)` writes it back as a dataset. Passing `partition_cols` partitions the output, and passing `database` and `table` registers it in the Glue catalog.
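The read and write calls described above can be sketched as follows. This is a minimal illustration, not a reference implementation: the helper names `parquet_write_options` and `copy_dataset` are hypothetical, as are the bucket paths and any Glue database or table names passed in. It assumes AWS credentials are already configured and that the Glue database exists.

```python
from typing import Optional, Sequence


def parquet_write_options(
    path: str,
    partition_cols: Sequence[str],
    database: Optional[str] = None,
    table: Optional[str] = None,
) -> dict:
    """Build keyword arguments for wr.s3.to_parquet (hypothetical helper)."""
    if (database is None) != (table is None):
        # Glue registration needs both names, so reject a half-specified target.
        raise ValueError("pass both database and table, or neither")
    options = {
        "path": path,
        "dataset": True,  # dataset mode enables partitioning and catalog options
        "partition_cols": list(partition_cols),
        "mode": "overwrite_partitions",  # replace only the partitions being written
    }
    if database is not None:
        options["database"] = database
        options["table"] = table
    return options


def copy_dataset(source: str, target: str, partition_cols: Sequence[str],
                 database: Optional[str] = None, table: Optional[str] = None):
    """Read every Parquet file under `source` and write it back partitioned."""
    # Imported lazily so the option builder can be used without awswrangler.
    import awswrangler as wr

    df = wr.s3.read_parquet(path=source)
    wr.s3.to_parquet(
        df=df,
        **parquet_write_options(target, partition_cols, database, table),
    )
    return df
```

A call might look like `copy_dataset('s3://bucket/prefix/', 's3://bucket/output/', ['year'], database='analytics', table='events')`, where every argument value is a placeholder.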
Python data engineers use `pika` (synchronous) or `aio-pika` (asyncio) to connect pipelines to RabbitMQ. A common pattern is a Python producer that publishes enriched records to a topic exchange after transformation, with multiple consumer processes binding queues to routing key patterns for parallel downstream processing. Messages that a consumer rejects, or that expire, can be routed to a dead-letter exchange. Retry behavior is not built in, but it is commonly layered on top, for example with a retry queue that has a message TTL.
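A minimal sketch of the producer and consumer pattern with `pika` follows. The exchange names, queue name, routing key scheme, and helper functions are all assumptions made for illustration. The consumer callback rejects failed messages without requeueing, so the broker forwards them to the dead-letter exchange.

```python
import json

EXCHANGE = "enriched.records"          # hypothetical topic exchange
DEAD_LETTER_EXCHANGE = "enriched.dlx"  # hypothetical dead-letter exchange


def routing_key(record):
    """Build a key such as 'orders.eu.created' (hypothetical scheme)."""
    return "{entity}.{region}.{event}".format(**record)


def declare_topology(channel, queue="orders.eu", pattern="orders.eu.*"):
    """Declare both exchanges and bind one consumer queue to a key pattern."""
    channel.exchange_declare(exchange=EXCHANGE, exchange_type="topic", durable=True)
    channel.exchange_declare(
        exchange=DEAD_LETTER_EXCHANGE, exchange_type="fanout", durable=True
    )
    channel.queue_declare(
        queue=queue,
        durable=True,
        # Rejected or expired messages are republished to this exchange.
        arguments={"x-dead-letter-exchange": DEAD_LETTER_EXCHANGE},
    )
    channel.queue_bind(queue=queue, exchange=EXCHANGE, routing_key=pattern)


def publish_record(channel, record, properties=None):
    """Publish one enriched record as JSON; return the routing key used."""
    key = routing_key(record)
    channel.basic_publish(
        exchange=EXCHANGE,
        routing_key=key,
        body=json.dumps(record).encode("utf-8"),
        properties=properties,
    )
    return key


def make_consumer_callback(process):
    """Wrap `process` so successes are acked and failures are dead-lettered."""
    def on_message(channel, method, properties, body):
        try:
            process(json.loads(body))
        except Exception:
            # requeue=False hands the message to the dead-letter exchange.
            channel.basic_nack(delivery_tag=method.delivery_tag, requeue=False)
        else:
            channel.basic_ack(delivery_tag=method.delivery_tag)
    return on_message


def run_producer(records, host="localhost"):
    """Connect to a broker and publish every record as a persistent message."""
    import pika  # imported lazily so the helpers above work without a broker

    connection = pika.BlockingConnection(pika.ConnectionParameters(host=host))
    try:
        channel = connection.channel()
        declare_topology(channel)
        persistent = pika.BasicProperties(
            delivery_mode=2, content_type="application/json"
        )
        for record in records:
            publish_record(channel, record, persistent)
    finally:
        connection.close()
```

On the consumer side, the callback would be registered with `channel.basic_consume(queue="orders.eu", on_message_callback=make_consumer_callback(handler))` followed by `channel.start_consuming()`, where `handler` is whatever downstream processing function the pipeline defines.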
Individual Tool Pages