Data Ingestion
Managed Real-Time Streaming
★ 4.4
Delimited Data Preboarding
★ 3.7
pip install boto3
pip install csvpath
Python data engineers use `boto3`'s Kinesis client to put records onto a data stream from Lambda functions or EC2-based producers. Consumer applications use the Kinesis Client Library (KCL) through its Python wrapper (the `amazon-kinesis-client-python` project, published on PyPI as `amazon_kclpy`) to process shards in parallel with built-in checkpointing, a common pattern for real-time log processing and event enrichment.
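As a rough illustration of the producer side, the sketch below wraps the `put_record` call of the `boto3` Kinesis client. The stream name `example-events`, the `user_id` partition key field, and the helper names `build_record` and `send_event` are assumptions made for this example, not part of any particular pipeline.

```python
import json


def build_record(event: dict, key_field: str = "user_id") -> dict:
    """Build the Data/PartitionKey pair that put_record expects.

    key_field is a hypothetical choice; records with the same partition
    key are routed to the same shard.
    """
    return {
        "Data": json.dumps(event).encode("utf-8"),
        "PartitionKey": str(event[key_field]),
    }


def send_event(client, stream_name: str, event: dict) -> str:
    """Put one record on the stream and return its sequence number."""
    response = client.put_record(StreamName=stream_name, **build_record(event))
    return response["SequenceNumber"]


def main() -> None:
    # boto3 is imported here so the helpers above can be exercised
    # without AWS credentials (for example with a stub client).
    import boto3

    client = boto3.client("kinesis")
    # "example-events" is a placeholder stream name.
    seq = send_event(client, "example-events", {"user_id": 42, "action": "login"})
    print(seq)
```

Calling `main()` requires AWS credentials and an existing stream. Passing the client in as an argument keeps the helper usable from both Lambda handlers and long-running EC2 producers, and makes it straightforward to test with a stub.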
Python data engineers use CSVPath to validate complex CSV files against business rules that go beyond column type checking: conditional constraints (if column A has value X, column B must be non-null), cross-row lookups, and custom matching expressions. CSVPath rules are stored as text files separate from the Python pipeline code, which makes them auditable by non-developers.
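To make the conditional-constraint idea concrete, here is a plain-Python sketch of the rule "if column A has value X, column B must be non-null". It uses the standard-library `csv` module rather than CSVPath's own rule language, so it shows only the kind of check involved, not the CSVPath API. The column names, sample data, and the `check_conditional` helper are assumptions for illustration.

```python
import csv
import io


def check_conditional(rows, if_col, if_value, then_col):
    """Return file line numbers where if_col == if_value but then_col is empty."""
    failures = []
    # Data rows start on line 2 because line 1 is the header.
    for line_no, row in enumerate(rows, start=2):
        if row.get(if_col) == if_value and not (row.get(then_col) or "").strip():
            failures.append(line_no)
    return failures


# Hypothetical sample: shipped orders must carry a shipped_date.
SAMPLE = """order_id,status,shipped_date
1,shipped,2024-01-05
2,pending,
3,shipped,
"""

reader = csv.DictReader(io.StringIO(SAMPLE))
bad_lines = check_conditional(reader, "status", "shipped", "shipped_date")
print(bad_lines)  # prints [4]
```

In this sketch the rule is hard-coded in Python. Keeping such rules in separate text files, as described above, is what lets non-developers review them without reading pipeline code.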
Individual Tool Pages