Data/Schema Validation
Lightweight Data Validation
★ 4.5
Object Serialization & Validation
★ 4.7
pip install cerberuspip install marshmallowpip install cerberuspip install marshmallowEngineers use Cerberus to validate incoming records in ETL pipelines — defining a schema dict that specifies expected types and constraints, then calling `v.validate(record)` for each row. Invalid records are logged or quarantined based on `v.errors`, keeping bad data out of the warehouse while processing continues.
Data engineers use Marshmallow to define schemas for API request/response objects and database records — deserializing raw JSON from external APIs into validated Python dicts, and serializing SQLAlchemy model instances back to JSON for downstream consumers. Marshmallow's nested schemas cleanly handle parent-child relationships common in data pipelines.
Individual Tool Pages