Data Quality
Data Validation & Documentation
★ 4.7
Data Quality Testing
★ 4.6
pip install great-expectationspip install soda-corepip install great-expectationspip install soda-coreData engineers integrate Great Expectations into pipelines as a quality gate — defining expectations for each dataset (row counts, column nullability, value ranges), then running a Checkpoint after each ingestion job to validate the data. Failed validations trigger alerts or halt the pipeline before bad data reaches the warehouse.
Data engineers integrate Soda Core into Airflow or dbt pipelines to run data quality scans after each transformation step. A scan YAML file defines checks on a specific table, the Python SDK runs them, and failed checks are reported to Soda Cloud or surfaced as pipeline task failures to block bad data from advancing.
Individual Tool Pages