Data Quality
Schema Validation Tool
★ 4.1
Data Validation & Documentation
★ 4.7
pip install data-linterpip install great-expectationspip install data-linterpip install great-expectationsData engineers use Data Linter in CI pipelines to enforce dataset standards before promotion to production. Running `data-linter` on a new dataset file flags issues like potential PII in column names or inefficient data types — catching structural problems early in the development workflow rather than after data lands in the warehouse.
Data engineers integrate Great Expectations into pipelines as a quality gate — defining expectations for each dataset (row counts, column nullability, value ranges), then running a Checkpoint after each ingestion job to validate the data. Failed validations trigger alerts or halt the pipeline before bad data reaches the warehouse.
Individual Tool Pages