// data-lake-management
Git-Like Data Lake Versioning
An open-source platform that delivers resilience and manageability to object-storage-based data lakes. lakeFS provides git-like branching, merging, and versioning for data, enabling safe experimentation and CI/CD workflows for data pipelines.
Python data engineers use lakeFS to apply software engineering practices to data lake management. A pipeline writes to a lakeFS branch, data quality tests run against the branch, and the Python SDK merges the branch to main only on test success. This prevents bad pipeline outputs from reaching production consumers — the same guarantee that Git branches provide for code changes.
An open-source platform that delivers resilience and manageability to object-storage-based data lakes. lakeFS provides git-like branching, merging, and versioning for data, enabling safe experimentation and CI/CD workflows for data pipelines.
lakeFS offers freemium pricing options.
lakeFS is listed under the Data Lake Management category on Python Data Engineering.
Details
Related
| Tool | Pricing | Rating | |
|---|---|---|---|
KE Kestranew Event-Driven Orchestration Platform | Freemium | ★ 4.4 | → |
PR Prometheusfeatured Open-Source Monitoring System | Free | ★ 4.7 | → |
PN Project Nessie Transactional Data Lake Catalog | Free | ★ 4.3 | → |