File Systems & Storage
Unified Distributed Storage
★ 4.4
Cloud-Backed File System
★ 3.8
pip install cephpip install s3qlpip install cephpip install s3qlPython data engineers in on-premise or private cloud environments use Ceph's S3-compatible RADOS Gateway as a drop-in replacement for AWS S3 — boto3 and awswrangler work unchanged by pointing them at the Ceph endpoint URL. CephFS is mounted as a shared file system that multiple Python pipeline worker nodes read from and write to simultaneously.
Python data engineers use S3QL to mount cloud object storage as an encrypted local file system — writing pipeline output files to a mounted S3QL volume using standard Python file I/O (`open()`, `write()`) without any cloud SDK code. S3QL's encryption-at-rest is useful for storing sensitive pipeline outputs in cloud storage with a stronger encryption posture than default S3 SSE.
Individual Tool Pages