Databases & Data Warehouses
Distributed Wide-Column Store
★ 4.3
Advanced Open Source Database
★ 4.8
pip install cassandra-driverpip install psycopg2-binarypip install cassandra-driverpip install psycopg2-binaryPython data engineers use the `cassandra-driver` library to read from and write to Cassandra clusters in high-write-throughput scenarios — IoT event ingestion, clickstream storage, and time-series data. Engineers model data around query patterns rather than normalization, designing partition keys that keep related rows co-located for fast reads.
PostgreSQL is the most popular database target for Python data pipelines. Engineers use `psycopg2` or `asyncpg` for direct connections, SQLAlchemy for ORM-based access, and `pd.read_sql()` for pulling query results into DataFrames. PostgreSQL's JSONB support is frequently used to store semi-structured API responses before they are normalized into relational tables.
Individual Tool Pages