Databases & Data Warehouses
In-Process Analytical Database
★ 4.8
Scalable Graph Database
★ 3.6
Install: `pip install duckdb` (DuckDB); N/A, archived Java project (Titan)
Python data engineers use DuckDB to run fast analytical SQL queries directly on Parquet files in a data lake without a database server. `duckdb.query("SELECT * FROM parquet_scan('s3://bucket/file.parquet')")` returns a relation that converts to an Arrow table with `.arrow()` or to a pandas DataFrame with `.df()`. Because DuckDB scans the file in chunks and reads only the columns a query needs, complex aggregations on large files can run without loading them fully into memory.
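As a rough illustration of the DuckDB workflow described above, the sketch below aggregates a Parquet file with an in-memory connection. The column names `category` and `amount` and the helper names `build_category_query` and `summarize` are assumptions made for this example; they do not come from any real dataset or from the DuckDB API.

```python
from typing import Any, List, Tuple


def build_category_query(parquet_path: str) -> str:
    """Build an aggregation query over a Parquet file (hypothetical columns)."""
    # SQL string literals use single quotes; double any embedded quote.
    safe_path = parquet_path.replace("'", "''")
    return (
        "SELECT category, count(*) AS n, sum(amount) AS total "
        f"FROM read_parquet('{safe_path}') "
        "GROUP BY category ORDER BY category"
    )


def summarize(parquet_path: str) -> List[Tuple[Any, ...]]:
    """Run the aggregation in-process and return the rows as tuples."""
    import duckdb  # third-party: pip install duckdb

    con = duckdb.connect()  # in-memory database, no server process
    try:
        result = con.execute(build_category_query(parquet_path))
        # result.df() or result.arrow() would return pandas / Arrow instead.
        return result.fetchall()
    finally:
        con.close()
```

Calling `summarize("events.parquet")` on a local file returns one row per category. For an `s3://` path, DuckDB's `httpfs` extension and S3 credentials need to be available to the connection.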
Python data engineers use Titan (now superseded by JanusGraph) with the `gremlin-python` driver to traverse large graph datasets stored in Cassandra or HBase. Gremlin traversal queries find multi-hop relationships in fraud detection, recommendation, and knowledge graph pipelines — the Python Gremlin driver sends queries to the Titan/JanusGraph server and processes results as Python dicts.
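A multi-hop traversal of the kind mentioned above might look like the following sketch, written against a JanusGraph-style Gremlin Server. The server URL, the `account` vertex label, the `account_id` property, and the `transfers_to` edge label are all hypothetical, and the snake_case step names assume `gremlinpython` 3.5 or later (older releases use camelCase names such as `withRemote` and `valueMap`).

```python
from typing import Any, Dict, List


def flatten_value_map(row: Dict[Any, List[Any]]) -> Dict[Any, Any]:
    """Unwrap the single-element lists value_map() returns for each property."""
    return {key: vals[0] if len(vals) == 1 else vals for key, vals in row.items()}


def two_hop_accounts(server_url: str, account_id: str) -> List[Dict[Any, Any]]:
    """Return the accounts exactly two 'transfers_to' hops from a start account."""
    # third-party: pip install gremlinpython
    from gremlin_python.driver.driver_remote_connection import DriverRemoteConnection
    from gremlin_python.process.anonymous_traversal import traversal
    from gremlin_python.process.graph_traversal import __

    conn = DriverRemoteConnection(server_url, "g")
    try:
        g = traversal().with_remote(conn)
        rows = (
            g.V()
            .has("account", "account_id", account_id)        # hypothetical schema
            .repeat(__.out("transfers_to").simple_path())    # follow outgoing edges
            .times(2)                                        # exactly two hops
            .dedup()
            .value_map()
            .to_list()                                       # list of Python dicts
        )
    finally:
        conn.close()
    return [flatten_value_map(row) for row in rows]
```

With a server listening at, say, `ws://localhost:8182/gremlin`, `two_hop_accounts(url, "A-1")` would return one flat dict per reachable account. The flattening helper is separate so the result handling can be exercised without a running server.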
Databases & Data Warehouses
MongoDB vs PostgreSQL
PostgreSQL vs Redis
Apache Cassandra vs PostgreSQL
Neo4j vs PostgreSQL
InfluxDB vs PostgreSQL
Elasticsearch vs PostgreSQL
Individual Tool Pages