Discover 2 tools tagged with Java for Python data engineering.
Tools built on the Java Virtual Machine (JVM) are common infrastructure components in Python data engineering stacks. Systems like Apache Kafka, Hadoop, Spark, and Flink run on the JVM but are accessed from Python via official clients and APIs. Understanding JVM-based tooling is important for operating large-scale distributed data systems.
Open-Source Change Data Capture Platform
An open-source CDC platform that monitors databases and streams every committed row-level change as a structured event. Debezium reads directly from database replication logs — capturing inserts, updates, and deletes in real time with no polling and no impact on query performance.