A fast and efficient object graph serialization framework for Java. Kryo is commonly used as the serialization backend for Apache Spark and other JVM-based data processing frameworks for high-performance data exchange.
Python data engineers encounter Kryo when tuning PySpark job performance — enabling Kryo serialization in Spark config (`spark.serializer=org.apache.spark.serializer.KryoSerializer`) reduces shuffle data size and speeds up operations that cross network boundaries between Spark executors. PySpark's Python UDFs still use pickle for Python objects, but JVM-side data uses Kryo.
A fast and efficient object graph serialization framework for Java. Kryo is commonly used as the serialization backend for Apache Spark and other JVM-based data processing frameworks for high-performance data exchange.
Yes, Kryo is free to use.
Kryo is listed under the Serialization Formats category on Python Data Engineering.
Details
Related
| Tool | Pricing | Rating | |
|---|---|---|---|
AA Apache Avrofeatured Schema-Based Data Serialization | Free | ★ 4.5 | → |
AO Apache ORC Optimized Row Columnar Format | Free | ★ 4.3 | → |
AP Apache Parquetfeatured Columnar Storage Format | Free | ★ 4.8 | → |