Data Ingestion
Distributed Pub-Sub Messaging
★ 4.5
Unified Logging Layer
★ 4.4
pip install pulsar-clientN/A — Ruby daemon, install via package managerpip install pulsar-clientN/A — Ruby daemon, install via package managerPython data engineers use the `pulsar-client` Python SDK to produce and consume messages from Pulsar topics. Pulsar Functions can be written in Python to perform lightweight transformations — filtering, enriching, or routing messages — without deploying a separate Faust or Spark Streaming cluster. Pulsar's topic compaction and retention policies simplify stateful event stream management.
Python data engineers use Fluentd to collect application logs from Python services and route them to Elasticsearch, BigQuery, or S3 for analysis. Python applications emit structured JSON logs which Fluentd's tail input plugin reads, applies filter plugins to parse and enrich, and forwards to the analytics destination — decoupling log production from storage decisions.
Individual Tool Pages