Simple yet powerful live data computation framework.
created at April 14, 2021, 7:35 p.m.
Python port of the Scramjet framework.
created at Aug. 19, 2021, 8:56 a.m.
Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.
created at April 22, 2018, 8:06 p.m.
Substation is a toolkit for routing, normalizing, and enriching security event and audit logs.
created at April 15, 2022, 2:23 p.m.
Distributed, Scalable, Fault-Tolerant, Minimalistic Workflow Engine - No DAGs, No YAML, No Cumbersome Diagrams, Just Code
created at Dec. 27, 2022, 5:42 p.m.
FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
created at Dec. 1, 2022, 9:46 a.m.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
created at Nov. 27, 2022, 1:01 p.m.
A streaming SQL engine, a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse.
created at Aug. 14, 2023, 3:11 a.m.
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data.
created at Aug. 12, 2011, 6:20 p.m.
Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.
created at May 2, 2014, 8:33 p.m.
A scalable, mature, and versatile web crawler based on Apache Storm.
created at April 12, 2013, 2:13 p.m.
Apache DataFusion Ballista Distributed Query Engine
created at May 19, 2022, 2:32 p.m.
Cross-platform, customizable ML solutions for live and streaming media.
created at June 13, 2019, 7:16 p.m.