framework-js by scramjetorg

Simple yet powerful live data computation framework.

created at April 14, 2021, 7:35 p.m.

TypeScript

11 +0

38 +0

0 +0

GitHub
framework-python by scramjetorg

Python port of Scramjet framework

created at Aug. 19, 2021, 8:56 a.m.

Python

12 +0

35 +0

1 +0

GitHub
streampipes by apache

Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

created at April 22, 2018, 8:06 p.m.

Java

26 +0

605 +1

181 +5

GitHub
bytewax by bytewax

Python Stream Processing

created at Feb. 4, 2022, 6:29 p.m.

Python

18 +0

1,559 +12

64 +0

GitHub
quix-streams by quixio

Python stream processing for Kafka

created at Nov. 17, 2022, 8:36 p.m.

Python

19 +0

1,190 +6

69 +1

GitHub
substation by brexhq

Substation is a toolkit for routing, normalizing, and enriching security event and audit logs.

created at April 15, 2022, 2:23 p.m.

Go

8 +0

330 +3

20 +0

GitHub
zilla by aklivity

🦎 A multi-protocol edge & service proxy. Seamlessly interface web apps, IoT clients, & microservices to Apache Kafka® via declaratively defined, stateless APIs.

created at Dec. 7, 2021, 10:10 p.m.

Java

8 -1

543 +0

50 +0

GitHub
javactrl-kafka by javactrl

Distributed, Scalable, Fault-Tolerant, Minimalistic Workflow Engine - No DAGs, No YAML, No Cumbersome Diagrams, Just Code

created at Dec. 27, 2022, 5:42 p.m.

Java

4 +0

9 +0

2 +0

GitHub
faststream by airtai

FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.

created at Dec. 1, 2022, 9:46 a.m.

Python

22 +0

3,133 +39

161 +1

GitHub
pathway by pathwaycom

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

created at Nov. 27, 2022, 1:01 p.m.

Python

29 +1

4,325 +54

139 +2

GitHub
proton by timeplus-io

A streaming SQL engine, a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse.

created at Aug. 14, 2023, 3:11 a.m.

C++

20 +0

1,570 +8

68 +0

GitHub
numalogic by numaproj

Collection of operational time series ML models and tools

created at July 12, 2022, 11:48 p.m.

Python

8 +0

167 +1

29 +1

GitHub
numaflow by numaproj

Kubernetes-native platform to run massively parallel data/streaming jobs

created at May 20, 2022, 10:05 p.m.

Go

22 +0

1,291 +163

114 +1

GitHub
logging-flume by apache

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log-like data

created at Aug. 12, 2011, 6:20 p.m.

Java

225 +1

2,537 +1

1,571 +0

GitHub
pekko by apache

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

created at Oct. 31, 2022, 8:40 a.m.

Scala

55 +0

1,222 +6

149 +1

GitHub
streamparse by pystorm

Run Python in Apache Storm topologies. Pythonic API, CLI tooling, and a topology DSL.

created at May 2, 2014, 8:33 p.m.

Python

101 +0

1,495 +0

218 +0

GitHub
incubator-stormcrawler by apache

A scalable, mature and versatile web crawler based on Apache Storm

created at April 12, 2013, 2:13 p.m.

Java

66 +0

891 +2

260 +0

GitHub
datafusion-ballista by apache

Apache DataFusion Ballista Distributed Query Engine

created at May 19, 2022, 2:32 p.m.

Rust

52 +0

1,544 +10

196 +0

GitHub
automq by AutoMQ

AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency.

created at Aug. 17, 2023, 7:50 a.m.

Java

36 +0

3,836 +15

214 -2

GitHub
mediapipe by google-ai-edge

Cross-platform, customizable ML solutions for live and streaming media.

created at June 13, 2019, 7:16 p.m.

C++

511 -3

27,601 +89

5,165 +11

GitHub