hazelcast-jet by hazelcast

Distributed Stream and Batch Processing

created at Dec. 15, 2015, 2:01 p.m.

Java

77 +0

1,105 +1

206 +1

GitHub
onyx by onyx-platform

Distributed, masterless, high performance, fault tolerant data processing

created at Dec. 2, 2013, 1:21 a.m.

Clojure

122 +0

2,050 +0

205 +0

GitHub
mantis by Netflix

A platform that makes it easy for developers to build realtime, cost-effective, operations-focused applications

created at June 6, 2019, 11:44 p.m.

Java

222 +1

1,416 +1

202 +0

GitHub
datafusion-ballista by apache

Apache DataFusion Ballista Distributed Query Engine

created at May 19, 2022, 2:32 p.m.

Rust

52 +0

1,544 +10

196 +0

GitHub
streampipes by apache

Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

created at April 22, 2018, 8:06 p.m.

Java

26 +0

605 +1

181 +5

GitHub
apex-core by apache

Mirror of Apache Apex core

created at Aug. 25, 2015, 7 a.m.

Java

50 +0

350 +0

176 +0

GitHub
suro by Netflix

Netflix's distributed Data Pipeline

created at March 20, 2013, 9:02 p.m.

Java

514 +1

794 +0

171 +0

GitHub
faststream by airtai

FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.

created at Dec. 1, 2022, 9:46 a.m.

Python

22 +0

3,133 +39

161 +1

GitHub
gearpump by gearpump

Lightweight real-time big data streaming engine over Akka

created at July 23, 2014, 8:55 a.m.

Scala

91 +0

763 +0

152 +0

GitHub
pekko by apache

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

created at Oct. 31, 2022, 8:40 a.m.

Scala

55 +0

1,222 +6

149 +1

GitHub
streamz by python-streamz

Real-time stream processing for python

created at April 4, 2017, 9:45 p.m.

Python

35 +1

1,244 +1

148 +0

GitHub
streamDM by huawei-noah

Stream Data Mining Library for Spark Streaming

created at June 8, 2015, 1:28 a.m.

Scala

67 +0

492 +0

147 +0

GitHub
pathway by pathwaycom

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

created at Nov. 27, 2022, 1:01 p.m.

Python

29 +1

4,325 +54

139 +2

GitHub
incubator-retired-edgent by apache

Mirror of Apache Edgent (Incubating)

created at March 10, 2016, 8 a.m.

Java

50 +0

217 +0

136 +0

GitHub
Trill by Microsoft

Trill is a single-node query processor for temporal or streaming data.

created at Sept. 26, 2018, 6:48 p.m.

C#

63 +0

1,248 +0

132 +0

GitHub
yomo by yomorun

🦖 Stateful Serverless Framework for Geo-distributed Edge AI Infra. with function calling support, write once, run on any model.

created at July 1, 2020, 5:48 a.m.

Go

43 +0

1,669 +1

127 -1

GitHub
numaflow by numaproj

Kubernetes-native platform to run massively parallel data/streaming jobs

created at May 20, 2022, 10:05 p.m.

Go

22 +0

1,291 +163

114 +1

GitHub
incubator-samoa by apache

Mirror of Apache Samoa (Incubating)

created at Jan. 27, 2015, 8 a.m.

Java

39 +0

248 +0

107 -1

GitHub
datacollector-oss by streamsets

datacollector-oss

created at May 6, 2021, 9:13 p.m.

Java

10 +0

90 +0

99 +0

GitHub
squall by epfldata

A streaming / online query processing / analytics engine based on Apache Storm

created at March 29, 2012, 5:02 p.m.

Java

62 +0

270 +0

96 +0

GitHub