numaflow by numaproj

Kubernetes-native platform to run massively parallel data/streaming jobs

created at May 20, 2022, 10:05 p.m.

Go

22 +0

1,291 +163

114 +1

GitHub
Trill by Microsoft

Trill is a single-node query processor for temporal or streaming data.

created at Sept. 26, 2018, 6:48 p.m.

C#

63 +0

1,248 +0

132 +0

GitHub
streamz by python-streamz

Real-time stream processing for python

created at April 4, 2017, 9:45 p.m.

Python

35 +1

1,244 +1

148 +0

GitHub
AthenaX by uber-archive

SQL-based streaming analytics platform at scale

created at Sept. 18, 2017, 8:37 p.m.

Java

79 +0

1,224 +0

287 +0

GitHub
pekko by apache

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

created at Oct. 31, 2022, 8:40 a.m.

Scala

55 +0

1,222 +6

149 +1

GitHub
quix-streams by quixio

Python stream processing for Kafka

created at Nov. 17, 2022, 8:36 p.m.

Python

19 +0

1,190 +6

69 +1

GitHub
hazelcast-jet by hazelcast

Distributed Stream and Batch Processing

created at Dec. 15, 2015, 2:01 p.m.

Java

77 +0

1,105 +1

206 +1

GitHub
datasketches-java by apache

A software library of stochastic streaming algorithms, a.k.a. sketches.

created at June 30, 2015, 1:05 a.m.

Java

58 +0

896 +2

209 +0

GitHub
incubator-stormcrawler by apache

A scalable, mature and versatile web crawler based on Apache Storm

created at April 12, 2013, 2:13 p.m.

Java

66 +0

891 +2

260 +0

GitHub
camus by LinkedInAttic

LinkedIn's previous generation Kafka to HDFS pipeline.

created at Dec. 20, 2012, 11:54 p.m.

Java

143 +0

881 -1

457 +0

GitHub
esper by espertechinc

Esper Complex Event Processing, Streaming SQL and Event Series Analysis

created at April 6, 2015, 4:21 p.m.

Java

66 +0

840 +1

259 +0

GitHub
Turbine by Netflix

SSE Stream Aggregator

created at Dec. 6, 2012, 9:44 p.m.

Java

497 +1

835 +0

255 +0

GitHub
samza by apache

Mirror of Apache Samza

created at March 14, 2015, 7 a.m.

Java

58 +0

820 +1

334 +0

GitHub
suro by Netflix

Netflix's distributed Data Pipeline

created at March 20, 2013, 9:02 p.m.

Java

514 +1

794 +0

171 +0

GitHub
gearpump by gearpump

Lightweight real-time big data streaming engine over Akka

created at July 23, 2014, 8:55 a.m.

Scala

91 +0

763 +0

152 +0

GitHub
core by gazette

Build platforms that flexibly mix SQL, batch, and stream processing paradigms

created at Oct. 20, 2017, 6:54 p.m.

Go

34 +0

718 +1

52 +0

GitHub
hstream by hstreamdb

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

created at Aug. 31, 2020, 9:42 a.m.

Haskell

25 +0

707 +0

55 +0

GitHub
nussknacker by TouK

Low-code tool for automating actions on real time data | Stream processing for the users.

created at June 29, 2017, 2:09 p.m.

Scala

37 +2

659 +3

93 +0

GitHub
streaming-benchmarks by yahoo

Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...

created at Dec. 15, 2015, 4:41 p.m.

Jupyter Notebook

61 +0

633 +1

298 +0

GitHub
streampipes by apache

Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

created at April 22, 2018, 8:06 p.m.

Java

26 +0

605 +1

181 +5

GitHub