streamDM by huawei-noah

Stream Data Mining Library for Spark Streaming

created at June 8, 2015, 1:28 a.m.

Scala

68 +0

489 +0

147 +0

GitHub
zilla by aklivity

🦎 A multi-protocol, event-native proxy. Securely interface web apps, IoT clients, & microservices to Apache Kafka® via declaratively defined, stateless APIs.

created at Dec. 7, 2021, 10:10 p.m.

Java

9 +0

494 +3

47 +0

GitHub
core by gazette

Build platforms that flexibly mix SQL, batch, and stream processing paradigms

created at Oct. 20, 2017, 6:54 p.m.

Go

33 +0

523 +1

51 +0

GitHub
streampipes by apache

Apache StreamPipes - A self-service (Industrial) IoT toolbox to enable non-technical users to connect, analyze and explore IoT data streams.

created at April 22, 2018, 8:06 p.m.

Java

27 +0

556 +1

171 -1

GitHub
nussknacker by TouK

Low-code tool for automating actions on real time data | Stream processing for the users.

created at June 29, 2017, 2:09 p.m.

Scala

33 +0

615 +0

90 +0

GitHub
streaming-benchmarks by yahoo

Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...

created at Dec. 15, 2015, 4:41 p.m.

Jupyter Notebook

61 +0

620 +0

294 +1

GitHub
hstream by hstreamdb

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

created at Aug. 31, 2020, 9:42 a.m.

Haskell

23 +0

693 +1

56 +0

GitHub
gearpump by gearpump

Lightweight real-time big data streaming engine over Akka

created at July 23, 2014, 8:55 a.m.

Scala

92 +0

763 +0

153 +0

GitHub
suro by Netflix

Netflix's distributed Data Pipeline

created at March 20, 2013, 9:02 p.m.

Java

511 +2

791 +0

168 +0

GitHub
samza by apache

Mirror of Apache Samza

created at March 14, 2015, 7 a.m.

Java

60 +0

798 +0

327 +0

GitHub
esper by espertechinc

Esper Complex Event Processing, Streaming SQL and Event Series Analysis

created at April 6, 2015, 4:21 p.m.

Java

66 +0

824 +2

256 +0

GitHub
Turbine by Netflix

SSE Stream Aggregator

created at Dec. 6, 2012, 9:44 p.m.

Java

491 +2

836 +0

256 +0

GitHub
quix-streams by quixio

100% Python stream processing with Streaming DataFrames

created at Nov. 17, 2022, 8:36 p.m.

Python

17 +1

836 +21

42 +2

GitHub
incubator-stormcrawler by apache

A scalable, mature and versatile web crawler based on Apache Storm

created at April 12, 2013, 2:13 p.m.

HTML

70 -1

862 +0

256 +0

GitHub
datasketches-java by apache

A software library of stochastic streaming algorithms, a.k.a. sketches.

created at June 30, 2015, 1:05 a.m.

Java

60 +0

871 +1

205 +0

GitHub
camus by LinkedInAttic

LinkedIn's previous generation Kafka to HDFS pipeline.

created at Dec. 20, 2012, 11:54 p.m.

Java

143 +0

883 +0

461 +0

GitHub
numaflow by numaproj

Kubernetes-native platform to run massively parallel data/streaming jobs

created at May 20, 2022, 10:05 p.m.

Go

17 +1

930 +7

95 +0

GitHub
pekko by apache

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

created at Oct. 31, 2022, 8:40 a.m.

Scala

58 +0

1,090 +8

133 +3

GitHub
hazelcast-jet by hazelcast

Distributed Stream and Batch Processing

created at Dec. 15, 2015, 2:01 p.m.

Java

78 +0

1,091 +1

206 +0

GitHub
streamz by python-streamz

Real-time stream processing for python

created at April 4, 2017, 9:45 p.m.

Python

35 +0

1,222 +0

144 +0

GitHub