incubator-stormcrawler by apache

A scalable, mature and versatile web crawler based on Apache Storm

created at April 12, 2013, 2:13 p.m.

Java

66 +0

891 +2

260 +0

GitHub
rudder-server by rudderlabs

Privacy and Security focused Segment-alternative, in Golang and React

created at July 19, 2019, 9:24 a.m.

Go

63 +1

4,093 +11

317 +0

GitHub
Trill by Microsoft

Trill is a single-node query processor for temporal or streaming data.

created at Sept. 26, 2018, 6:48 p.m.

C#

63 +0

1,248 +0

132 +0

GitHub
squall by epfldata

A streaming / online query processing / analytics engine based on Apache Storm

created at March 29, 2012, 5:02 p.m.

Java

62 +0

270 +0

96 +0

GitHub
streaming-benchmarks by yahoo

Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...

created at Dec. 15, 2015, 4:41 p.m.

Jupyter Notebook

61 +0

633 +1

298 +0

GitHub
monix by monix

Asynchronous, Reactive Programming for Scala and Scala.js.

created at Jan. 6, 2014, 12:56 p.m.

Scala

59 +0

1,929 +1

246 +0

GitHub
samza by apache

Mirror of Apache Samza

created at March 14, 2015, 7 a.m.

Java

58 +0

820 +1

334 +0

GitHub
datasketches-java by apache

A software library of stochastic streaming algorithms, a.k.a. sketches.

created at June 30, 2015, 1:05 a.m.

Java

58 +0

896 +2

209 +0

GitHub
pekko by apache

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

created at Oct. 31, 2022, 8:40 a.m.

Scala

55 +0

1,222 +6

149 +1

GitHub
datafusion-ballista by apache

Apache DataFusion Ballista Distributed Query Engine

created at May 19, 2022, 2:32 p.m.

Rust

52 +0

1,544 +10

196 +0

GitHub
incubator-retired-edgent by apache

Mirror of Apache Edgent (Incubating)

created at March 10, 2016, 8 a.m.

Java

50 +0

217 +0

136 +0

GitHub
apex-core by apache

Mirror of Apache Apex core

created at Aug. 25, 2015, 7 a.m.

Java

50 +0

350 +0

176 +0

GitHub
mupd8 by walmartlabs

Muppet

created at Aug. 2, 2012, 11:49 p.m.

Scala

48 +0

126 +0

35 +0

GitHub
ekuiper by lf-edge

Lightweight data stream processing engine for IoT edge

created at July 3, 2019, 7:37 a.m.

Go

45 +0

1,480 +4

414 +1

GitHub
streamflow by lmco

StreamFlow™ is a stream processing tool designed to help build and monitor processing workflows.

created at Oct. 6, 2014, 4:57 p.m.

Java

44 +0

253 +0

69 +0

GitHub
yomo by yomorun

🦖 Stateful Serverless Framework for Geo-distributed Edge AI Infra, with function calling support: write once, run on any model.

created at July 1, 2020, 5:48 a.m.

Go

43 +0

1,669 +1

127 -1

GitHub
trident-ml by pmerienne

Trident-ML: A real-time online machine learning library

created at March 22, 2013, 5:01 p.m.

Java

42 +0

382 +0

87 +0

GitHub
fluvio by infinyon

Lean and mean distributed stream processing system written in Rust and WebAssembly. Alternative to Kafka + Flink in one.

created at Aug. 31, 2019, 12:11 a.m.

Rust

42 +0

3,880 +12

491 -1

GitHub
incubator-samoa by apache

Mirror of Apache Samoa (Incubating)

created at Jan. 27, 2015, 8 a.m.

Java

39 +0

248 +0

107 -1

GitHub
nussknacker by TouK

Low-code tool for automating actions on real time data | Stream processing for the users.

created at June 29, 2017, 2:09 p.m.

Scala

37 +2

659 +3

93 +0

GitHub