spark by apache

Apache Spark - A unified analytics engine for large-scale data processing

created at Feb. 25, 2014, 8 a.m.

Scala

2,023 +0

40,184 +52

28,378 +8

GitHub
kafka by apache

Mirror of Apache Kafka

created at Aug. 15, 2011, 6:06 p.m.

Java

1,061 +2

29,065 +49

14,027 +7

GitHub
mediapipe by google-ai-edge

Cross-platform, customizable ML solutions for live and streaming media.

created at June 13, 2019, 7:16 p.m.

C++

516 +3

27,937 +71

5,193 +12

GitHub
nsq by nsqio

A realtime distributed messaging platform

created at May 12, 2012, 2:37 p.m.

Go

840 +0

25,029 +12

2,901 +0

GitHub
flink by apache

Apache Flink

created at June 7, 2014, 7 a.m.

Java

942 +0

24,263 +32

13,409 +16

GitHub
rocketmq by apache

Apache RocketMQ is a cloud native messaging and streaming platform, making it simple to build event-driven applications.

created at Nov. 30, 2016, 8 a.m.

Java

869 +1

21,355 +20

11,727 +5

GitHub
pulsar by apache

Apache Pulsar - distributed pub-sub messaging system

created at June 28, 2016, 7 a.m.

Java

403 +0

14,317 +12

3,601 +2

GitHub
akka by akka

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

created at Feb. 16, 2009, 12:51 p.m.

Scala

572 +0

13,072 +2

3,592 +0

GitHub
redpanda by vectorizedio

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

created at Nov. 2, 2020, 10:43 p.m.

C++

137 +0

9,763 +14

592 -1

GitHub
connect by vectorizedio

Fancy stream processing made operationally mundane

created at March 22, 2016, 1:18 a.m.

Go

120 +0

8,165 +5

843 +0

GitHub
beam by apache

Apache Beam is a unified programming model for Batch and Streaming data processing.

created at Feb. 2, 2016, 8 a.m.

Java

255 +0

7,911 +9

4,274 +3

GitHub
aeron by real-logic

Efficient reliable UDP unicast, UDP multicast, and IPC message transport

created at Feb. 7, 2014, 5:16 p.m.

Java

358 +0

7,468 +6

894 +0

GitHub
risingwave by singularity-data

Best-in-class stream processing, analytics, and management. Perform continuous analytics, or build event-driven applications, real-time ETL pipelines, and feature stores in minutes. Unified streaming and batch. PostgreSQL compatible.

created at Jan. 28, 2022, 12:58 p.m.

Rust

79 +0

7,135 +25

590 +2

GitHub
pathway by pathwaycom

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

created at Nov. 27, 2022, 1:01 p.m.

Python

29 +1

6,989 +1193

155 +11

GitHub
faust by robinhood

Python Stream Processing

created at March 8, 2017, 6:36 p.m.

Python

139 +0

6,750 -1

534 +0

GitHub
storm by apache

Apache Storm

created at Nov. 5, 2013, 8 a.m.

Java

577 +0

6,603 +2

4,077 -1

GitHub
river by creme-ml

🌊 Online machine learning in Python

created at Jan. 24, 2019, 3:18 p.m.

Python

85 +1

5,120 +6

553 +2

GitHub
rudder-server by rudderlabs

Privacy and Security focused Segment-alternative, in Golang and React

created at July 19, 2019, 9:24 a.m.

Go

62 +0

4,108 +2

321 +2

GitHub
fluvio by infinyon

Lean and mean distributed stream processing system written in rust and web assembly. Alternative to Kafka + Flink in one.

created at Aug. 31, 2019, 12:11 a.m.

Rust

42 +0

3,928 +9

486 -1

GitHub
automq by AutoMQ

AutoMQ is a cloud-first alternative to Kafka by decoupling durability to S3 and EBS. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency.

created at Aug. 17, 2023, 7:50 a.m.

Java

35 +0

3,916 +23

223 +3

GitHub