core by gazette

Build platforms that flexibly mix SQL, batch, and stream processing paradigms

created at Oct. 20, 2017, 6:54 p.m.

Go

33 +0

522 +3

51 +1

rudder-server by rudderlabs

Privacy- and security-focused Segment alternative, in Golang and React

created at July 19, 2019, 9:24 a.m.

Go

61 +0

3,945 +5

291 +2

siddhi by siddhi-io

Stream Processing and Complex Event Processing Engine

created at April 21, 2014, 9:51 a.m.

Java

122 +0

1,502 +2

522 +0

fs2 by typelevel

Compositional, streaming I/O library for Scala

created at March 6, 2013, 5:34 p.m.

Scala

88 +0

2,330 +1

590 +1

AthenaX by uber-archive

SQL-based streaming analytics platform at scale

created at Sept. 18, 2017, 8:37 p.m.

Java

79 +0

1,224 +2

289 +0

wally by WallarooLabs

Distributed Stream Processing

created at Dec. 30, 2015, 3:11 p.m.

Pony

71 +0

1,480 +0

67 +0

river by creme-ml

🌊 Online machine learning in Python

created at Jan. 24, 2019, 3:18 p.m.

Python

86 +0

4,790 +10

527 +0

mediapipe by google

Cross-platform, customizable ML solutions for live and streaming media.

created at June 13, 2019, 7:16 p.m.

C++

491 -1

25,610 +78

4,974 +8

datasketches-java by apache

A software library of stochastic streaming algorithms, a.k.a. sketches.

created at June 30, 2015, 1:05 a.m.

Java

60 +0

869 +0

205 +0

makinage by maki-nage

Stream Processing Made Easy

created at March 2, 2020, 10:18 p.m.

Python

4 +0

38 +0

1 +0

camus by LinkedInAttic

LinkedIn's previous generation Kafka to HDFS pipeline.

created at Dec. 20, 2012, 11:54 p.m.

Java

143 -1

884 +0

461 -1

hstream by hstreamdb

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

created at Aug. 31, 2020, 9:42 a.m.

Haskell

23 +0

692 +2

56 +0

datacollector-oss by streamsets

datacollector-oss

created at May 6, 2021, 9:13 p.m.

Java

10 +0

84 +0

95 +1

ekuiper by lf-edge

Lightweight data stream processing engine for IoT edge

created at July 3, 2019, 7:37 a.m.

Go

41 +0

1,371 +6

390 +3

fluvio by infinyon

Lean and mean distributed stream processing system written in Rust and WebAssembly.

created at Aug. 31, 2019, 12:11 a.m.

Rust

36 -1

2,680 +10

193 +2

scramjet by scramjetorg

Public tracker for Scramjet Cloud Platform, a platform that brings data from many environments together.

created at Jan. 18, 2017, 12:34 p.m.

Unknown languages

14 +0

254 +0

20 +0

transform-hub by scramjetorg

Scramjet Transform Hub (STH) is a runtime supervisor that can run data processing programs called Sequences and manage local resources on any Linux server, Docker on small edge servers, and even large-scale Kubernetes clusters in the cloud or datacenters. It connects to Scramjet Spaces in Scramjet Cloud Platform.

created at June 30, 2021, 1:18 p.m.

TypeScript

13 +0

65 +1

8 +0

yomo by yomorun

🦖 Stateful Serverless Framework for building Geo-distributed Edge AI Infra

created at July 1, 2020, 5:48 a.m.

Go

43 +0

1,618 +4

128 +0

redpanda by vectorizedio

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!

created at Nov. 2, 2020, 10:43 p.m.

C++

138 +1

8,895 +40

542 +1

daggy by synacker

Daggy - Data Aggregation Utility and C/C++ developer library for catching data streams

created at Sept. 15, 2018, 6:17 p.m.

C++

5 +0

148 -1

15 +0
