Gaffer by gchq

A large-scale entity and relation database supporting aggregation of properties

created at Dec. 14, 2015, 12:12 p.m.

Java

143 +0

1,731 +1

353 +0

GitHub
kairosdb by kairosdb

Fast scalable time series database

created at Feb. 5, 2013, 10:27 p.m.

Java

118 +0

1,724 +0

345 +0

GitHub
PyHive by dropbox

Python interface to Hive and Presto. 🐝

created at Feb. 1, 2014, 9:05 a.m.

Python

62 +0

1,665 +2

552 +0

GitHub
bottledwater-pg by confluentinc

Change data capture from PostgreSQL into Kafka

created at Feb. 11, 2015, 6:02 p.m.

C

367 +0

1,526 +0

155 +0

GitHub
faust by faust-streaming

Python Stream Processing. A Faust fork

created at Oct. 22, 2020, 3:32 p.m.

Python

28 +0

1,445 +6

170 -1

GitHub
FiloDB by filodb

Distributed Prometheus time series database

created at Jan. 14, 2015, 6:35 p.m.

Scala

89 +0

1,412 +0

229 +0

GitHub
HyperDex by rescrv

HyperDex is a scalable, searchable key-value store

created at Feb. 20, 2012, 11:32 a.m.

C++

88 +0

1,394 +1

168 +0

GitHub
DataProfiler by capitalone

What's in your data? Extract schema, statistics and entities from datasets

created at Nov. 9, 2020, 3:20 p.m.

Python

21 +0

1,359 +5

154 +0

GitHub
ekuiper by lf-edge

Lightweight data stream processing engine for IoT edge

created at July 3, 2019, 7:37 a.m.

Go

41 +0

1,357 +5

381 +3

GitHub
ccm by riptano

A script to easily create and destroy an Apache Cassandra cluster on localhost

created at March 1, 2011, 9:42 a.m.

Python

76 +0

1,212 +0

302 +0

GitHub
pinball by pinterest

Pinball is a scalable workflow manager

created at March 4, 2015, 3:13 a.m.

JavaScript

54 +0

1,047 +0

143 +0

GitHub
snappydata by TIBCOSoftware

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

created at Sept. 16, 2015, 10:36 a.m.

Scala

84 +0

1,037 +0

203 +0

GitHub
mysql_utils by pinterest

Pinterest MySQL Management Tools

created at Oct. 24, 2015, 5:33 p.m.

Python

72 +0

878 +0

146 +0

GitHub
snakebite by spotify

A pure Python HDFS client

created at May 7, 2013, 9:44 a.m.

Python

130 +0

858 +0

216 +0

GitHub
heroic by spotify

The Heroic Time Series Database

created at May 29, 2015, 5:20 a.m.

Java

58 +0

843 +0

109 +0

GitHub
Akumuli by akumuli

Time-series database

created at Jan. 28, 2014, 9:31 p.m.

C++

44 +0

837 +0

86 +0

GitHub
nessie by projectnessie

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

created at April 9, 2020, 6:39 p.m.

Java

28 +0

831 +10

115 -1

GitHub
dalmatinerdb by dalmatinerdb

See GitLab: https://gitlab.com/Project-FiFo/DalmatinerDB/dalmatinerdb

created at June 13, 2014, 7:08 p.m.

Erlang

37 +0

697 +0

44 +0

GitHub
hstream by hstreamdb

HStreamDB is an open-source, cloud-native streaming database for IoT and beyond. Modernize your data stack for real-time applications.

created at Aug. 31, 2020, 9:42 a.m.

Haskell

23 +0

691 +0

56 +0

GitHub
gockerize by redbooth

Package Go services into minimal Docker containers.

created at Aug. 4, 2015, 2:02 a.m.

Shell

66 +0

667 +0

20 +0

GitHub