FiloDB by filodb

Distributed Prometheus time series database

created at Jan. 14, 2015, 6:35 p.m.

Scala

89 +0

1,413 +0

223 +0

GitHub
blueflood by rax-maas

A distributed system designed to ingest and process time series data

created at May 15, 2013, 2:50 p.m.

Java

95 +0

592 +0

102 +0

GitHub
zombodb by zombodb

Making Postgres and Elasticsearch work together like it's 2023

created at July 17, 2015, 4:53 p.m.

PLpgSQL

95 +0

4,609 -1

210 +1

GitHub
kafka-node by SOHU-Co

Node.js client for Apache Kafka 0.8 and later.

created at Oct. 23, 2013, 3:34 a.m.

JavaScript

99 +0

2,659 +1

630 +0

GitHub
pipelinedb by pipelinedb

High-performance time-series aggregation for PostgreSQL

created at Nov. 26, 2013, 12:11 a.m.

C

106 +0

2,615 +1

240 +2

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

486 +0

126 +0

GitHub
dagster by dagster-io

An orchestration platform for the development, production, and observation of data assets.

created at April 30, 2018, 4:30 p.m.

Python

114 +1

10,282 +59

1,277 +13

GitHub
deep-spark by Stratio

Connecting Apache Spark with different data stores [DEPRECATED]

created at Feb. 18, 2014, 8:34 a.m.

Java

115 +0

197 +0

42 +0

GitHub
kairosdb by kairosdb

Fast scalable time series database

created at Feb. 5, 2013, 10:27 p.m.

Java

118 +0

1,726 +2

344 -1

GitHub
snakebite by spotify

A pure python HDFS client

created at May 7, 2013, 9:44 a.m.

Python

129 -1

858 +0

216 +0

GitHub
Gaffer by gchq

A large-scale entity and relation database supporting aggregation of properties

created at Dec. 14, 2015, 12:12 p.m.

Java

142 +0

1,734 +1

354 +0

GitHub
kafka-docker by wurstmeister

Dockerfile for Apache Kafka

created at Dec. 23, 2013, 10:01 p.m.

Shell

160 +0

6,848 +7

2,717 -2

GitHub
gobblin by apache

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

created at Dec. 1, 2014, 6:10 p.m.

Java

167 +0

2,190 +0

742 +0

GitHub
flocker by ClusterHQ

Container data volume manager for your Dockerized application

created at April 28, 2014, 6:02 p.m.

Python

168 +0

3,376 +0

285 -1

GitHub
snappy by google

A fast compressor/decompressor

created at March 3, 2014, 9:58 p.m.

C++

195 +0

5,995 +6

969 +1

GitHub
heka by mozilla-services

DEPRECATED: Data collection and processing made easy.

created at Oct. 16, 2012, 5:20 p.m.

Go

204 +0

3,399 +0

531 +0

GitHub
rqlite by rqlite

The lightweight, distributed relational database built on SQLite.

created at Aug. 23, 2014, 4:31 a.m.

Go

228 +0

14,909 +33

681 +1

GitHub
elasticsearch-jdbc by jprante

JDBC importer for Elasticsearch

created at June 2, 2012, 11:17 p.m.

Java

231 +0

2,838 +0

711 -1

GitHub
weave by weaveworks

Simple, resilient multi-host containers networking and more.

created at Aug. 18, 2014, 5:19 a.m.

Go

237 +0

6,584 +4

662 -1

GitHub
kafkat by airbnb

KafkaT-ool

created at Aug. 14, 2014, 10:09 p.m.

Ruby

243 +0

503 -1

86 +0

GitHub