opentsdb by OpenTSDB

A scalable, distributed Time Series Database.

created at Aug. 27, 2010, 2:05 a.m.

Java

337 +0

4,951 +2

1,253 +0

GitHub
kryo by EsotericSoftware

Java binary serialization and cloning: fast, efficient, automatic

created at Nov. 6, 2013, 1:24 p.m.

HTML

296 -1

6,080 +6

817 +0

GitHub
flockdb by twitter-archive

A distributed, fault-tolerant graph database

created at April 12, 2010, 3:53 a.m.

Scala

279 +0

3,330 +0

273 +0

GitHub
pyxley by stitchfix

Python helpers for building dashboards using Flask and React

created at June 22, 2015, 10:23 p.m.

JavaScript

277 +0

2,275 -1

258 +0

GitHub
kafkat by airbnb

KafkaT-ool

created at Aug. 14, 2014, 10:09 p.m.

Ruby

243 +0

503 -1

86 +0

GitHub
weave by weaveworks

Simple, resilient multi-host containers networking and more.

created at Aug. 18, 2014, 5:19 a.m.

Go

237 +0

6,584 +4

662 -1

GitHub
elasticsearch-jdbc by jprante

JDBC importer for Elasticsearch

created at June 2, 2012, 11:17 p.m.

Java

231 +0

2,838 +0

711 -1

GitHub
rqlite by rqlite

The lightweight, distributed relational database built on SQLite.

created at Aug. 23, 2014, 4:31 a.m.

Go

228 +0

14,909 +33

681 +1

GitHub
heka by mozilla-services

DEPRECATED: Data collection and processing made easy.

created at Oct. 16, 2012, 5:20 p.m.

Go

204 +0

3,399 +0

531 +0

GitHub
snappy by google

A fast compressor/decompressor

created at March 3, 2014, 9:58 p.m.

C++

195 +0

5,995 +6

969 +1

GitHub
flocker by ClusterHQ

Container data volume manager for your Dockerized application

created at April 28, 2014, 6:02 p.m.

Python

168 +0

3,376 +0

285 -1

GitHub
gobblin by apache

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

created at Dec. 1, 2014, 6:10 p.m.

Java

167 +0

2,190 +0

742 +0

GitHub
kafka-docker by wurstmeister

Dockerfile for Apache Kafka

created at Dec. 23, 2013, 10:01 p.m.

Shell

160 +0

6,848 +7

2,717 -2

GitHub
Gaffer by gchq

A large-scale entity and relation database supporting aggregation of properties

created at Dec. 14, 2015, 12:12 p.m.

Java

142 +0

1,734 +1

354 +0

GitHub
snakebite by spotify

A pure python HDFS client

created at May 7, 2013, 9:44 a.m.

Python

129 -1

858 +0

216 +0

GitHub
kairosdb by kairosdb

Fast scalable time series database

created at Feb. 5, 2013, 10:27 p.m.

Java

118 +0

1,726 +2

344 -1

GitHub
deep-spark by Stratio

Connecting Apache Spark with different data stores [DEPRECATED]

created at Feb. 18, 2014, 8:34 a.m.

Java

115 +0

197 +0

42 +0

GitHub
dagster by dagster-io

An orchestration platform for the development, production, and observation of data assets.

created at April 30, 2018, 4:30 p.m.

Python

114 +1

10,282 +59

1,277 +13

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

486 +0

126 +0

GitHub
pipelinedb by pipelinedb

High-performance time-series aggregation for PostgreSQL

created at Nov. 26, 2013, 12:11 a.m.

C

106 +0

2,615 +1

240 +2

GitHub