snappy by google

A fast compressor/decompressor

created at March 3, 2014, 9:58 p.m.

C++

195 +0

5,995 +6

969 +1

deep-spark by Stratio

Connecting Apache Spark with different data stores [DEPRECATED]

created at Feb. 18, 2014, 8:34 a.m.

Java

115 +0

197 +0

42 +0

docker-logstash by pblittle

Docker image for Logstash 1.4

created at Feb. 4, 2014, 1:09 a.m.

Shell

7 +0

237 +0

90 +0

PyHive by dropbox

Python interface to Hive and Presto. 🐝

created at Feb. 1, 2014, 9:05 a.m.

Python

62 +0

1,665 +0

552 +0

Akumuli by akumuli

Time-series database

created at Jan. 28, 2014, 9:31 p.m.

C++

44 +0

838 +1

86 +0

kafka-docker by wurstmeister

Dockerfile for Apache Kafka

created at Dec. 23, 2013, 10:01 p.m.

Shell

160 +0

6,848 +7

2,717 -2

pipelinedb by pipelinedb

High-performance time-series aggregation for PostgreSQL

created at Nov. 26, 2013, 12:11 a.m.

C

106 +0

2,615 +1

240 +2

kryo by EsotericSoftware

Java binary serialization and cloning: fast, efficient, automatic

created at Nov. 6, 2013, 1:24 p.m.

HTML

296 -1

6,080 +6

817 +0

kafka-node by SOHU-Co

Node.js client for Apache Kafka 0.8 and later.

created at Oct. 23, 2013, 3:34 a.m.

JavaScript

99 +0

2,659 +1

630 +0

influxdb by influxdata

Scalable datastore for metrics, events, and real-time analytics

created at Sept. 26, 2013, 2:31 p.m.

Rust

740 -1

27,808 +42

3,489 +5

blueflood by rax-maas

A distributed system designed to ingest and process time series data

created at May 15, 2013, 2:50 p.m.

Java

95 +0

592 +0

102 +0

snakebite by spotify

A pure python HDFS client

created at May 7, 2013, 9:44 a.m.

Python

129 -1

858 +0

216 +0

kairosdb by kairosdb

Fast scalable time series database

created at Feb. 5, 2013, 10:27 p.m.

Java

118 +0

1,726 +2

344 -1

haproxy_exporter by prometheus

Simple server that scrapes HAProxy stats and exports them via HTTP for Prometheus consumption

created at Jan. 31, 2013, 3:33 p.m.

Go

30 +0

609 +0

219 +0

prometheus by prometheus

The Prometheus monitoring system and time series database.

created at Nov. 24, 2012, 11:14 a.m.

Go

1,125 -5

52,867 +88

8,754 +7

druid by apache

Apache Druid: a high performance real-time analytics database.

created at Oct. 23, 2012, 7:08 p.m.

Java

592 -1

13,204 +9

3,637 +3

heka by mozilla-services

DEPRECATED: Data collection and processing made easy.

created at Oct. 16, 2012, 5:20 p.m.

Go

204 +0

3,399 +0

531 +0

luigi by spotify

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

created at Sept. 20, 2012, 3:06 p.m.

Python

474 -1

17,342 +24

2,373 +1

librdkafka by confluentinc

The Apache Kafka C/C++ library

created at Sept. 19, 2012, 10:14 a.m.

C

409 +1

7,297 +4

3,109 +0

elasticsearch-jdbc by jprante

JDBC importer for Elasticsearch

created at June 2, 2012, 11:17 p.m.

Java

231 +0

2,838 +0

711 -1
