PyHive by dropbox

Python interface to Hive and Presto. 🐝

created at Feb. 1, 2014, 9:05 a.m.

Python

62 +0

1,665 +0

552 +0

GitHub
docker-logstash by pblittle

Docker image for Logstash 1.4

created at Feb. 4, 2014, 1:09 a.m.

Shell

7 +0

237 +0

90 +0

GitHub
deep-spark by Stratio

Connecting Apache Spark with different data stores [DEPRECATED]

created at Feb. 18, 2014, 8:34 a.m.

Java

115 +0

197 +0

42 +0

GitHub
snappy by google

A fast compressor/decompressor

created at March 3, 2014, 9:58 p.m.

C++

195 +0

5,995 +6

969 +1

GitHub
snackfs-release by tuplejump

The GA Release of SnackFS

created at March 19, 2014, 6:17 a.m.

Scala

7 +0

13 +0

5 +0

GitHub
kcat by edenhill

Generic command line non-JVM Apache Kafka producer and consumer

created at March 30, 2014, 4:25 a.m.

C

79 +0

5,260 +14

473 +1

GitHub
secor by pinterest

Secor is a service implementing Kafka log persistence

created at April 15, 2014, 10:26 p.m.

Java

70 +0

1,835 +0

541 +0

GitHub
flocker by ClusterHQ

Container data volume manager for your Dockerized application

created at April 28, 2014, 6:02 p.m.

Python

168 +0

3,376 +0

285 -1

GitHub
cayley by cayleygraph

An open-source graph database

created at June 5, 2014, 6:49 p.m.

Go

577 +0

14,775 +3

1,251 +0

GitHub
cadvisor by google

Analyzes resource usage and performance characteristics of running containers.

created at June 9, 2014, 4:36 p.m.

Go

387 -2

16,363 +28

2,276 +1

GitHub
dalmatinerdb by dalmatinerdb

See gitlab: https://gitlab.com/Project-FiFo/DalmatinerDB/dalmatinerdb

created at June 13, 2014, 7:08 p.m.

Erlang

37 +0

697 +0

44 +0

GitHub
seaweedfs by seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.

created at July 14, 2014, 4:41 p.m.

Go

537 +0

21,123 +47

2,172 +4

GitHub
kafkat by airbnb

KafkaT-ool

created at Aug. 14, 2014, 10:09 p.m.

Ruby

243 +0

503 -1

86 +0

GitHub
weave by weaveworks

Simple, resilient multi-host containers networking and more.

created at Aug. 18, 2014, 5:19 a.m.

Go

237 +0

6,584 +4

662 -1

GitHub
rqlite by rqlite

The lightweight, distributed relational database built on SQLite.

created at Aug. 23, 2014, 4:31 a.m.

Go

228 +0

14,909 +33

681 +1

GitHub
protobuf by protocolbuffers

Protocol Buffers - Google's data interchange format

created at Aug. 26, 2014, 3:52 p.m.

C++

2,057 -2

63,739 +78

15,271 +10

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

486 +0

126 +0

GitHub
pg_kafka by xstevens

INACTIVE: A PostgreSQL extension to produce messages to Apache Kafka.

created at Sept. 5, 2014, 6:33 p.m.

C

9 +0

112 -1

15 +0

GitHub
kafka-logger by uber

A kafka logger for winston

created at Oct. 14, 2014, 10:14 p.m.

JavaScript

2,726 -1

45 +0

11 +0

GitHub
gobblin by apache

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

created at Dec. 1, 2014, 6:10 p.m.

Java

167 +0

2,190 +0

742 +0

GitHub