FiloDB by filodb

Distributed Prometheus time series database

created at Jan. 14, 2015, 6:35 p.m.

Scala

89 +0

1,413 +0

223 +0

GitHub
smart_open by piskvorky

Utils for streaming large files (S3, HDFS, gzip, bz2...)

created at Jan. 2, 2015, 1:05 p.m.

Python

49 +0

3,094 +1

378 +0

GitHub
kyoto by AlticeLabsProjects

Kyoto Tycoon key-value store (and the underlying Kyoto Cabinet library)

created at Dec. 24, 2014, 5:55 p.m.

C++

29 +0

271 +0

40 +0

GitHub
scylladb by scylladb

NoSQL data store using the seastar framework, compatible with Apache Cassandra

created at Dec. 24, 2014, 1:16 p.m.

C++

340 +0

12,591 +32

1,208 +5

GitHub
gobblin by apache

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

created at Dec. 1, 2014, 6:10 p.m.

Java

167 +0

2,190 +0

742 +0

GitHub
kafka-logger by uber

A kafka logger for winston

created at Oct. 14, 2014, 10:14 p.m.

JavaScript

2,726 -1

45 +0

11 +0

GitHub
pg_kafka by xstevens

INACTIVE: A PostgreSQL extension to produce messages to Apache Kafka.

created at Sept. 5, 2014, 6:33 p.m.

C

9 +0

112 -1

15 +0

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

486 +0

126 +0

GitHub
protobuf by protocolbuffers

Protocol Buffers - Google's data interchange format

created at Aug. 26, 2014, 3:52 p.m.

C++

2,057 -2

63,739 +78

15,271 +10

GitHub
rqlite by rqlite

The lightweight, distributed relational database built on SQLite.

created at Aug. 23, 2014, 4:31 a.m.

Go

228 +0

14,909 +33

681 +1

GitHub
weave by weaveworks

Simple, resilient multi-host containers networking and more.

created at Aug. 18, 2014, 5:19 a.m.

Go

237 +0

6,584 +4

662 -1

GitHub
kafkat by airbnb

KafkaT-ool

created at Aug. 14, 2014, 10:09 p.m.

Ruby

243 +0

503 -1

86 +0

GitHub
seaweedfs by seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.

created at July 14, 2014, 4:41 p.m.

Go

537 +0

21,123 +47

2,172 +4

GitHub
dalmatinerdb by dalmatinerdb

See gitlab: https://gitlab.com/Project-FiFo/DalmatinerDB/dalmatinerdb

created at June 13, 2014, 7:08 p.m.

Erlang

37 +0

697 +0

44 +0

GitHub
cadvisor by google

Analyzes resource usage and performance characteristics of running containers.

created at June 9, 2014, 4:36 p.m.

Go

387 -2

16,363 +28

2,276 +1

GitHub
cayley by cayleygraph

An open-source graph database

created at June 5, 2014, 6:49 p.m.

Go

577 +0

14,775 +3

1,251 +0

GitHub
flocker by ClusterHQ

Container data volume manager for your Dockerized application

created at April 28, 2014, 6:02 p.m.

Python

168 +0

3,376 +0

285 -1

GitHub
secor by pinterest

Secor is a service implementing Kafka log persistence

created at April 15, 2014, 10:26 p.m.

Java

70 +0

1,835 +0

541 +0

GitHub
kcat by edenhill

Generic command line non-JVM Apache Kafka producer and consumer

created at March 30, 2014, 4:25 a.m.

C

79 +0

5,260 +14

473 +1

GitHub
snackfs-release by tuplejump

The GA Release of SnackFS

created at March 19, 2014, 6:17 a.m.

Scala

7 +0

13 +0

5 +0

GitHub