docker-logstash by pblittle

Docker image for Logstash 1.4

created at Feb. 4, 2014, 1:09 a.m.

Shell

7 +0

237 +0

90 +0

GitHub
blueflood by rax-maas

A distributed system designed to ingest and process time series data

created at May 15, 2013, 2:50 p.m.

Java

95 +0

592 +0

102 +0

GitHub
heroic by spotify

The Heroic Time Series Database

created at May 29, 2015, 5:20 a.m.

Java

58 +0

845 +2

109 +0

GitHub
timely by NationalSecurityAgency

Accumulo backed time series database

created at April 12, 2016, 9:33 p.m.

CSS

51 +0

376 +1

110 +0

GitHub
nessie by projectnessie

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

created at April 9, 2020, 6:39 p.m.

Java

27 +0

858 +7

116 +0

GitHub
incubator-hivemall by apache

Mirror of Apache Hivemall (incubating)

created at Sept. 15, 2016, 7 a.m.

Java

32 +0

310 +0

119 +0

GitHub
datacompy by capitalone

Pandas and Spark DataFrame comparison for humans and more!

created at March 23, 2018, 1:16 p.m.

Python

25 +0

397 +3

122 +0

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

487 +0

125 +0

GitHub
mysql_utils by pinterest

Pinterest MySQL Management Tools

created at Oct. 24, 2015, 5:33 p.m.

Python

72 +0

879 +0

141 +0

GitHub
pinball by pinterest

Pinball is a scalable workflow manager

created at March 4, 2015, 3:13 a.m.

JavaScript

54 +0

1,047 +0

143 +0

GitHub
bottledwater-pg by confluentinc

Change data capture from PostgreSQL into Kafka

created at Feb. 11, 2015, 6:02 p.m.

C

368 +0

1,525 +0

155 +0

GitHub
DataProfiler by capitalone

What's in your data? Extract schema, statistics and entities from datasets

created at Nov. 9, 2020, 3:20 p.m.

Python

21 +0

1,369 +5

156 +0

GitHub
HyperDex by rescrv

HyperDex is a scalable, searchable key-value store

created at Feb. 20, 2012, 11:32 a.m.

C++

88 +0

1,395 +1

168 +0

GitHub
faust by faust-streaming

Python Stream Processing. A Faust fork

created at Oct. 22, 2020, 3:32 p.m.

Python

28 +0

1,470 +3

173 +1

GitHub
memdb by rain1017

Distributed Transactional In-Memory Database (全球首个支持分布式事务的MongoDB)

created at Feb. 18, 2015, 7:35 a.m.

JavaScript

44 +0

597 +0

195 +0

GitHub
snappydata by TIBCOSoftware

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

created at Sept. 16, 2015, 10:36 a.m.

Scala

84 +0

1,037 +0

203 +0

GitHub
zombodb by zombodb

Making Postgres and Elasticsearch work together like it's 2023

created at July 17, 2015, 4:53 p.m.

PLpgSQL

95 +0

4,619 +5

211 +0

GitHub
snakebite by spotify

A pure python HDFS client

created at May 7, 2013, 9:44 a.m.

Python

128 +0

858 +0

216 +0

GitHub
haproxy_exporter by prometheus

Simple server that scrapes HAProxy stats and exports them via HTTP for Prometheus consumption

created at Jan. 31, 2013, 3:33 p.m.

Go

30 +0

610 +1

219 +0

GitHub
FiloDB by filodb

Distributed Prometheus time series database

created at Jan. 14, 2015, 6:35 p.m.

Scala

89 +0

1,413 +0

224 +0

GitHub