docker-logstash by pblittle

Docker image for Logstash 1.4

created at Feb. 4, 2014, 1:09 a.m.

Shell

7 +0

237 +0

90 +0

GitHub
blueflood by rax-maas

A distributed system designed to ingest and process time series data

created at May 15, 2013, 2:50 p.m.

Java

95 +0

592 +0

102 +0

GitHub
heroic by spotify

The Heroic Time Series Database

created at May 29, 2015, 5:20 a.m.

Java

58 +0

843 +0

109 +0

GitHub
timely by NationalSecurityAgency

Accumulo backed time series database

created at April 12, 2016, 9:33 p.m.

CSS

51 +0

374 +0

110 +0

GitHub
nessie by projectnessie

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

created at April 9, 2020, 6:39 p.m.

Java

27 -1

841 +7

116 +1

GitHub
incubator-hivemall by apache

Mirror of Apache Hivemall (incubating)

created at Sept. 15, 2016, 7 a.m.

Java

32 +0

310 +0

119 +0

GitHub
datacompy by capitalone

Pandas and Spark DataFrame comparison for humans and more!

created at March 23, 2018, 1:16 p.m.

Python

25 +0

389 +6

122 +2

GitHub
eventsim by Interana

Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.

created at Sept. 5, 2014, 4:27 p.m.

Scala

111 +0

486 +0

126 +0

GitHub
mysql_utils by pinterest

Pinterest MySQL Management Tools

created at Oct. 24, 2015, 5:33 p.m.

Python

72 +0

879 +0

141 +0

GitHub
pinball by pinterest

Pinball is a scalable workflow manager

created at March 4, 2015, 3:13 a.m.

JavaScript

54 +0

1,047 +0

143 +0

GitHub
DataProfiler by capitalone

What's in your data? Extract schema, statistics and entities from datasets

created at Nov. 9, 2020, 3:20 p.m.

Python

21 +0

1,363 +1

154 +0

GitHub
bottledwater-pg by confluentinc

Change data capture from PostgreSQL into Kafka

created at Feb. 11, 2015, 6:02 p.m.

C

366 +0

1,524 -2

155 +0

GitHub
HyperDex by rescrv

HyperDex is a scalable, searchable key-value store

created at Feb. 20, 2012, 11:32 a.m.

C++

88 +0

1,394 +0

168 +0

GitHub
faust by faust-streaming

Python Stream Processing. A Faust fork

created at Oct. 22, 2020, 3:32 p.m.

Python

28 +0

1,465 +15

171 +1

GitHub
memdb by rain1017

Distributed Transactional In-Memory Database (全球首个支持分布式事务的MongoDB)

created at Feb. 18, 2015, 7:35 a.m.

JavaScript

44 +0

597 +0

195 +0

GitHub
snappydata by TIBCOSoftware

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

created at Sept. 16, 2015, 10:36 a.m.

Scala

84 +0

1,037 +0

203 +0

GitHub
zombodb by zombodb

Making Postgres and Elasticsearch work together like it's 2023

created at July 17, 2015, 4:53 p.m.

PLpgSQL

95 +0

4,609 -1

210 +1

GitHub
snakebite by spotify

A pure python HDFS client

created at May 7, 2013, 9:44 a.m.

Python

129 -1

858 +0

216 +0

GitHub
haproxy_exporter by prometheus

Simple server that scrapes HAProxy stats and exports them via HTTP for Prometheus consumption

created at Jan. 31, 2013, 3:33 p.m.

Go

30 +0

609 +0

219 +0

GitHub
FiloDB by filodb

Distributed Prometheus time series database

created at Jan. 14, 2015, 6:35 p.m.

Scala

89 +0

1,413 +0

223 +0

GitHub