protobuf by protocolbuffers

Protocol Buffers - Google's data interchange format

created at Aug. 26, 2014, 3:52 p.m.

C++

2,057 -4

63,274 +99

15,209 +7

GitHub
superset by apache

Apache Superset is a Data Visualization and Data Exploration Platform

created at July 21, 2015, 6:55 p.m.

TypeScript

1,498 +5

57,784 +275

12,335 +54

GitHub
prometheus by prometheus

The Prometheus monitoring system and time series database.

created at Nov. 24, 2012, 11:14 a.m.

Go

1,127 -1

52,307 +100

8,691 +18

GitHub
metabase by metabase

The simplest, fastest way to get business intelligence and analytics to everyone in your company yum

created at Feb. 2, 2015, 7:25 p.m.

Clojure

640 +1

36,128 +56

4,817 +13

GitHub
tidb by pingcap

TiDB is an open-source, cloud-native, distributed, MySQL-Compatible database for elastic scale and real-time analytics. Try AI-powered Chat2Query free at : https://tidbcloud.com/free-trial

created at Sept. 6, 2015, 4:01 a.m.

Go

1,270 -2

35,942 +54

5,689 +5

GitHub
airflow by apache

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

created at April 13, 2015, 6:04 p.m.

Python

756 -1

33,981 +86

13,425 +20

GitHub
influxdb by influxdata

Scalable datastore for metrics, events, and real-time analytics

created at Sept. 26, 2013, 2:31 p.m.

Rust

742 +0

27,518 +50

3,471 +1

GitHub
seaweedfs by seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.

created at July 14, 2014, 4:41 p.m.

Go

537 -2

20,795 +71

2,148 +13

GitHub
dash by plotly

Data Apps & Dashboards for Python. No JavaScript Required.

created at April 10, 2015, 1:53 a.m.

Python

417 +0

20,338 +52

1,975 +3

GitHub
luigi by spotify

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

created at Sept. 20, 2012, 3:06 p.m.

Python

476 +0

17,217 +16

2,360 -1

GitHub
cadvisor by google

Analyzes resource usage and performance characteristics of running containers.

created at June 9, 2014, 4:36 p.m.

Go

392 +0

16,190 +27

2,263 +3

GitHub
rqlite by rqlite

The lightweight, distributed relational database built on SQLite.

created at Aug. 23, 2014, 4:31 a.m.

Go

230 +0

14,750 +37

676 +3

GitHub
cayley by cayleygraph

An open-source graph database

created at June 5, 2014, 6:49 p.m.

Go

578 -1

14,745 +1

1,248 +0

GitHub
nomad by hashicorp

Nomad is an easy-to-use, flexible, and performant workload orchestrator that can deploy a mix of microservice, batch, containerized, and non-containerized applications. Nomad is easy to operate and scale and has native Consul and Vault integrations.

created at June 1, 2015, 10:21 a.m.

Go

537 +0

14,338 +14

1,868 +5

GitHub
druid by apache

Apache Druid: a high performance real-time analytics database.

created at Oct. 23, 2012, 7:08 p.m.

Java

595 +1

13,150 +7

3,629 +0

GitHub
scylladb by scylladb

NoSQL data store using the seastar framework, compatible with Apache Cassandra

created at Dec. 24, 2014, 1:16 p.m.

C++

341 +0

12,344 +39

1,189 +4

GitHub
CMAK by yahoo

CMAK is a tool for managing Apache Kafka clusters

created at Jan. 28, 2015, 6:33 p.m.

Scala

534 +1

11,646 +7

2,492 +0

GitHub
dagster by dagster-io

An orchestration platform for the development, production, and observation of data assets.

created at April 30, 2018, 4:30 p.m.

Python

111 +0

9,953 +80

1,230 +10

GitHub
librdkafka by confluentinc

The Apache Kafka C/C++ library

created at Sept. 19, 2012, 10:14 a.m.

C

411 +0

7,235 +14

3,096 +0

GitHub
kafka-docker by wurstmeister

Dockerfile for Apache Kafka

created at Dec. 23, 2013, 10:01 p.m.

Shell

160 +0

6,822 +9

2,719 +0

GitHub