CMAK by yahoo

CMAK is a tool for managing Apache Kafka clusters

updated at May 5, 2024, 12:41 p.m.

Scala

534 +0

11,676 +4

2,496 +0

GitHub
scylladb by scylladb

NoSQL data store using the seastar framework, compatible with Apache Cassandra

updated at May 5, 2024, 12:18 p.m.

C++

340 +0

12,591 +32

1,208 +5

GitHub
lakeFS by treeverse

lakeFS - Data version control for your data lake | Git for data

updated at May 5, 2024, 11:08 a.m.

Go

40 +0

4,083 +17

329 +0

GitHub
faust by faust-streaming

Python Stream Processing. A Faust fork

updated at May 5, 2024, 11:03 a.m.

Python

28 +0

1,465 +15

171 +1

GitHub
druid by apache

Apache Druid: a high performance real-time analytics database.

updated at May 5, 2024, 9:16 a.m.

Java

592 -1

13,204 +9

3,637 +3

GitHub
cadvisor by google

Analyzes resource usage and performance characteristics of running containers.

updated at May 5, 2024, 8:26 a.m.

Go

387 -2

16,363 +28

2,276 +1

GitHub
rqlite by rqlite

The lightweight, distributed relational database built on SQLite.

updated at May 5, 2024, 5:59 a.m.

Go

228 +0

14,909 +33

681 +1

GitHub
influxdb by influxdata

Scalable datastore for metrics, events, and real-time analytics

updated at May 5, 2024, 4:50 a.m.

Rust

740 -1

27,808 +42

3,489 +5

GitHub
dagster by dagster-io

An orchestration platform for the development, production, and observation of data assets.

updated at May 5, 2024, 4:32 a.m.

Python

114 +1

10,282 +59

1,277 +13

GitHub
airflow by apache

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

updated at May 5, 2024, 4:18 a.m.

Python

755 +2

34,583 +70

13,573 +23

GitHub
zombodb by zombodb

Making Postgres and Elasticsearch work together like it's 2023

updated at May 5, 2024, 3:54 a.m.

PLpgSQL

95 +0

4,609 -1

210 +1

GitHub
cayley by cayleygraph

An open-source graph database

updated at May 5, 2024, 3:19 a.m.

Go

577 +0

14,775 +3

1,251 +0

GitHub
metabase by metabase

The simplest, fastest way to get business intelligence and analytics to everyone in your company yum

updated at May 5, 2024, 2:49 a.m.

Clojure

642 +0

36,613 +85

4,864 +11

GitHub
prometheus by prometheus

The Prometheus monitoring system and time series database.

updated at May 5, 2024, 2:43 a.m.

Go

1,125 -5

52,867 +88

8,754 +7

GitHub
pipelinedb by pipelinedb

High-performance time-series aggregation for PostgreSQL

updated at May 5, 2024, 2:14 a.m.

C

106 +0

2,615 +1

240 +2

GitHub
luigi by spotify

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

updated at May 5, 2024, 1:07 a.m.

Python

474 -1

17,342 +24

2,373 +1

GitHub
superset by apache

Apache Superset is a Data Visualization and Data Exploration Platform

updated at May 4, 2024, 11:57 p.m.

TypeScript

1,498 +3

58,954 +104

12,607 +46

GitHub
dash by plotly

Data Apps & Dashboards for Python. No JavaScript Required.

updated at May 4, 2024, 11:48 p.m.

Python

418 +1

20,535 +37

1,992 +5

GitHub
librdkafka by confluentinc

The Apache Kafka C/C++ library

updated at May 4, 2024, 10:34 p.m.

C

409 +1

7,297 +4

3,109 +0

GitHub
seaweedfs by seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.

updated at May 4, 2024, 9:16 p.m.

Go

537 +0

21,123 +47

2,172 +4

GitHub