dagster by dagster-io

An orchestration platform for the development, production, and observation of data assets.

updated at May 19, 2024, 11:30 a.m.

Python

115 +0

10,389 +54

1,294 +9

seaweedfs by seaweedfs

SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lakes, for billions of files. The blob store has O(1) disk seek and cloud tiering. The filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, and erasure coding.

updated at May 19, 2024, 1:51 p.m.

Go

535 +0

21,250 +63

2,183 +7

lakeFS by treeverse

lakeFS - Data version control for your data lake | Git for data

updated at May 19, 2024, 2:44 p.m.

Go

40 +0

4,096 +7

329 +0

nomad by hashicorp

Nomad is an easy-to-use, flexible, and performant workload orchestrator that can deploy a mix of microservice, batch, containerized, and non-containerized applications. Nomad is easy to operate and scale and has native Consul and Vault integrations.

updated at May 19, 2024, 4:24 p.m.

Go

536 +0

14,471 +8

1,897 +0
