Nomad is an easy-to-use, flexible, and performant workload orchestrator that can deploy a mix of microservice, batch, containerized, and non-containerized applications. Nomad is easy to operate and scale and has native Consul and Vault integrations.
updated at May 19, 2024, 4:24 p.m.
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lakes, built for billions of files. The blob store has O(1) disk seek and cloud tiering. The filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, and Erasure Coding.
updated at May 19, 2024, 1:51 p.m.
An orchestration platform for the development, production, and observation of data assets.
updated at May 19, 2024, 11:30 a.m.
Pandas and Spark DataFrame comparison for humans and more!
updated at May 19, 2024, 5:17 a.m.
Protocol Buffers - Google's data interchange format
updated at May 19, 2024, 4:23 a.m.
The Prometheus monitoring system and time series database.
updated at May 19, 2024, 12:37 a.m.
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you programmatically create and apply a data policy to a processing platform like Databricks, Snowflake, or BigQuery (or even plain ol' Postgres!) with definitions imported from Collibra, Datahub, ODD, and the like.
updated at May 19, 2024, 12:02 a.m.
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
updated at May 18, 2024, 11:06 p.m.
Scalable datastore for metrics, events, and real-time analytics
updated at May 18, 2024, 10:21 p.m.