Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.
updated at June 2, 2024, 9:41 a.m.
An orchestration platform for the development, production, and observation of data assets.
updated at June 2, 2024, 9:02 a.m.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
updated at June 2, 2024, 6:11 a.m.
Scalable datastore for metrics, events, and real-time analytics
updated at June 2, 2024, 5:51 a.m.
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
updated at June 2, 2024, 5:49 a.m.
The Prometheus monitoring system and time series database.
updated at June 2, 2024, 5:27 a.m.
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
updated at June 2, 2024, 5:07 a.m.
Nomad is an easy-to-use, flexible, and performant workload orchestrator that can deploy a mix of microservice, batch, containerized, and non-containerized applications. Nomad is easy to operate and scale and has native Consul and Vault integrations.
updated at June 2, 2024, 3:59 a.m.
🔥🔥🔥 Open Source Alternative to Hightouch, Census, and RudderStack - Reverse ETL & Customer Data Platform (CDP)
updated at June 2, 2024, 3:18 a.m.
Protocol Buffers - Google's data interchange format
updated at June 2, 2024, 2:07 a.m.
Privacy and Security focused Segment-alternative, in Golang and React
updated at June 2, 2024, 1:53 a.m.