Privacy- and security-focused Segment alternative, in Golang and React
created at July 19, 2019, 9:24 a.m.
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
created at Oct. 26, 2020, 1:56 p.m.
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
created at Sept. 16, 2015, 10:36 a.m.
Kyoto Tycoon key-value store (and the underlying Kyoto Cabinet library)
created at Dec. 24, 2014, 5:55 p.m.
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lakes, for billions of files! Blob store has O(1) disk seek and cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
created at July 14, 2014, 4:41 p.m.
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
created at Feb. 26, 2019, 1:39 a.m.
Utils for streaming large files (S3, HDFS, gzip, bz2...)
created at Jan. 2, 2015, 1:05 p.m.
Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you programmatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain ol' Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.
created at Oct. 18, 2023, 12:49 p.m.
What's in your data? Extract schema, statistics and entities from datasets
created at Nov. 9, 2020, 3:20 p.m.