Data policy IN, dynamic view OUT: PACE is the Policy As Code Engine. It helps you programmatically create and apply a data policy to a processing platform like Databricks, Snowflake or BigQuery (or plain ol' Postgres, even!) with definitions imported from Collibra, Datahub, ODD and the like.
created at Oct. 18, 2023, 12:49 p.m.
Docker microservice for saving/restoring volume data to S3
created at March 21, 2015, 10:05 p.m.
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, and let DQOps run them daily to detect data quality issues.
created at March 8, 2022, 3:18 p.m.
🔥 Open Source Reverse ETL and Customer Data Platform (CDP). An open-source alternative to tools like Hightouch, Census, and RudderStack.
created at Oct. 20, 2023, 3:21 p.m.
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
created at Oct. 26, 2020, 1:56 p.m.
What's in your data? Extract schema, statistics and entities from datasets
created at Nov. 9, 2020, 3:20 p.m.
A lightweight tool for easy deployment and rollback of dockerized applications.
created at May 6, 2015, 6:32 p.m.
Pandas and Spark DataFrame comparison for humans and more!
created at March 23, 2018, 1:16 p.m.
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
created at April 9, 2020, 6:39 p.m.
Kyoto Tycoon key-value store (and the underlying Kyoto Cabinet library)
created at Dec. 24, 2014, 5:55 p.m.
Simple server that scrapes HAProxy stats and exports them via HTTP for Prometheus consumption
created at Jan. 31, 2013, 3:33 p.m.
Mirror of Apache Hivemall (incubating)
created at Sept. 15, 2016, 7 a.m.
See GitLab: https://gitlab.com/Project-FiFo/DalmatinerDB/dalmatinerdb
created at June 13, 2014, 7:08 p.m.