Nessie: Transactional Catalog for Data Lakes with Git-like semantics
updated at May 4, 2024, 9:33 a.m.
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.
updated at May 3, 2024, 4:46 p.m.
Mirror of Apache Hivemall (incubating)
updated at April 6, 2024, 6:43 a.m.
Connecting Apache Spark with different data stores [DEPRECATED]
updated at Jan. 1, 2024, 6:17 p.m.