iceberg by apache

Apache Iceberg

updated at Nov. 17, 2024, 8:29 p.m.

Java

160 +0

6,464 +20

2,235 +10

GitHub
sedona by apache

A cluster computing framework for processing large-scale geospatial data

updated at Nov. 17, 2024, 5:14 p.m.

Java

95 +0

1,956 +2

693 -2

GitHub
hudi by apache

Upserts, Deletes And Incremental Processing on Big Data.

updated at Nov. 17, 2024, 5:10 p.m.

Java

1,164 +1

5,436 +21

2,424 -1

GitHub
mongo-spark by mongodb

The MongoDB Spark Connector

updated at Nov. 5, 2024, 8:45 a.m.

Java

79 +0

712 +0

309 +0

GitHub
oryx by OryxProject

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

updated at Oct. 19, 2024, 8:54 a.m.

Java

208 +0

1,788 +0

405 +0

GitHub
jpmml-evaluator-spark by jpmml

PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)

updated at March 31, 2024, 2:17 p.m.

Java

14 +0

94 +0

43 +0

GitHub