iceberg by apache

Apache Iceberg

created at Nov. 19, 2018, 4:26 p.m.

Java

162 +2

6,527 +30

2,253 +8

GitHub
hudi by apache

Upserts, Deletes And Incremental Processing on Big Data.

created at Dec. 14, 2016, 3:53 p.m.

Java

1,162 -2

5,469 +15

2,433 +6

GitHub
sedona by apache

A cluster computing framework for processing large-scale geospatial data

created at April 24, 2015, 6:01 p.m.

Java

95 +0

1,964 +5

692 -1

GitHub
oryx by OryxProject

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

created at July 25, 2014, 8:08 p.m.

Java

208 +0

1,787 +0

405 +0

GitHub
mongo-spark by mongodb

The MongoDB Spark Connector

created at May 20, 2015, 5:59 p.m.

Java

79 +0

713 +0

311 +1

GitHub
jpmml-evaluator-spark by jpmml

PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)

created at Nov. 29, 2015, 10:03 a.m.

Java

14 +0

94 +0

43 +0

GitHub