Sparkling Water provides H2O functionality inside a Spark cluster
created at Oct. 13, 2014, 11:06 p.m.
scikit-learn: machine learning in Python
created at Aug. 17, 2010, 9:43 a.m.
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
created at July 25, 2016, 9:47 a.m.
(Deprecated) Scikit-learn integration package for Apache Spark
created at Sept. 2, 2015, 6:44 p.m.
An implementation of DBSCAN running on top of Apache Spark
created at March 15, 2015, 12:45 a.m.
Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.
created at Oct. 28, 2014, 9:33 p.m.
Apache Spark datasource for OrientDB
created at Oct. 31, 2016, 2:51 p.m.
The official Riak Spark Connector for Apache Spark with Riak TS and Riak KV
created at May 7, 2015, 7:22 p.m.
PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)
created at Nov. 29, 2015, 10:03 a.m.
DataStax Connector for Apache Spark to Apache Cassandra
created at June 27, 2014, 3:45 p.m.
XML data source for Spark SQL and DataFrames
created at Nov. 26, 2015, 2:46 a.m.
Jupyter magics and kernels for working with remote Spark clusters
created at Sept. 21, 2015, 3:35 p.m.
A library for time series analysis on Apache Spark
created at March 11, 2015, 8:14 a.m.