Stanford CoreNLP wrapper for Apache Spark
created at Aug. 21, 2015, 8:54 p.m.
The official Riak Spark Connector for Apache Spark with Riak TS and Riak KV
created at May 7, 2015, 7:22 p.m.
Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments
created at April 17, 2015, 7:39 p.m.
An implementation of DBSCAN runing on top of Apache Spark
created at March 15, 2015, 12:45 a.m.
A library for time series analysis on Apache Spark
created at March 11, 2015, 8:14 a.m.
Base classes to use when writing tests with Spark
created at Jan. 30, 2015, 10:23 p.m.
Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.
created at Oct. 28, 2014, 9:33 p.m.
Sparkling Water provides H2O functionality inside Spark cluster
created at Oct. 13, 2014, 11:06 p.m.
Interactive and Reactive Data Science using Scala and Spark.
created at Sept. 5, 2014, 7:35 p.m.
REST job server for Apache Spark
created at Aug. 21, 2014, 11:07 p.m.
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
created at July 25, 2014, 8:08 p.m.