Streaming MapReduce with Scalding and Storm
created at Sept. 25, 2012, 10:38 p.m.
PredictionIO, a machine learning server for developers and ML engineers.
created at Jan. 25, 2013, 7:42 p.m.
FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
created at June 25, 2013, 1:21 a.m.
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
created at Nov. 19, 2013, 11:47 p.m.
Topic Modeling the Sarah Palin emails.
created at March 27, 2014, 7:49 p.m.
Sparkling Water provides H2O functionality inside Spark cluster
created at Oct. 13, 2014, 11:06 p.m.
Distributed decision tree ensemble learning in Scala
created at Nov. 20, 2014, 6:47 p.m.
Scala Library/REPL for Machine Learning Research
created at Feb. 16, 2015, 3:22 p.m.