State-of-the-Art Natural Language Processing
updated at Nov. 29, 2024, 6:58 p.m.
A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.
updated at Nov. 29, 2024, 4:34 p.m.
FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
updated at Nov. 29, 2024, 1:31 p.m.
PredictionIO, a machine learning server for developers and ML engineers.
updated at Nov. 28, 2024, 4:31 p.m.
Streaming MapReduce with Scalding and Storm
updated at Nov. 28, 2024, 4:30 p.m.
Sparkling Water provides H2O functionality inside a Spark cluster
updated at Nov. 26, 2024, 5:11 p.m.
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
updated at Nov. 4, 2024, 1:06 a.m.
TensorFlow API for the Scala Programming Language
updated at Oct. 28, 2024, 9:40 p.m.
Distributed decision tree ensemble learning in Scala
updated at Oct. 21, 2024, 11:46 a.m.
Scala Library/REPL for Machine Learning Research
updated at Oct. 7, 2024, 8:47 p.m.
Topic Modeling the Sarah Palin emails.
updated at Sept. 30, 2024, 10:37 p.m.