sarah-palin-lda by wavelets

Topic Modeling the Sarah Palin emails.

created at March 27, 2014, 7:49 p.m.

Scala

3 +0

10 +0

2 +0

GitHub
xerial by xerial

Data management utilities for Scala

created at July 6, 2012, 6:36 a.m.

Scala

6 +0

18 +0

2 +0

GitHub
NDScala by SciScala

N-dimensional / multi-dimensional arrays (tensors) in Scala 3. Think NumPy ndarray / PyTorch Tensor but type-safe over shapes, array/axis labels & numeric data types

created at June 14, 2020, 1:57 p.m.

Scala

7 +0

47 +0

6 +0

GitHub
upshot-montague by Workday

Montague is a little CCG semantic parsing library for Scala.

created at March 21, 2016, 10:51 p.m.

Scala

15 +0

59 +0

8 +0

GitHub
onnx-scala by EmergentOrder

An ONNX (Open Neural Network eXchange) API and backend for typeful, functional deep learning and classical machine learning in Scala 3

created at Aug. 15, 2018, 12:20 a.m.

Scala

8 +0

136 +0

9 +0

GitHub
ganitha by tresata

scalding powered machine learning

created at Aug. 21, 2013, 3:32 p.m.

Scala

17 +0

109 +0

12 +0

GitHub
saul by CogComp

Saul : Declarative Learning-Based Programming

created at Sept. 30, 2015, 10:21 p.m.

Scala

14 +0

63 +0

17 +0

GitHub
bioscala by bioscala

Bioinformatics for the Scala programming language

created at April 23, 2010, 9:38 p.m.

Scala

16 +0

107 +0

20 +0

GitHub
doddle-model by picnicml

cake doddle-model: machine learning in Scala.

created at Feb. 9, 2018, 1:54 p.m.

Scala

14 +0

138 +0

23 +0

GitHub
chalk by scalanlp

Chalk is a natural language processing library.

created at Dec. 2, 2012, 5:45 a.m.

Scala

29 +0

258 +0

49 +0

GitHub
delight by datamechanics

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

created at Oct. 26, 2020, 1:56 p.m.

Scala

16 +0

334 +0

50 +0

GitHub
brushfire by stripe-archive

Distributed decision tree ensemble learning in Scala

created at Nov. 20, 2014, 6:47 p.m.

Scala

95 +0

394 +0

50 +0

GitHub
DynaML by transcendent-ai-labs

Scala Library/REPL for Machine Learning Research

created at Feb. 16, 2015, 3:22 p.m.

Scala

19 +0

198 +0

51 +0

GitHub
mist by Hydrospheredata

Serverless proxy for Spark cluster

created at Jan. 15, 2016, 7:22 a.m.

Scala

41 +0

326 +0

67 +0

GitHub
BIDMat by BIDData

A CPU and GPU-accelerated matrix library for data mining

created at Oct. 17, 2012, 11:19 p.m.

Scala

45 +0

264 +0

73 +0

GitHub
tensorflow_scala by eaplatanios

TensorFlow API for the Scala Programming Language

created at April 1, 2017, 6 p.m.

Scala

67 +0

934 +1

96 +0

GitHub
factorie by factorie

FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.

created at June 25, 2013, 1:21 a.m.

Scala

70 +0

554 +0

145 +0

GitHub
BIDMach by BIDData

CPU and GPU-accelerated Machine Learning Library

created at Oct. 22, 2012, 3:17 a.m.

Scala

88 +0

913 +0

172 +0

GitHub
summingbird by twitter

Streaming MapReduce with Scalding and Storm

created at Sept. 25, 2012, 10:38 p.m.

Scala

292 +0

2,138 +2

267 +0

GitHub
adam by bigdatagenomics

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

created at Nov. 19, 2013, 11:47 p.m.

Scala

100 +0

967 +0

304 +0

GitHub