spark by apache

Apache Spark - A unified analytics engine for large-scale data processing

updated at May 5, 2024, 12:33 a.m.

Scala

2,031 +0

38,419 +43

27,958 +14

GitHub
spark-nlp by JohnSnowLabs

State of the Art Natural Language Processing

updated at May 3, 2024, 6:45 p.m.

Scala

100 +0

3,699 +6

701 +2

GitHub
SynapseML by Microsoft

Simple and Distributed Machine Learning

updated at May 3, 2024, 6:07 a.m.

Scala

146 +0

4,972 +4

815 +2

GitHub
predictionio by apache

PredictionIO, a machine learning server for developers and ML engineers.

updated at May 2, 2024, 8:07 p.m.

Scala

756 +0

12,548 -1

1,936 -3

GitHub
aerosolve by airbnb

A machine learning package built for humans.

updated at May 2, 2024, 7:54 a.m.

Scala

351 +0

4,791 +1

567 -1

GitHub
delight by datamechanics

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

updated at April 30, 2024, 9:48 p.m.

Scala

16 +0

335 +1

50 +0

GitHub
scalding by twitter

A Scala API for Cascading

updated at April 28, 2024, 7:30 p.m.

Scala

323 +0

3,471 +1

703 +0

GitHub
tensorflow_scala by eaplatanios

TensorFlow API for the Scala Programming Language

updated at April 26, 2024, 6:25 p.m.

Scala

67 +0

934 +0

96 +0

GitHub
breeze by scalanlp

Breeze is a numerical processing library for Scala.

updated at April 26, 2024, 6:15 p.m.

Scala

209 +0

3,437 +0

694 +1

GitHub
algebird by twitter

Abstract Algebra for Scala

updated at April 26, 2024, 5:23 p.m.

Scala

235 +0

2,287 +0

343 +0

GitHub
summingbird by twitter

Streaming MapReduce with Scalding and Storm

updated at April 24, 2024, 10:35 p.m.

Scala

292 +0

2,138 +0

267 +0

GitHub
sparkling-water by h2oai

Sparkling Water provides H2O functionality inside Spark cluster

updated at April 24, 2024, 9:05 p.m.

Scala

178 +0

952 +0

363 +0

GitHub
adam by bigdatagenomics

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

updated at April 13, 2024, 6:15 a.m.

Scala

100 +0

967 +0

305 +1

GitHub
mist by Hydrospheredata

Serverless proxy for Spark cluster

updated at April 2, 2024, 5:42 p.m.

Scala

41 +0

326 +0

67 +0

GitHub
BIDMach by BIDData

CPU and GPU-accelerated Machine Learning Library

updated at March 31, 2024, 2:13 p.m.

Scala

88 +0

913 +0

172 +0

GitHub
chalk by scalanlp

Chalk is a natural language processing library.

updated at March 17, 2024, 10:33 p.m.

Scala

29 +0

258 +0

49 +0

GitHub
brushfire by stripe-archive

Distributed decision tree ensemble learning in Scala

updated at March 17, 2024, 10:33 p.m.

Scala

95 +0

394 +0

50 +0

GitHub
onnx-scala by EmergentOrder

An ONNX (Open Neural Network eXchange) API and backend for typeful, functional deep learning and classical machine learning in Scala 3

updated at March 12, 2024, 11:36 p.m.

Scala

8 +0

136 +0

9 +0

GitHub
doddle-model by picnicml

cake doddle-model: machine learning in Scala.

updated at March 11, 2024, 9:26 p.m.

Scala

14 +0

138 +0

23 +0

GitHub
upshot-montague by Workday

Montague is a little CCG semantic parsing library for Scala.

updated at March 3, 2024, 9:02 p.m.

Scala

15 +0

59 +0

8 +0

GitHub