predictionio by apache

PredictionIO, a machine learning server for developers and ML engineers.

created at Jan. 25, 2013, 7:42 p.m.

Scala

756 +0

12,545 -1

1,928 +1

GitHub
lime by marcotcr

Lime: Explaining the predictions of any machine learning classifier

created at March 15, 2016, 10:18 p.m.

JavaScript

261 +0

11,615 +14

1,810 +3

GitHub
weaviate by semi-technologies

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

created at March 30, 2016, 3:03 p.m.

Go

126 -2

11,500 +90

797 +4

GitHub
compromise by spencermountain

modest natural-language processing

created at July 5, 2011, 9:04 a.m.

JavaScript

163 +0

11,482 +13

653 +0

GitHub
turicreate by apple

Turi Create simplifies the development of custom machine learning models.

created at Dec. 1, 2017, 12:42 a.m.

C++

339 +0

11,203 +1

1,138 +0

GitHub
optuna by optuna

A hyperparameter optimization framework

created at Feb. 21, 2018, 6:12 a.m.

Python

119 +1

10,913 +41

1,037 +6

GitHub
natural by NaturalNode

general natural language facilities for node

created at May 7, 2011, 2:35 a.m.

JavaScript

244 +0

10,623 +4

860 +0

GitHub
statsmodels by statsmodels

Statsmodels: statistical modeling and econometrics in Python

created at June 12, 2011, 5:04 p.m.

Python

283 +0

10,150 +26

2,885 +5

GitHub
Theano by Theano

Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.com/pymc-devs/pytensor

created at Aug. 10, 2011, 3:48 a.m.

Python

539 +0

9,904 +3

2,488 +2

GitHub
cleanlab by cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

created at May 11, 2018, 1:55 a.m.

Python

90 +0

9,757 +52

751 -1

GitHub
tpot by EpistasisLab

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

created at Nov. 3, 2015, 9:08 p.m.

Python

288 +0

9,736 +5

1,572 +0

GitHub
CoreNLP by stanfordnlp

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

created at June 27, 2013, 9:13 p.m.

Java

489 -1

9,702 +9

2,703 +0

GitHub
segmentation_models.pytorch by qubvel-org

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

created at March 1, 2019, 4:21 p.m.

Python

82 +0

9,693 +42

1,678 +4

GitHub
tflearn by tflearn

Deep learning library featuring a higher-level API for TensorFlow.

created at March 31, 2016, 12:05 p.m.

Python

456 +0

9,619 +1

2,409 +0

GitHub
PySyft by OpenMined

Perform data science on data that remains in someone else's server

created at July 18, 2017, 8:41 p.m.

Python

198 +0

9,516 +16

1,993 -1

GitHub
txtai by neuml

đŸ’¡ All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

created at Aug. 9, 2020, 7:14 p.m.

Python

92 +0

9,369 +118

602 +5

GitHub
altair by vega

Declarative statistical visualization library for Python

created at Sept. 19, 2015, 3:14 a.m.

Python

140 +0

9,365 +16

793 -1

GitHub
golearn by sjwhitworth

Machine Learning for Go

created at Dec. 26, 2013, 1:06 p.m.

Go

431 +0

9,293 +2

1,191 +1

GitHub
fuzzywuzzy by seatgeek

Fuzzy String Matching in Python

created at July 8, 2011, 7:32 p.m.

Python

259 +1

9,231 +4

876 +1

GitHub
stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

64 -1

9,142 +53

1,704 +6

GitHub