catboost by catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

created at July 18, 2017, 5:29 a.m.

Python

192 +0

7,837 +5

1,161 +1

GitHub
cortex by cortexlabs

Production infrastructure for machine learning at scale

created at Jan. 24, 2019, 4:43 a.m.

Go

146 +1

7,999 +1

608 +0

GitHub
brain by harthur

Simple feed-forward neural network in JavaScript

created at May 10, 2010, 6:36 a.m.

JavaScript

386 +0

8,008 +1

856 +0

GitHub
einops by arogozhnikov

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

created at Sept. 22, 2018, 12:45 a.m.

Python

68 +0

8,080 +29

337 +1

GitHub
pyod by yzhao062

A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)

created at Oct. 3, 2017, 8:29 p.m.

Python

146 +0

8,099 +33

1,332 +1

GitHub
vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

143 +0

8,203 +7

589 +0

GitHub
stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

61 +0

8,236 +42

1,581 +5

GitHub
gorse by gorse-io

Gorse open source recommender system engine

created at Aug. 14, 2018, 11:01 a.m.

Go

65 +0

8,253 +17

743 +2

GitHub
pymc by pymc-devs

Bayesian Modeling and Probabilistic Programming in Python

created at Feb. 20, 2015, 5:12 p.m.

Python

224 -1

8,266 +19

1,952 +13

GitHub
vowpal_wabbit by VowpalWabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

created at July 31, 2009, 7:36 p.m.

C++

352 +0

8,430 +5

1,927 +0

GitHub
pycaret by pycaret

An open-source, low-code machine learning library in Python

created at Nov. 23, 2019, 6:40 p.m.

Jupyter Notebook

131 +0

8,588 +10

1,735 +2

GitHub
tokenizers by huggingface

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

created at Nov. 1, 2019, 5:52 p.m.

Rust

120 -1

8,609 +21

739 +2

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

545 +0

8,693 +0

1,577 +2

GitHub
machinelearning by dotnet

ML.NET is an open source and cross-platform machine learning framework for .NET.

created at May 3, 2018, 4:20 p.m.

C#

580 +0

8,897 +12

1,854 +3

GitHub
cleanlab by cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

created at May 11, 2018, 1:55 a.m.

Python

85 +0

8,917 +14

688 +3

GitHub
altair by vega

Declarative statistical visualization library for Python

created at Sept. 19, 2015, 3:14 a.m.

Python

141 +0

9,017 +16

766 +0

GitHub
segmentation_models.pytorch by qubvel

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

created at March 1, 2019, 4:21 p.m.

Python

79 +0

9,019 +26

1,612 +2

GitHub
fuzzywuzzy by seatgeek

Fuzzy String Matching in Python

created at July 8, 2011, 7:32 p.m.

Python

259 +0

9,157 +6

879 +1

GitHub
golearn by sjwhitworth

Machine Learning for Go

created at Dec. 26, 2013, 1:06 p.m.

Go

433 +0

9,211 +5

1,191 +0

GitHub
PySyft by OpenMined

Perform data science on data that remains in someone else's server

created at July 18, 2017, 8:41 p.m.

Python

197 -1

9,329 +12

1,990 +2

GitHub