tokenizers by huggingface

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

created at Nov. 1, 2019, 5:52 p.m.

Rust

120 +2

8,425 +30

725 +11

pycaret by pycaret

An open-source, low-code machine learning library in Python

created at Nov. 23, 2019, 6:40 p.m.

Jupyter Notebook

130 -1

8,414 +20

1,716 +2

vowpal_wabbit by VowpalWabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

created at July 31, 2009, 7:36 p.m.

C++

349 +0

8,400 +0

1,928 +0

vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

143 -1

8,171 -3

589 +0

pymc by pymc-devs

Bayesian Modeling and Probabilistic Programming in Python

created at Feb. 20, 2015, 5:12 p.m.

Python

225 +0

8,162 +14

1,924 +1

gorse by gorse-io

Gorse, an open-source recommender system engine

created at Aug. 14, 2018, 11:01 a.m.

Go

64 +0

8,113 +50

728 +8

brain by harthur

Simple feed-forward neural network in JavaScript

created at May 10, 2010, 6:36 a.m.

JavaScript

386 +0

8,006 +0

859 +0

cortex by cortexlabs

Production infrastructure for machine learning at scale

created at Jan. 24, 2019, 4:43 a.m.

Go

145 +0

7,990 +0

605 +0

pyod by yzhao062

A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)

created at Oct. 3, 2017, 8:29 p.m.

Python

148 +1

7,954 +19

1,314 +1

stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

60 +0

7,942 +62

1,549 +6

einops by arogozhnikov

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

created at Sept. 22, 2018, 12:45 a.m.

Python

68 +0

7,927 +23

334 +0

catboost by catboost

A fast, scalable, high-performance library for gradient boosting on decision trees, used for ranking, classification, regression, and other machine learning tasks in Python, R, Java, and C++. Supports computation on CPU and GPU.

created at July 18, 2017, 5:29 a.m.

Python

192 +0

7,751 +15

1,148 +3

deeplake by activeloopai

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

created at Aug. 9, 2019, 6:17 a.m.

Python

86 +0

7,711 +18

591 +3

sktime by sktime

A unified framework for machine learning with time series

created at Nov. 6, 2018, 3:08 p.m.

Python

102 +0

7,407 +9

1,284 +2

introduction_to_ml_with_python by amueller

Notebooks and code for the book "Introduction to Machine Learning with Python"

created at May 29, 2016, 6:29 p.m.

Jupyter Notebook

368 +0

7,195 +6

4,484 +1

autogluon by autogluon

Fast and Accurate ML in 3 Lines of Code

created at July 29, 2019, 6:51 p.m.

Python

98 +0

7,123 +32

843 +1

burn by burn-rs

Burn is a new comprehensive dynamic Deep Learning Framework built using Rust with extreme flexibility, compute efficiency and portability as its primary goals.

created at July 18, 2022, 11:11 p.m.

Rust

54 +2

7,049 +49

316 +1

ccv by liuliu

C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library

created at Sept. 15, 2010, 3:59 p.m.

C

352 +0

7,041 -1

1,713 +0

lab by deepmind

A customisable 3D platform for agent-based AI research

created at Nov. 30, 2016, 1:41 p.m.

C

468 +0

7,024 +3

1,360 +3

featuretools by alteryx

An open-source Python library for automated feature engineering

created at Sept. 8, 2017, 10:15 p.m.

Python

158 +0

7,023 +6

852 +0
