altair by vega

Declarative statistical visualization library for Python

created at Sept. 19, 2015, 3:14 a.m.

Python

140 +0

9,365 +16

793 -1

GitHub
datumbox-framework by datumbox

Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.

created at Oct. 18, 2014, 6:10 p.m.

Java

137 +0

1,085 +0

282 +0

GitHub
pycaret by pycaret

An open-source, low-code machine learning library in Python

created at Nov. 23, 2019, 6:40 p.m.

Jupyter Notebook

136 +1

8,955 +26

1,774 +3

GitHub
dvc by iterative

🦉 Data Versioning and ML Experiments

created at March 4, 2017, 8:16 a.m.

Python

135 +0

13,902 +30

1,187 +1

GitHub
snips-nlu by snipsco

Snips Python library to extract meaning from text

created at Feb. 8, 2017, 4:16 p.m.

Python

134 +0

3,897 +2

513 +0

GitHub
python-recsys by ocelma

A python library for implementing a recommender system

created at Oct. 10, 2011, 4:17 a.m.

Python

133 +0

1,475 +0

436 +0

GitHub
deepdetect by jolibrain

Deep Learning API and Server in C++14 support for PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE

created at May 22, 2015, 2:45 p.m.

C++

132 +0

2,519 +0

561 +0

GitHub
pycascading by twitter-archive

A Python wrapper for Cascading

created at Dec. 9, 2011, 11:56 p.m.

Python

129 +0

222 +0

37 +0

GitHub
albumentations by albumentations-team

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

created at June 6, 2018, 3:10 a.m.

Python

129 +1

14,258 +30

1,649 +2

GitHub
ThinkBayes by AllenDowney

Code repository for Think Bayes.

created at July 8, 2013, 2:30 p.m.

TeX

128 +1

1,648 +2

1,938 +0

GitHub
weaviate by semi-technologies

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

created at March 30, 2016, 3:03 p.m.

Go

126 -2

11,500 +90

797 +4

GitHub
coach by IntelLabs

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

created at Oct. 1, 2017, 7:27 p.m.

Python

126 +0

2,330 +2

461 +0

GitHub
qdrant by qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

created at May 30, 2020, 9:37 p.m.

Rust

126 +0

20,607 +97

1,400 +3

GitHub
ggpy by yhat

ggplot port for python

created at Oct. 7, 2013, 1:30 p.m.

Python

125 +0

3,700 +0

571 -1

GitHub
onyx by onyx-platform

Distributed, masterless, high performance, fault tolerant data processing

created at Dec. 2, 2013, 1:21 a.m.

Clojure

122 +0

2,050 +0

205 +0

GitHub
tokenizers by huggingface

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

created at Nov. 1, 2019, 5:52 p.m.

Rust

121 +0

9,050 +8

802 +3

GitHub
dedupe by dedupeio

id A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

created at April 20, 2012, 2:57 p.m.

Python

120 +0

4,145 +3

551 +1

GitHub
optuna by optuna

A hyperparameter optimization framework

created at Feb. 21, 2018, 6:12 a.m.

Python

119 +1

10,913 +41

1,037 +6

GitHub
mining by mining

Business Intelligence (BI) in Python, OLAP

created at Jan. 23, 2014, 7:20 p.m.

Python

117 +0

1,279 +0

238 +1

GitHub
mlxtend by rasbt

A library of extension and helper modules for Python's data analysis and machine learning libraries.

created at Aug. 14, 2014, 1:56 a.m.

Python

115 -1

4,909 +4

872 +3

GitHub