ydata-profiling by ydataai

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

created at Jan. 9, 2016, 11:47 p.m.

Python

151 +0

12,070 +15

1,629 -1

GitHub
polars by pola-rs

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

created at May 13, 2020, 7:45 p.m.

Rust

148 +0

26,379 +152

1,607 +10

GitHub
TensorLayer by tensorlayer

Deep Learning and Reinforcement Learning Library for Scientists and Engineers

created at June 7, 2016, 3:55 p.m.

Python

459 +0

7,297 +1

1,605 -2

GitHub
albumentations by albumentations-team

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

created at June 6, 2018, 3:10 a.m.

Python

128 -1

13,449 +20

1,594 +2

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

545 +0

8,668 +2

1,574 +0

GitHub
mlpack by mlpack

mlpack: a fast, header-only C++ machine learning library

created at Dec. 17, 2014, 6:16 p.m.

C++

182 +0

4,820 +10

1,570 +2

GitHub
stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

61 +1

7,976 +34

1,556 +7

GitHub
tpot by EpistasisLab

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

created at Nov. 3, 2015, 9:08 p.m.

Python

288 -1

9,507 +5

1,542 +0

GitHub
BayesianOptimization by bayesian-optimization

A Python implementation of global optimization with gaussian processes.

created at June 6, 2014, 8:18 a.m.

Python

134 +1

7,517 +22

1,503 +0

GitHub
great_expectations by great-expectations

Always know what to expect from your data.

created at Sept. 11, 2017, 12:18 a.m.

Python

82 +0

9,479 +12

1,471 +3

GitHub
autokeras by keras-team

AutoML library for deep learning

created at Nov. 19, 2017, 11:18 p.m.

Python

302 +0

9,069 +3

1,398 -1

GitHub
dopamine by google

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

created at July 26, 2018, 9:58 a.m.

Jupyter Notebook

429 +0

10,375 +1

1,364 +0

GitHub
keras-rl by keras-rl

Deep Reinforcement Learning for Keras.

created at July 2, 2016, 3:53 p.m.

Python

204 +0

5,492 +0

1,362 -1

GitHub
sktime by sktime

A unified framework for machine learning with time series

created at Nov. 6, 2018, 3:08 p.m.

Python

102 +0

7,417 +10

1,284 +0

GitHub
sonnet by deepmind

TensorFlow-based neural network library

created at April 3, 2017, 11:34 a.m.

Python

420 +0

9,691 +1

1,280 -1

GitHub
imbalanced-learn by scikit-learn-contrib

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

created at Aug. 16, 2014, 5:08 a.m.

Python

141 +0

6,706 +7

1,273 +1

GitHub
auto-sklearn by automl

Automated Machine Learning with scikit-learn

created at July 2, 2015, 3:38 p.m.

Python

214 -1

7,411 +7

1,261 -3

GitHub
pytorch3d by facebookresearch

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

created at Oct. 25, 2019, 2:23 a.m.

Python

147 +0

8,313 +20

1,254 +0

GitHub
gluon-cv by dmlc

Gluon CV Toolkit

created at Feb. 26, 2018, 1:33 a.m.

Python

153 +0

5,754 +1

1,210 +1

GitHub
tsfresh by blue-yonder

Automatic extraction of relevant features from time series:

created at Oct. 26, 2016, 11:29 a.m.

Jupyter Notebook

167 -1

8,093 +7

1,197 +1

GitHub