dask by dask

Parallel computing with task scheduling

created at Jan. 4, 2015, 6:50 p.m.

Python

212 +0

12,600 +21

1,709 +0

seaborn by mwaskom

Statistical data visualization in Python

created at June 18, 2012, 6:41 p.m.

Python

264 +0

12,572 +19

1,926 +5

ydata-profiling by ydataai

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

created at Jan. 9, 2016, 11:47 p.m.

Python

152 +0

12,533 +13

1,685 +3

ludwig by ludwig-ai

Low-code framework for building custom LLMs, neural networks, and other AI models

created at Dec. 27, 2018, 11:58 p.m.

Python

194 +0

11,186 +7

1,194 +2

optuna by optuna

A hyperparameter optimization framework

created at Feb. 21, 2018, 6:12 a.m.

Python

119 +1

10,913 +41

1,037 +6

statsmodels by statsmodels

Statsmodels: statistical modeling and econometrics in Python

created at June 12, 2011, 5:04 p.m.

Python

283 +0

10,150 +26

2,885 +5

great_expectations by great-expectations

Always know what to expect from your data.

created at Sept. 11, 2017, 12:18 a.m.

Python

82 +0

9,988 +15

1,543 +2

modin by modin-project

Modin: Scale your Pandas workflows by changing a single line of code

created at June 21, 2018, 9:35 p.m.

Python

117 +0

9,891 +15

653 +2

sonnet by deepmind

TensorFlow-based neural network library

created at April 3, 2017, 11:34 a.m.

Python

423 +0

9,778 +6

1,299 +1

cleanlab by cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

created at May 11, 2018, 1:55 a.m.

Python

90 +0

9,757 +52

751 -1

tpot by EpistasisLab

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

created at Nov. 3, 2015, 9:08 p.m.

Python

288 +0

9,736 +5

1,572 +0

tflearn by tflearn

Deep learning library featuring a higher-level API for TensorFlow.

created at March 31, 2016, 12:05 p.m.

Python

456 +0

9,619 +1

2,409 +0

cupy by cupy

NumPy & SciPy for GPU

created at Nov. 1, 2016, 9:54 a.m.

Python

128 +0

9,485 +30

854 +3

autokeras by keras-team

AutoML library for deep learning

created at Nov. 19, 2017, 11:18 p.m.

Python

301 +0

9,153 +4

1,402 +0

stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

64 -1

9,142 +53

1,704 +6

pytorch3d by facebookresearch

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

created at Oct. 25, 2019, 2:23 a.m.

Python

149 +1

8,811 +20

1,315 +1

pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

543 +0

8,751 +7

1,577 +0

pymc by pymc-devs

Bayesian Modeling and Probabilistic Programming in Python

created at Feb. 20, 2015, 5:12 p.m.

Python

226 +1

8,720 +6

2,015 +4

pyro by pyro-ppl

Deep universal probabilistic programming with Python and PyTorch

created at June 16, 2017, 5:03 a.m.

Python

200 +0

8,567 +19

986 +1

vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

144 +0

8,297 +7

590 +0
