dgl by dmlc

Python package built to ease deep learning on graph, on top of existing DL frameworks.

created at April 20, 2018, 2:49 p.m.

Python

169 +0

13,020 +9

2,957 +3

GitHub
ydata-profiling by ydataai

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

created at Jan. 9, 2016, 11:47 p.m.

Python

151 +0

12,070 +15

1,629 -1

GitHub
dask by dask

Parallel computing with task scheduling

created at Jan. 4, 2015, 6:50 p.m.

Python

211 +0

12,024 +22

1,669 +1

GitHub
seaborn by mwaskom

Statistical data visualization in Python

created at June 18, 2012, 6:41 p.m.

Python

259 +0

11,969 +10

1,860 +1

GitHub
ludwig by ludwig-ai

Low-code framework for building custom LLMs, neural networks, and other AI models

created at Dec. 27, 2018, 11:58 p.m.

Python

192 +0

10,831 +20

1,163 +0

GitHub
optuna by optuna

A hyperparameter optimization framework

created at Feb. 21, 2018, 6:12 a.m.

Python

118 +0

9,696 +34

952 -1

GitHub
sonnet by deepmind

TensorFlow-based neural network library

created at April 3, 2017, 11:34 a.m.

Python

420 +0

9,691 +1

1,280 -1

GitHub
tflearn by tflearn

Deep learning library featuring a higher-level API for TensorFlow.

created at March 31, 2016, 12:05 p.m.

Python

457 +0

9,605 +0

2,412 -1

GitHub
statsmodels by statsmodels

Statsmodels: statistical modeling and econometrics in Python

created at June 12, 2011, 5:04 p.m.

Python

280 -1

9,566 +13

2,823 -1

GitHub
tpot by EpistasisLab

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

created at Nov. 3, 2015, 9:08 p.m.

Python

288 -1

9,507 +5

1,542 +0

GitHub
modin by modin-project

Modin: Scale your Pandas workflows by changing a single line of code

created at June 21, 2018, 9:35 p.m.

Python

116 +0

9,489 +14

643 +1

GitHub
great_expectations by great-expectations

Always know what to expect from your data.

created at Sept. 11, 2017, 12:18 a.m.

Python

82 +0

9,479 +12

1,471 +3

GitHub
autokeras by keras-team

AutoML library for deep learning

created at Nov. 19, 2017, 11:18 p.m.

Python

302 +0

9,069 +3

1,398 -1

GitHub
cleanlab by cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

created at May 11, 2018, 1:55 a.m.

Python

85 +0

8,687 +30

668 +0

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

545 +0

8,668 +2

1,574 +0

GitHub
pyro by pyro-ppl

Deep universal probabilistic programming with Python and PyTorch

created at June 16, 2017, 5:03 a.m.

Python

201 +0

8,368 +5

984 +0

GitHub
pytorch3d by facebookresearch

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

created at Oct. 25, 2019, 2:23 a.m.

Python

147 +0

8,313 +20

1,254 +0

GitHub
pymc by pymc-devs

Bayesian Modeling and Probabilistic Programming in Python

created at Feb. 20, 2015, 5:12 p.m.

Python

225 +0

8,172 +10

1,925 +1

GitHub
vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

143 +0

8,171 +0

588 -1

GitHub
stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

61 +1

7,976 +34

1,556 +7

GitHub