statsmodels by statsmodels

Statsmodels: statistical modeling and econometrics in Python

created at June 12, 2011, 5:04 p.m.

Python

283 +0

10,150 +26

2,885 +5

GitHub
great_expectations by great-expectations

Always know what to expect from your data.

created at Sept. 11, 2017, 12:18 a.m.

Python

82 +0

9,988 +15

1,543 +2

GitHub
LAVIS by salesforce

LAVIS - A One-stop Library for Language-Vision Intelligence

created at Aug. 24, 2022, 2:36 a.m.

Jupyter Notebook

99 +0

9,930 +31

973 +2

GitHub
modin by modin-project

Modin: Scale your Pandas workflows by changing a single line of code

created at June 21, 2018, 9:35 p.m.

Python

117 +0

9,891 +15

653 +2

GitHub
sonnet by deepmind

TensorFlow-based neural network library

created at April 3, 2017, 11:34 a.m.

Python

423 +0

9,778 +6

1,299 +1

GitHub
cleanlab by cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

created at May 11, 2018, 1:55 a.m.

Python

90 +0

9,757 +52

751 -1

GitHub
tpot by EpistasisLab

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

created at Nov. 3, 2015, 9:08 p.m.

Python

288 +0

9,736 +5

1,572 +0

GitHub
tflearn by tflearn

Deep learning library featuring a higher-level API for TensorFlow.

created at March 31, 2016, 12:05 p.m.

Python

456 +0

9,619 +1

2,409 +0

GitHub
cupy by cupy

NumPy & SciPy for GPU

created at Nov. 1, 2016, 9:54 a.m.

Python

128 +0

9,485 +30

854 +3

GitHub
autokeras by keras-team

AutoML library for deep learning

created at Nov. 19, 2017, 11:18 p.m.

Python

301 +0

9,153 +4

1,402 +0

GitHub
stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

64 -1

9,142 +53

1,704 +6

GitHub
pycaret by pycaret

An open-source, low-code machine learning library in Python

created at Nov. 23, 2019, 6:40 p.m.

Jupyter Notebook

136 +1

8,955 +26

1,774 +3

GitHub
pytorch3d by facebookresearch

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

created at Oct. 25, 2019, 2:23 a.m.

Python

149 +1

8,811 +20

1,315 +1

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

543 +0

8,751 +7

1,577 +0

GitHub
pymc by pymc-devs

Bayesian Modeling and Probabilistic Programming in Python

created at Feb. 20, 2015, 5:12 p.m.

Python

226 +1

8,720 +6

2,015 +4

GitHub
pyro by pyro-ppl

Deep universal probabilistic programming with Python and PyTorch

created at June 16, 2017, 5:03 a.m.

Python

200 +0

8,567 +19

986 +1

GitHub
cudf by rapidsai

cuDF - GPU DataFrame Library

created at May 7, 2017, 3:43 a.m.

C++

154 +0

8,448 +22

908 +5

GitHub
tsfresh by blue-yonder

Automatic extraction of relevant features from time series:

created at Oct. 26, 2016, 11:29 a.m.

Jupyter Notebook

171 +0

8,436 +7

1,214 +2

GitHub
vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

144 +0

8,297 +7

590 +0

GitHub
catboost by catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

created at July 18, 2017, 5:29 a.m.

C++

191 +0

8,091 +9

1,190 +2

GitHub