tpot by EpistasisLab

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

created at Nov. 3, 2015, 9:08 p.m.

Python

287 +0

9,665 +11

1,564 +1

GitHub
LAVIS by salesforce

LAVIS - A One-stop Library for Language-Vision Intelligence

created at Aug. 24, 2022, 2:36 a.m.

Jupyter Notebook

97 +1

9,681 +26

950 +6

GitHub
sonnet by deepmind

TensorFlow-based neural network library

created at April 3, 2017, 11:34 a.m.

Python

423 +0

9,751 +3

1,293 +1

GitHub
modin by modin-project

Modin: Scale your Pandas workflows by changing a single line of code

created at June 21, 2018, 9:35 p.m.

Python

114 +0

9,759 +15

651 +0

GitHub
great_expectations by great-expectations

Always know what to expect from your data.

created at Sept. 11, 2017, 12:18 a.m.

Python

81 +0

9,848 +32

1,522 +8

GitHub
statsmodels by statsmodels

Statsmodels: statistical modeling and econometrics in Python

created at June 12, 2011, 5:04 p.m.

Python

283 +0

9,993 +13

2,871 +3

GitHub
dopamine by google

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

created at July 26, 2018, 9:58 a.m.

Jupyter Notebook

425 +0

10,448 +3

1,373 +1

GitHub
optuna by optuna

A hyperparameter optimization framework

created at Feb. 21, 2018, 6:12 a.m.

Python

115 +0

10,569 +42

1,004 +2

GitHub
ludwig by ludwig-ai

Low-code framework for building custom LLMs, neural networks, and other AI models

created at Dec. 27, 2018, 11:58 p.m.

Python

194 +1

11,095 +10

1,188 +1

GitHub
lime by marcotcr

Lime: Explaining the predictions of any machine learning classifier

created at March 15, 2016, 10:18 p.m.

JavaScript

262 -1

11,526 +14

1,795 -1

GitHub
ydata-profiling by ydataai

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

created at Jan. 9, 2016, 11:47 p.m.

Python

152 +0

12,403 +18

1,670 +3

GitHub
seaborn by mwaskom

Statistical data visualization in Python

created at June 18, 2012, 6:41 p.m.

Python

262 +0

12,415 +14

1,908 +0

GitHub
dask by dask

Parallel computing with task scheduling

created at Jan. 4, 2015, 6:50 p.m.

Python

212 +0

12,424 +24

1,698 +5

GitHub
dgl by dmlc

Python package built to ease deep learning on graph, on top of existing DL frameworks.

created at April 20, 2018, 2:49 p.m.

Python

172 +0

13,377 +16

3,001 +3

GitHub
dlib by davisking

A toolkit for making real world machine learning and data analysis applications in C++

created at Jan. 29, 2014, 12:45 a.m.

C++

480 +1

13,434 +25

3,369 +3

GitHub
nltk by nltk

NLTK Source

created at Sept. 7, 2009, 10:53 a.m.

Python

462 -1

13,442 +18

2,864 +3

GitHub
dvc by iterative

🦉 ML Experiments and Data Management with Git

created at March 4, 2017, 8:16 a.m.

Python

137 -1

13,633 +27

1,175 +0

GitHub
flair by flairNLP

A very simple framework for state-of-the-art Natural Language Processing (NLP)

created at June 11, 2018, 11:04 a.m.

Python

202 +1

13,826 +14

2,089 +0

GitHub
albumentations by albumentations-team

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

created at June 6, 2018, 3:10 a.m.

Python

129 +0

14,061 +33

1,636 +4

GitHub
horovod by horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

created at Aug. 9, 2017, 7:39 p.m.

Python

335 +1

14,169 +11

2,224 +0

GitHub