autogluon by autogluon

Fast and Accurate ML in 3 Lines of Code

created at July 29, 2019, 6:51 p.m.

Python

99 +0

7,320 +45

864 +4

xarray by pydata

N-D labeled arrays and datasets in Python

created at Sept. 30, 2013, 5:21 p.m.

Python

110 +1

3,461 +10

1,029 +4

numexpr by pydata

Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more

created at Nov. 30, 2013, 10:33 p.m.

Python

60 +0

2,166 +6

204 +1

DALEX by ModelOriented

moDel Agnostic Language for Exploration and eXplanation

created at Feb. 18, 2018, 3:24 a.m.

Python

48 +0

1,341 +3

166 +0

sk-transformers by chrislemke

A collection of pandas & scikit-learn compatible transformers for preprocessing and feature engineering 🛠

created at Sept. 18, 2022, 1:52 p.m.

Python

3 +0

8 +0

0 +0

ydata-synthetic by ydataai

Synthetic data generators for tabular and time-series data

created at May 4, 2020, 3:52 p.m.

Jupyter Notebook

32 +0

1,349 +7

227 +1

pandera by unionai-oss

A light-weight, flexible, and expressive statistical data testing library

created at Nov. 1, 2018, 2:18 a.m.

Python

18 +0

3,097 +13

286 +2

ydata-profiling by ydataai

One-line-of-code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

created at Jan. 9, 2016, 11:47 p.m.

Python

153 +1

12,180 +22

1,642 +5

traceml by polyaxon

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

created at March 25, 2016, 9:59 p.m.

Python

14 +0

493 +0

43 +0

zoofs by jaswinder9051998

zoofs is a Python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to evolutionary. It is an easy-to-use, flexible, and powerful tool for reducing your feature set size.

created at July 11, 2020, 8:33 a.m.

Python

4 +0

236 +0

45 +0

NitroFE by NITRO-AI

NitroFE is a Python feature engineering engine that provides a variety of modules designed to internally store past dependent values, enabling continuous calculation.

created at Aug. 26, 2021, 11:09 a.m.

Python

4 +0

106 +0

8 +0

geemap by gee-community

A Python package for interactive geospatial analysis and visualization with Google Earth Engine.

created at March 8, 2020, 3:21 p.m.

Python

112 -1

3,271 +8

1,065 +1

skpro by sktime

A unified framework for tabular probabilistic regression and probability distributions in Python

created at Sept. 11, 2017, 8:03 a.m.

Python

9 +0

214 +2

42 +0

BayesianOptimization by bayesian-optimization

A Python implementation of global optimization with Gaussian processes.

created at June 6, 2014, 8:18 a.m.

Python

134 +0

7,596 +11

1,508 +1

shap by shap

A game theoretic approach to explain the output of any machine learning model.

created at Nov. 22, 2016, 7:17 p.m.

Jupyter Notebook

241 +1

21,966 +40

3,198 +4

hamilton by DAGWorks-Inc

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage and metadata. Runs and scales everywhere Python does.

created at Feb. 23, 2023, 5:16 p.m.

Jupyter Notebook

13 +1

1,516 +18

87 +0

qiskit by Qiskit

Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.

created at March 3, 2017, 5:02 p.m.

Python

213 +1

4,775 +26

2,270 +5

sonnet by deepmind

TensorFlow-based neural network library

created at April 3, 2017, 11:34 a.m.

Python

421 +0

9,711 +0

1,288 +0

trfl by deepmind

TensorFlow Reinforcement Learning

created at Aug. 8, 2018, 2:44 p.m.

Python

207 +0

3,139 +0

386 +0

jraph by deepmind

A Graph Neural Network Library in JAX

created at Nov. 23, 2020, 10:27 a.m.

Python

42 +0

1,336 +5

88 +1
