autogluon by autogluon

Fast and Accurate ML in 3 Lines of Code

created at July 29, 2019, 6:51 p.m.

Python

99 +1

7,275 +25

860 +5

GitHub
xarray by pydata

N-D labeled arrays and datasets in Python

created at Sept. 30, 2013, 5:21 p.m.

Python

109 +0

3,451 +5

1,025 +1

GitHub
numexpr by pydata

Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more

created at Nov. 30, 2013, 10:33 p.m.

Python

60 +0

2,160 +4

203 +0

GitHub
DALEX by ModelOriented

moDel Agnostic Language for Exploration and eXplanation

created at Feb. 18, 2018, 3:24 a.m.

Python

48 +0

1,338 +2

166 +0

GitHub
sk-transformers by chrislemke

A collection of pandas & scikit-learn compatible transformers for preprocessing and feature engineering 🛠

created at Sept. 18, 2022, 1:52 p.m.

Python

3 +0

8 +0

0 +0

GitHub
ydata-synthetic by ydataai

Synthetic data generators for tabular and time-series data

created at May 4, 2020, 3:52 p.m.

Jupyter Notebook

32 +0

1,342 +7

226 -1

GitHub
pandera by unionai-oss

A light-weight, flexible, and expressive statistical data testing library

created at Nov. 1, 2018, 2:18 a.m.

Python

18 +0

3,084 +8

284 +2

GitHub
ydata-profiling by ydataai

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

created at Jan. 9, 2016, 11:47 p.m.

Python

152 +0

12,158 +17

1,637 +0

GitHub
traceml by polyaxon

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

created at March 25, 2016, 9:59 p.m.

Python

14 +0

493 +0

43 +0

GitHub
zoofs by jaswinder9051998

zoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.

created at July 11, 2020, 8:33 a.m.

Python

4 +0

236 +0

45 +0

GitHub
NitroFE by NITRO-AI

NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for providing continuous calculation.

created at Aug. 26, 2021, 11:09 a.m.

Python

4 +0

106 +0

8 +0

GitHub
geemap by gee-community

A Python package for interactive geospatial analysis and visualization with Google Earth Engine.

created at March 8, 2020, 3:21 p.m.

Python

113 +0

3,263 +8

1,064 +0

GitHub
skpro by sktime

A unified framework for tabular probabilistic regression and probability distributions in python

created at Sept. 11, 2017, 8:03 a.m.

Python

9 +0

212 +3

42 +0

GitHub
BayesianOptimization by bayesian-optimization

A Python implementation of global optimization with gaussian processes.

created at June 6, 2014, 8:18 a.m.

Python

134 +0

7,585 +12

1,507 +1

GitHub
shap by shap

A game theoretic approach to explain the output of any machine learning model.

created at Nov. 22, 2016, 7:17 p.m.

Jupyter Notebook

240 +0

21,926 +63

3,194 +11

GitHub
hamilton by DAGWorks-Inc

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

created at Feb. 23, 2023, 5:16 p.m.

Jupyter Notebook

12 +0

1,498 +42

87 +1

GitHub
qiskit by Qiskit

Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.

created at March 3, 2017, 5:02 p.m.

Python

212 +0

4,749 +25

2,265 +7

GitHub
sonnet by deepmind

TensorFlow-based neural network library

created at April 3, 2017, 11:34 a.m.

Python

421 +0

9,711 +6

1,288 +3

GitHub
trfl by deepmind

TensorFlow Reinforcement Learning

created at Aug. 8, 2018, 2:44 p.m.

Python

207 -1

3,139 +0

386 +0

GitHub
jraph by deepmind

A Graph Neural Network Library in Jax

created at Nov. 23, 2020, 10:27 a.m.

Python

42 +0

1,331 +1

87 +0

GitHub