batchflow by analysiscenter

BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.

created at March 13, 2017, 2:22 p.m.

Python

16 +1

197 +1

46 +0

GitHub
pyjanitor by pyjanitor-devs

Clean APIs for data cleaning. Python implementation of R package Janitor

created at March 4, 2018, 10:43 p.m.

Python

16 +0

1,307 +3

166 +0

GitHub
adaptive by python-adaptive

chart with upwards trend Adaptive: parallel active learning of mathematical functions

created at Dec. 10, 2017, 1:47 a.m.

Python

16 +0

1,121 +1

58 +0

GitHub
BoostARoota by chasedehan

A fast xgboost feature selection algorithm

created at Aug. 23, 2017, 9:43 p.m.

Python

16 +0

214 +1

38 +0

GitHub
recmetrics by statisticianinstilettos

A library of metrics for evaluating recommender systems

created at Oct. 15, 2018, 3:29 p.m.

Jupyter Notebook

15 +0

561 +0

100 +0

GitHub
scikit-rvm by JamesRitchie

Relevance Vector Machine implementation using the scikit-learn API.

created at Aug. 2, 2015, 4:04 p.m.

Python

14 +0

228 -1

73 +0

GitHub
mlforecast by Nixtla

Scalable machine 🤖 learning for time series forecasting.

created at April 26, 2021, 8:58 p.m.

Python

14 +1

764 +13

71 +1

GitHub
muda by bmcfee

A library for augmenting annotated audio data

created at Nov. 7, 2014, 9:21 p.m.

Python

14 +0

229 +0

33 +0

GitHub
traceml by polyaxon

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

created at March 25, 2016, 9:59 p.m.

Python

14 +0

494 +1

43 +0

GitHub
hamilton by DAGWorks-Inc

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.

created at Feb. 23, 2023, 5:16 p.m.

Jupyter Notebook

13 +0

1,531 +15

88 +1

GitHub
scikit-mdr by EpistasisLab

A sklearn-compatible Python implementation of Multifactor Dimensionality Reduction (MDR) for feature construction.

created at June 7, 2016, 5:49 p.m.

Python

13 +0

125 +0

26 +0

GitHub
pystan by stan-dev

PyStan, a Python interface to Stan, a platform for statistical modeling. Documentation: https://pystan.readthedocs.io

created at May 24, 2013, 1:21 a.m.

Python

13 +0

324 +0

58 +0

GitHub
bayesloop by christophmark

Probabilistic programming framework that facilitates objective model selection for time-varying parameter models.

created at Aug. 27, 2015, 8:11 a.m.

Python

13 +0

136 +0

26 +0

GitHub
CapsNet-Visualization by bourdakos1

🎆 A visualization of the CapsNet layers to better understand how it works

created at Jan. 19, 2018, 8:46 p.m.

Python

13 +0

394 +0

93 +0

GitHub
imbalanced-algorithms by dialnd

Python-based implementations of algorithms for learning on imbalanced data.

created at Aug. 24, 2016, 8:59 p.m.

Python

13 +0

232 +0

100 +0

GitHub
themis-ml by cosmicBboy

A library that implements fairness-aware machine learning algorithms

created at Aug. 26, 2017, 8:15 p.m.

Jupyter Notebook

12 +0

123 +0

26 +0

GitHub
L2X by Jianbo-Lab

None

created at Feb. 21, 2018, 8:33 p.m.

Python

12 +0

123 +0

36 +0

GitHub
pyvarinf by ctallec

Python package facilitating the use of Bayesian Deep Learning methods with Variational Inference for PyTorch

created at March 2, 2018, 3:32 p.m.

Python

12 +0

358 +1

50 +0

GitHub
sigopt-sklearn by sigopt

SigOpt wrappers for scikit-learn methods

created at April 15, 2016, 8:58 p.m.

Python

12 +0

75 +0

11 +0

GitHub
Solid by 100

🎯 A comprehensive gradient-free optimization framework written in Python

created at June 12, 2017, 5:02 a.m.

Python

12 +0

575 +0

64 +0

GitHub