feature_engine by feature-engine

Feature engineering package with sklearn-like functionality

updated at Nov. 16, 2024, 9:50 p.m.

Python

33 +0

1,924 +4

312 +0

dlib by davisking

A toolkit for making real-world machine learning and data analysis applications in C++

updated at Nov. 16, 2024, 9:30 p.m.

C++

479 +2

13,563 +21

3,380 +6

pysal by pysal

PySAL: Python Spatial Analysis Library Meta-Package

updated at Nov. 16, 2024, 9:25 p.m.

Python

79 +0

1,330 +1

303 +0

catboost by catboost

A fast, scalable, high-performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

updated at Nov. 16, 2024, 9:24 p.m.

C++

191 +0

8,091 +9

1,190 +2

aequitas by dssg

Bias Auditing & Fair ML Toolkit

updated at Nov. 16, 2024, 9:22 p.m.

Python

43 +0

694 +3

113 +0

cudf by rapidsai

cuDF - GPU DataFrame Library

updated at Nov. 16, 2024, 9:16 p.m.

C++

154 +0

8,448 +22

908 +5

missingno by ResidentMario

Missing data visualization module for Python.

updated at Nov. 16, 2024, 8:45 p.m.

Python

76 +0

3,961 +6

518 +1

darts by unit8co

A Python library for user-friendly forecasting and anomaly detection on time series.

updated at Nov. 16, 2024, 8:44 p.m.

Python

61 +0

8,087 +20

881 +2

vision by pytorch

Datasets, Transforms and Models specific to Computer Vision

updated at Nov. 16, 2024, 7:45 p.m.

Python

435 -1

16,257 +43

6,956 +5

yellowbrick by DistrictDataLabs

Visual analysis and diagnostic tools to facilitate machine learning model selection.

updated at Nov. 16, 2024, 7:37 p.m.

Python

103 +0

4,293 +3

559 +0

cltk by cltk

The Classical Language Toolkit

updated at Nov. 16, 2024, 7:32 p.m.

Python

65 +0

840 +2

330 +0

deap by DEAP

Distributed Evolutionary Algorithms in Python

updated at Nov. 16, 2024, 7:20 p.m.

Python

191 +0

5,852 +8

1,128 +0

vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

updated at Nov. 16, 2024, 7:06 p.m.

Python

144 +0

8,297 +7

590 +0

pymc by pymc-devs

Bayesian Modeling and Probabilistic Programming in Python

updated at Nov. 16, 2024, 6:58 p.m.

Python

226 +1

8,720 +6

2,015 +4

pomegranate by jmschrei

Fast, flexible and easy to use probabilistic modelling in Python.

updated at Nov. 16, 2024, 6:53 p.m.

Python

95 +0

3,376 +2

590 +0

echarts by apache

Apache ECharts is a powerful, interactive charting and data visualization library for the browser

updated at Nov. 16, 2024, 6:48 p.m.

TypeScript

1,970 -1

60,639 +59

19,613 -3

Metrics by benhamner

Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave

updated at Nov. 16, 2024, 6:42 p.m.

Python

87 +0

1,628 +0

454 +0

featuretools by alteryx

An open-source Python library for automated feature engineering

updated at Nov. 16, 2024, 6:39 p.m.

Python

158 +0

7,270 +11

878 -1

scikit-image by scikit-image

Image processing in Python

updated at Nov. 16, 2024, 6:22 p.m.

Python

186 +0

6,091 +9

2,235 +4

shap by shap

A game theoretic approach to explain the output of any machine learning model.

updated at Nov. 16, 2024, 6:20 p.m.

Jupyter Notebook

245 +1

22,880 +52

3,290 +0
