xarray by pydata

N-D labeled arrays and datasets in Python

created at Sept. 30, 2013, 5:21 p.m.

Python

109 +0

3,417 +5

1,016 -1

GitHub
metric-learn by scikit-learn-contrib

Metric learning algorithms in Python

created at Nov. 2, 2013, 8:29 a.m.

Python

46 +0

1,376 -1

230 +0

GitHub
numexpr by pydata

Fast numerical array expression evaluator for Python, NumPy, Pandas, PyTables and more

created at Nov. 30, 2013, 10:33 p.m.

Python

60 +0

2,144 +4

200 +0

GitHub
cltk by cltk

The Classical Language Toolkit

created at Jan. 11, 2014, 11:59 p.m.

Python

66 +0

820 +0

323 +0

GitHub
dlib by davisking

A toolkit for making real world machine learning and data analysis applications in C++

created at Jan. 29, 2014, 12:45 a.m.

C++

478 +0

13,050 +20

3,320 -1

GitHub
xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

912 +0

25,596 +12

8,665 +0

GitHub
featureforge by machinalis

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

created at Feb. 17, 2014, 5:21 p.m.

Python

34 +0

382 +0

79 +0

GitHub
numdifftools by pbrod

Solve automatic numerical differentiation problems in one or more variables.

created at March 12, 2014, 5:31 p.m.

Python

12 +0

245 +0

42 +0

GitHub
sacred by IDSIA

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

created at March 31, 2014, 6:05 p.m.

Python

69 +0

4,159 +3

377 -1

GitHub
scikit-multilearn by scikit-multilearn

A scikit-learn based module for multi-label et. al. classification

created at April 30, 2014, 1:05 p.m.

Python

33 +0

906 +2

176 +0

GitHub
holoviews by holoviz

With Holoviews, your data visualizes itself.

created at May 7, 2014, 4:59 p.m.

Python

59 +0

2,625 +1

394 +0

GitHub
deap by DEAP

Distributed Evolutionary Algorithms in Python

created at May 21, 2014, 8:07 p.m.

Python

189 +0

5,566 +8

1,104 +0

GitHub
optunity by claesenm

optimization routines for hyperparameter tuning

created at May 28, 2014, 5:29 p.m.

Jupyter Notebook

24 +0

414 +0

78 +0

GitHub
BayesianOptimization by bayesian-optimization

A Python implementation of global optimization with gaussian processes.

created at June 6, 2014, 8:18 a.m.

Python

134 +1

7,517 +22

1,503 +0

GitHub
skl-groups by dougalsutherland

scikit-learn addon to operate on set/"group"-based features

created at June 10, 2014, 10:36 p.m.

Python

6 +0

41 +0

7 +0

GitHub
Spearmint by HIPS

Spearmint Bayesian optimization codebase

created at Aug. 5, 2014, 6:13 p.m.

Python

79 +0

1,540 +1

326 +0

GitHub
mlxtend by rasbt

A library of extension and helper modules for Python's data analysis and machine learning libraries.

created at Aug. 14, 2014, 1:56 a.m.

Python

117 -1

4,769 +4

844 -1

GitHub
imbalanced-learn by scikit-learn-contrib

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

created at Aug. 16, 2014, 5:08 a.m.

Python

141 +0

6,706 +7

1,273 +1

GitHub
vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

143 +0

8,171 +0

588 -1

GitHub
sparkit-learn by lensacom

PySpark + Scikit-learn = Sparkit-learn

created at Oct. 15, 2014, 2:01 p.m.

Python

90 +0

1,147 +0

254 +1

GitHub