GaussianMixtures.jl by davidavdav

Large scale Gaussian Mixture Models

created at Dec. 4, 2013, 12:10 p.m.

Julia

6 +0

99 +0

39 +1

PigPen by Netflix

Map-Reduce for Clojure

created at Dec. 12, 2013, 10:56 p.m.

Clojure

475 +1

567 +0

55 +0

pandas-cookbook by jvns

Recipes for using Python's pandas library

created at Dec. 21, 2013, 5:14 p.m.

Jupyter Notebook

305 +0

6,664 +7

2,317 +1

golearn by sjwhitworth

Machine Learning for Go

created at Dec. 26, 2013, 1:06 p.m.

Go

431 +0

9,293 +2

1,191 +1

grt by nickgillian

Gesture Recognition Toolkit

created at Jan. 3, 2014, 3:16 a.m.

C++

89 +0

864 +1

284 +0

cltk by cltk

The Classical Language Toolkit

created at Jan. 11, 2014, 11:59 p.m.

Python

65 +0

840 +2

330 +0

zzarchive-Vulpes by fsprojects-archive

Vulpes: a Deep Belief Net written in F#, using Alea.cuBase to access the GPU.

created at Jan. 15, 2014, 9:57 p.m.

JavaScript

24 +0

116 +0

18 +0

clortex by htm-community

(pre-alpha) Implementation of Jeff Hawkins' Hierarchical Temporal Memory & Cortical Learning Algorithm

created at Jan. 16, 2014, 2:15 p.m.

Clojure

47 +0

183 +0

18 +0

StatsKit.jl by JuliaStats

Convenience meta-package to load essential packages for statistics

created at Jan. 19, 2014, 12:24 a.m.

Julia

20 +0

141 +0

16 +0

mining by mining

Business Intelligence (BI) in Python, OLAP

created at Jan. 23, 2014, 7:20 p.m.

Python

117 +0

1,279 +0

238 +1

OverFeat by sermanet

(no description)

created at Jan. 24, 2014, 12:59 a.m.

C

54 +0

597 +0

203 +0

DataScience by AllenDowney

Site for a Data Science class taught by Allen Downey

created at Jan. 31, 2014, 1:46 p.m.

HTML

10 +0

44 +0

58 +0

meta by meta-toolkit

A Modern C++ Data Sciences Toolkit

created at Feb. 2, 2014, 11:54 p.m.

C++

63 +0

696 +1

236 +0

DataFramesMeta.jl by JuliaData

Metaprogramming tools for DataFrames

created at Feb. 5, 2014, 12:16 a.m.

Julia

19 +0

481 +0

55 +0

xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

908 +0

26,303 +26

8,729 +3

NMF.jl by JuliaStats

A Julia package for non-negative matrix factorization

created at Feb. 8, 2014, 3:25 p.m.

Julia

13 +0

90 +0

34 +0

featureforge by machinalis

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

created at Feb. 17, 2014, 5:21 p.m.

Python

34 +0

382 +0

77 +0

spark by apache

Apache Spark - A unified analytics engine for large-scale data processing

created at Feb. 25, 2014, 8 a.m.

Scala

2,023 +1

39,929 +83

28,314 +8

h2o-3 by h2oai

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

created at March 3, 2014, 4:08 p.m.

Jupyter Notebook

387 +0

6,924 +7

1,997 -1

go-geom by twpayne

Package geom implements efficient geometry types for geospatial applications.

created at March 6, 2014, 1:39 p.m.

Go

15 +0

856 +2

106 +0
