GaussianMixtures.jl by davidavdav

Large scale Gaussian Mixture Models

created at Dec. 4, 2013, 12:10 p.m.

Julia

6 +0

95 +0

36 +0

GitHub
PigPen by Netflix

Map-Reduce for Clojure

created at Dec. 12, 2013, 10:56 p.m.

Clojure

471 +1

562 +1

55 +0

GitHub
pandas-cookbook by jvns

Recipes for using Python's pandas library

created at Dec. 21, 2013, 5:14 p.m.

Jupyter Notebook

309 +0

6,525 +6

2,304 +0

GitHub
golearn by sjwhitworth

Machine Learning for Go

created at Dec. 26, 2013, 1:06 p.m.

Go

433 +0

9,211 +5

1,191 +0

GitHub
grt by nickgillian

gesture recognition toolkit

created at Jan. 3, 2014, 3:16 a.m.

C++

92 +0

852 +0

286 +0

GitHub
cltk by cltk

The Classical Language Toolkit

created at Jan. 11, 2014, 11:59 p.m.

Python

65 +0

823 +0

326 +0

GitHub
zzarchive-Vulpes by fsprojects-archive

Vulpes: a Deep Belief Net written in F#, and using Alea.cuBase to access the GPU.

created at Jan. 15, 2014, 9:57 p.m.

JavaScript

24 +0

116 +0

18 +0

GitHub
clortex by htm-community

(pre-alpha) Implementation of Jeff Hawkins' Hierarchical Temporal Memory & Cortical Learning Algorithm

created at Jan. 16, 2014, 2:15 p.m.

Clojure

47 +0

182 +0

18 +0

GitHub
StatsKit.jl by JuliaStats

Convenience meta-package to load essential packages for statistics

created at Jan. 19, 2014, 12:24 a.m.

Julia

20 +0

138 +0

16 +0

GitHub
mining by mining

Business Intelligence (BI) in Python, OLAP

created at Jan. 23, 2014, 7:20 p.m.

Python

118 +0

1,268 +0

234 +0

GitHub
OverFeat by sermanet

None

created at Jan. 24, 2014, 12:59 a.m.

C

57 +0

595 +0

205 +0

GitHub
DataScience by AllenDowney

Site for a Data Science class taught by Allen Downey

created at Jan. 31, 2014, 1:46 p.m.

HTML

10 +0

42 +0

58 +0

GitHub
meta by meta-toolkit

A Modern C++ Data Sciences Toolkit

created at Feb. 2, 2014, 11:54 p.m.

C++

62 +0

686 +0

233 +1

GitHub
DataFramesMeta.jl by JuliaData

Metaprogramming tools for DataFrames

created at Feb. 5, 2014, 12:16 a.m.

Julia

19 +0

476 +0

54 +0

GitHub
xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

913 +0

25,734 +20

8,683 +1

GitHub
NMF.jl by JuliaStats

A Julia package for non-negative matrix factorization

created at Feb. 8, 2014, 3:25 p.m.

Julia

14 +0

90 +0

34 +0

GitHub
featureforge by machinalis

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

created at Feb. 17, 2014, 5:21 p.m.

Python

34 +0

382 +0

77 +0

GitHub
spark by apache

Apache Spark - A unified analytics engine for large-scale data processing

created at Feb. 25, 2014, 8 a.m.

Scala

2,031 +1

38,708 +52

28,034 +22

GitHub
h2o-3 by h2oai

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

created at March 3, 2014, 4:08 p.m.

Jupyter Notebook

385 +0

6,768 +8

1,995 +0

GitHub
go-geom by twpayne

Package geom implements efficient geometry types for geospatial applications.

created at March 6, 2014, 1:39 p.m.

Go

15 +0

800 +3

104 +1

GitHub