GaussianMixtures.jl by davidavdav

Large scale Gaussian Mixture Models

created at Dec. 4, 2013, 12:10 p.m.

Julia

6 +0

95 +0

36 +0

GitHub
PigPen by Netflix

Map-Reduce for Clojure

created at Dec. 12, 2013, 10:56 p.m.

Clojure

471 +0

562 +0

55 +0

GitHub
pandas-cookbook by jvns

Recipes for using Python's pandas library

created at Dec. 21, 2013, 5:14 p.m.

Jupyter Notebook

309 +0

6,535 +10

2,306 +2

GitHub
golearn by sjwhitworth

Machine Learning for Go

created at Dec. 26, 2013, 1:06 p.m.

Go

433 +0

9,218 +7

1,191 +0

GitHub
grt by nickgillian

gesture recognition toolkit

created at Jan. 3, 2014, 3:16 a.m.

C++

92 +0

852 +0

287 +1

GitHub
cltk by cltk

The Classical Language Toolkit

created at Jan. 11, 2014, 11:59 p.m.

Python

65 +0

824 +1

326 +0

GitHub
zzarchive-Vulpes by fsprojects-archive

Vulpes: a Deep Belief Net written in F#, and using Alea.cuBase to access the GPU.

created at Jan. 15, 2014, 9:57 p.m.

JavaScript

24 +0

116 +0

18 +0

GitHub
clortex by htm-community

(pre-alpha) Implementation of Jeff Hawkins' Hierarchical Temporal Memory & Cortical Learning Algorithm

created at Jan. 16, 2014, 2:15 p.m.

Clojure

47 +0

183 +1

18 +0

GitHub
StatsKit.jl by JuliaStats

Convenience meta-package to load essential packages for statistics

created at Jan. 19, 2014, 12:24 a.m.

Julia

20 +0

138 +0

16 +0

GitHub
mining by mining

Business Intelligence (BI) in Python, OLAP

created at Jan. 23, 2014, 7:20 p.m.

Python

118 +0

1,270 +2

234 +0

GitHub
OverFeat by sermanet

None

created at Jan. 24, 2014, 12:59 a.m.

C

57 +0

595 +0

205 +0

GitHub
DataScience by AllenDowney

Site for a Data Science class taught by Allen Downey

created at Jan. 31, 2014, 1:46 p.m.

HTML

10 +0

42 +0

58 +0

GitHub
meta by meta-toolkit

A Modern C++ Data Sciences Toolkit

created at Feb. 2, 2014, 11:54 p.m.

C++

62 +0

686 +0

233 +0

GitHub
DataFramesMeta.jl by JuliaData

Metaprogramming tools for DataFrames

created at Feb. 5, 2014, 12:16 a.m.

Julia

19 +0

476 +0

54 +0

GitHub
xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

913 +0

25,748 +14

8,684 +1

GitHub
NMF.jl by JuliaStats

A Julia package for non-negative matrix factorization

created at Feb. 8, 2014, 3:25 p.m.

Julia

14 +0

90 +0

34 +0

GitHub
featureforge by machinalis

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

created at Feb. 17, 2014, 5:21 p.m.

Python

34 +0

382 +0

77 +0

GitHub
spark by apache

Apache Spark - A unified analytics engine for large-scale data processing

created at Feb. 25, 2014, 8 a.m.

Scala

2,033 +2

38,767 +59

28,051 +17

GitHub
h2o-3 by h2oai

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

created at March 3, 2014, 4:08 p.m.

Jupyter Notebook

385 +0

6,776 +8

1,992 -3

GitHub
go-geom by twpayne

Package geom implements efficient geometry types for geospatial applications.

created at March 6, 2014, 1:39 p.m.

Go

15 +0

802 +2

104 +0

GitHub