Naive Bayesian Classifier written in APL
created at Sept. 13, 2015, 3:27 p.m.
list of anything (Community-driven list of anything) text :)
created at June 21, 2014, 2:22 p.m.
Scripts to generate a dataset with static frames from the Arcade Learning Environment
created at May 9, 2014, 4:07 p.m.
John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm
created at Oct. 31, 2011, 8:31 p.m.
Rust library for Self Organising Maps (SOM).
created at March 4, 2018, 7:44 a.m.
Curated set of transformers that make your work with steppy faster and more effective
created at April 26, 2018, 10:27 p.m.
This project has been moved to https://github.com/cnclabs/smore.
created at Nov. 20, 2016, 8:49 a.m.
Hopfield Networks for unsupervised learning in Haskell
created at Nov. 20, 2013, 6:06 p.m.
This is a Python binding to the tokeniser Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the Ucto tokeniser available to Python. Ucto itself is a regular-expression-based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
created at May 21, 2014, 5:28 p.m.
Code for the Best Buy competition at Kaggle
created at Sept. 19, 2012, 2:38 p.m.
Source code and supporting content for my Ruby Manor presentation on Data Visualisation with Ruby
created at Dec. 6, 2009, 9:59 p.m.
A Kaggle competition: discriminate gender based on handwriting
created at April 10, 2013, 9:34 p.m.
Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies
created at Dec. 13, 2014, 6:38 p.m.
python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. With this module, all functionality exposed through the C++ interface is also available to Python scripts. Being able to access the API from Python greatly facilitates prototyping TiMBL-based applications.
created at Feb. 11, 2013, 11:07 a.m.