Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --
created at March 26, 2013, 11:16 a.m.
A native Go clean room implementation of the Porter Stemming algorithm.
created at June 2, 2013, 5:18 p.m.
Machine Learning libraries for Go Lang - Linear regression, Logistic regression, etc.
created at Dec. 1, 2013, 4:02 p.m.
Open solution to the Toxic Comment Classification Challenge
created at Jan. 4, 2018, 4:29 p.m.
.NET Standard bindings for Apache MxNet with Imperative, Symbolic and Gluon Interface for developing, training and deploying Machine Learning models in C#. https://mxnet.tech-quantum.com/
created at May 2, 2019, 2:14 a.m.
A Julia package for non-negative matrix factorization
created at Feb. 8, 2014, 3:25 p.m.
doddle-model: machine learning in Scala.
created at Feb. 9, 2018, 1:54 p.m.
Kernel density estimators for Julia
created at April 13, 2014, 7:14 p.m.
Kaggle Submission for "Detecting Insults in Social Commentary"
created at Sept. 22, 2012, 2:16 p.m.
Julia package for loading many of the data sets available in R
created at Nov. 24, 2012, 5:16 a.m.
A Julia package for Gaussian Processes
created at April 30, 2015, 2:46 p.m.
TensorFlow C API Class Wrapper in Server Side Swift.
created at June 14, 2017, 2:06 a.m.
Python package for consolidated and extensive Univariate,Bivariate Data Analysis and Visualization catering to both categorical and continuous datasets.
created at July 30, 2016, 2:49 p.m.