A variety of loaders for various NLP corpora.
created at Aug. 26, 2016, 8:02 a.m.
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --
created at March 26, 2013, 11:16 a.m.
A library for machine learning that builds predictions using a linear regression.
created at June 8, 2015, 4:52 p.m.
Interactive arts and charts plotting with Clojure(Script) and Vega-lite / Vega. Flower viewing 花見 (hanami)
created at Sept. 14, 2017, 8:53 p.m.
An implementation of Dell Zhang's solution to Wikipedia's Participation Challenge on Kaggle
created at Nov. 22, 2011, 5:38 a.m.
Code for Accelerometer Biometric Competition at Kaggle
created at Aug. 1, 2013, 2:01 p.m.
Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)
created at Sept. 7, 2014, 8:32 p.m.
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
created at Oct. 30, 2020, 3:25 p.m.