Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --
updated at May 21, 2024, 11:41 a.m.
Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE
updated at May 22, 2024, 3:10 p.m.
Turi Create simplifies the development of custom machine learning models.
updated at May 23, 2024, 1:42 a.m.
ThunderSVM: A Fast SVM Library on GPUs and CPUs
updated at May 23, 2024, 6:54 a.m.
ThunderGBM: Fast GBDTs and Random Forests on GPUs
updated at May 25, 2024, 2:16 a.m.
A lightweight C++ machine learning library for embedded electronics and robotics.
updated at May 25, 2024, 8 a.m.
Reinforcement Learning environments based on the 1993 game Doom
updated at May 25, 2024, 9:22 a.m.
Comprehensive automatic differentiation in C++
updated at May 25, 2024, 9:53 a.m.
ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for Chinese and English. Currently support word segmentation, POS tagging, dependency and phrase-structure parsing.
updated at May 25, 2024, 1:42 p.m.
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
updated at May 25, 2024, 2:09 p.m.
Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
updated at May 25, 2024, 6:15 p.m.