ucto by LanguageMachines

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --

created at March 26, 2013, 11:16 a.m.

C++

13 +0

61 +0

13 +0

GitHub
neuraln by totemstech

None

created at Sept. 17, 2014, 5:25 p.m.

C++

11 +0

275 +0

26 +0

GitHub
colibri-core by proycon

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.

created at Sept. 21, 2013, 2:12 p.m.

C++

11 +0

123 +0

19 +0

GitHub
libfolia by LanguageMachines

FoLiA library for C++

created at March 26, 2013, 12:46 p.m.

C++

10 +0

14 +0

7 +0

GitHub
sunnet by Tyill

tiny neural net

created at Aug. 13, 2018, 12:59 p.m.

C++

10 +0

61 +0

24 +0

GitHub
xad by auto-differentiation

Comprehensive automatic differentiation in C++

created at July 7, 2022, 2 p.m.

C++

9 +0

215 +0

16 +1

GitHub
banditlib by jkomiyama

Multi-armed bandit simulation library

created at March 11, 2014, 4:09 a.m.

C++

8 +0

136 +1

45 +0

GitHub
LKYDeepNN by mosdeo

Low dependency(C++11 STL only), good portability, header-only, deep neural networks for embedded

created at Dec. 31, 2016, 12:05 p.m.

C++

8 +0

48 +0

13 +0

GitHub
lua---opengm by clementfarabet

An OpenGM wrapper for Lua.

created at Sept. 23, 2011, 6:51 a.m.

C++

5 +0

8 +0

0 +0

GitHub
torch-svm by koraykv

None

created at Oct. 1, 2012, 2:34 p.m.

C++

5 +0

43 +0

26 +0

GitHub
vowpal_wabbit by clementfarabet

John Langford's original release of Vowpal Wabbit -- a fast online learning algorithm

created at Oct. 31, 2011, 8:31 p.m.

C++

4 +0

2 +0

0 +0

GitHub
raku-Algorithm-LibSVM by titsuki

A Raku bindings for libsvm

created at Jan. 4, 2017, 2:40 a.m.

C++

2 +0

8 +0

2 +0

GitHub
LuaSHkit by ocallaco

Lua wrapper for shkit

created at April 1, 2014, 5 p.m.

C++

1 +0

3 +0

0 +0

GitHub