ucto by LanguageMachines

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --

updated at May 21, 2024, 11:41 a.m.

C++

13 +0

63 +1

13 +0

GitHub
deepdetect by jolibrain

Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE

updated at May 22, 2024, 3:10 p.m.

C++

131 +0

2,501 +0

560 +0

GitHub
xlearn by aksnzhy

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

updated at May 22, 2024, 6:10 p.m.

C++

110 +0

3,078 -1

520 +1

GitHub
turicreate by apple

Turi Create simplifies the development of custom machine learning models.

updated at May 23, 2024, 1:42 a.m.

C++

338 +0

11,159 +5

1,133 -1

GitHub
thundersvm by Xtra-Computing

ThunderSVM: A Fast SVM Library on GPUs and CPUs

updated at May 23, 2024, 6:54 a.m.

C++

56 +0

1,541 +1

213 +0

GitHub
grt by nickgillian

gesture recognition toolkit

updated at May 23, 2024, 8:41 a.m.

C++

92 +0

852 +1

286 +0

GitHub
libfolia by LanguageMachines

FoLiA library for C++

updated at May 23, 2024, 1:22 p.m.

C++

10 +0

14 +0

7 +0

GitHub
neuraln by totemstech

None

updated at May 23, 2024, 4:23 p.m.

C++

11 +0

276 +1

26 +0

GitHub
bigartm by bigartm

Fast topic modeling platform

updated at May 24, 2024, 7:19 a.m.

C++

41 +0

662 +0

117 +0

GitHub
oneDAL by oneapi-src

oneAPI Data Analytics Library (oneDAL)

updated at May 24, 2024, 11:09 a.m.

C++

48 +0

593 +0

209 +0

GitHub
thundergbm by Xtra-Computing

ThunderGBM: Fast GBDTs and Random Forests on GPUs

updated at May 25, 2024, 2:16 a.m.

C++

25 +0

689 +1

84 +0

GitHub
oneDNN by oneapi-src

oneAPI Deep Neural Network Library (oneDNN)

updated at May 25, 2024, 7:09 a.m.

C++

184 +0

3,476 +4

957 -1

GitHub
Fido by FidoProject

A lightweight C++ machine learning library for embedded electronics and robotics.

updated at May 25, 2024, 8 a.m.

C++

37 +0

429 +1

82 +0

GitHub
ViZDoom by Farama-Foundation

Reinforcement Learning environments based on the 1993 game Doom godmode

updated at May 25, 2024, 9:22 a.m.

C++

50 +0

1,678 +5

396 +1

GitHub
dynet by clab

DyNet: The Dynamic Neural Network Toolkit

updated at May 25, 2024, 9:49 a.m.

C++

185 +0

3,411 +2

706 +0

GitHub
xad by auto-differentiation

Comprehensive automatic differentiation in C++

updated at May 25, 2024, 9:53 a.m.

C++

9 +0

215 -1

17 +0

GitHub
zpar by frcchang

ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for Chinese and English. Currently support word segmentation, POS tagging, dependency and phrase-structure parsing.

updated at May 25, 2024, 1:42 p.m.

C++

13 +0

133 -1

33 +0

GitHub
vowpal_wabbit by VowpalWabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

updated at May 25, 2024, 2:09 p.m.

C++

350 +1

8,418 +3

1,927 +0

GitHub
frog by LanguageMachines

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

updated at May 25, 2024, 6:15 p.m.

C++

16 +0

73 +0

11 +0

GitHub
shogun by shogun-toolbox

Shōgun

updated at May 25, 2024, 7:12 p.m.

C++

217 +0

3,009 +0

1,038 +2

GitHub