josephmisiti/awesome-machine-learning

ucto by LanguageMachines

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --

updated at May 21, 2024, 11:41 a.m.

C++

13 +0

63 +1

13 +0

GitHub

deepdetect by jolibrain

Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE

updated at May 22, 2024, 3:10 p.m.

C++

131 +0

2,501 +0

560 +0

GitHub

xlearn by aksnzhy

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

updated at May 22, 2024, 6:10 p.m.

C++

110 +0

3,078 -1

520 +1

GitHub

turicreate by apple

Turi Create simplifies the development of custom machine learning models.

updated at May 23, 2024, 1:42 a.m.

C++

338 +0

11,159 +5

1,133 -1

GitHub

thundersvm by Xtra-Computing

ThunderSVM: A Fast SVM Library on GPUs and CPUs

updated at May 23, 2024, 6:54 a.m.

C++

56 +0

1,541 +1

213 +0

GitHub

grt by nickgillian

gesture recognition toolkit

updated at May 23, 2024, 8:41 a.m.

C++

92 +0

852 +1

286 +0

GitHub

libfolia by LanguageMachines

FoLiA library for C++

updated at May 23, 2024, 1:22 p.m.

C++

10 +0

14 +0

7 +0

GitHub

neuraln by totemstech

None

updated at May 23, 2024, 4:23 p.m.

C++

11 +0

276 +1

26 +0

GitHub

bigartm by bigartm

Fast topic modeling platform

updated at May 24, 2024, 7:19 a.m.

C++

41 +0

662 +0

117 +0

GitHub

oneDAL by oneapi-src

oneAPI Data Analytics Library (oneDAL)

updated at May 24, 2024, 11:09 a.m.

C++

48 +0

593 +0

209 +0

GitHub

thundergbm by Xtra-Computing

ThunderGBM: Fast GBDTs and Random Forests on GPUs

updated at May 25, 2024, 2:16 a.m.

C++

25 +0

689 +1

84 +0

GitHub

oneDNN by oneapi-src

oneAPI Deep Neural Network Library (oneDNN)

updated at May 25, 2024, 7:09 a.m.

C++

184 +0

3,476 +4

957 -1

GitHub

Fido by FidoProject

A lightweight C++ machine learning library for embedded electronics and robotics.

updated at May 25, 2024, 8 a.m.

C++

37 +0

429 +1

82 +0

GitHub

ViZDoom by Farama-Foundation

Reinforcement Learning environments based on the 1993 game Doom

updated at May 25, 2024, 9:22 a.m.

C++

50 +0

1,678 +5

396 +1

GitHub

dynet by clab

DyNet: The Dynamic Neural Network Toolkit

updated at May 25, 2024, 9:49 a.m.

C++

185 +0

3,411 +2

706 +0

GitHub

xad by auto-differentiation

Comprehensive automatic differentiation in C++

updated at May 25, 2024, 9:53 a.m.

C++

9 +0

215 -1

17 +0

GitHub

zpar by frcchang

ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for Chinese and English. Currently support word segmentation, POS tagging, dependency and phrase-structure parsing.

updated at May 25, 2024, 1:42 p.m.

C++

13 +0

133 -1

33 +0

GitHub

vowpal_wabbit by VowpalWabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

updated at May 25, 2024, 2:09 p.m.

C++

350 +1

8,418 +3

1,927 +0

GitHub

frog by LanguageMachines

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

updated at May 25, 2024, 6:15 p.m.

C++

16 +0

73 +0

11 +0

GitHub

shogun by shogun-toolbox

Shōgun

updated at May 25, 2024, 7:12 p.m.

C++

217 +0

3,009 +0

1,038 +2

GitHub

All languages 773 Python 270 C++ 53 Julia 53 Go 45 Jupyter Notebook 45 JavaScript 42 Clojure 34 Scala 29 Lua 24 C 22 Java 21 Ruby 17 Rust 15 HTML 9 Objective-C 9 Unknown languages 9 Swift 8 TypeScript 8 Haskell 6 Cuda 5 R 5 PHP 4 C# 3 Dockerfile 3 MATLAB 3 Common Lisp 2 Crystal 2 Cython 2 Elixir 2 Fortran 2 OCaml 2 OpenEdge ABL 2 SAS 2 Shell 2 TeX 2 APL 1 CoffeeScript 1 GAP 1 Gleam 1 Kotlin 1 Makefile 1 Matlab 1 Perl 1 PostScript 1 Raku 1 Scheme 1