frog by LanguageMachines

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

updated at May 4, 2024, 6:48 a.m.

C++

16 +0

73 +0

11 +0

GitHub
ucto by LanguageMachines

Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation. Ucto comes with tokenisation rules for several languages and can be easily extended to suit other languages. It has been incorporated for tokenizing Dutch text in Frog, our Dutch morpho-syntactic processor. http://ilk.uvt.nl/ucto --

updated at May 4, 2024, 7:12 a.m.

C++

13 +0

61 +0

13 +0

GitHub
Impala by cloudera

Real-time Query for Hadoop; mirror of Apache Impala

updated at May 4, 2024, 11 a.m.

C++

14 +0

30 +0

32 +0

GitHub
Fido by FidoProject

A lightweight C++ machine learning library for embedded electronics and robotics.

updated at May 6, 2024, 9:40 p.m.

C++

37 +0

427 +1

83 +0

GitHub
colibri-core by proycon

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.

updated at May 7, 2024, 4:20 a.m.

C++

11 +0

123 +0

19 +0

GitHub
dynet by clab

DyNet: The Dynamic Neural Network Toolkit

updated at May 7, 2024, 1:40 p.m.

C++

184 +0

3,407 +1

705 +0

GitHub
amazon-dsstne by amazon-archives

Deep Scalable Sparse Tensor Network Engine (DSSTNE) is an Amazon developed library for building Deep Learning (DL) machine learning (ML) models

updated at May 8, 2024, 2:37 a.m.

C++

341 +0

4,412 -1

735 +0

GitHub
rgf by RGF-team

Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one written in C++, along with various language-specific wrappers.

updated at May 8, 2024, 9:22 a.m.

C++

18 +0

372 +1

56 +0

GitHub
banditlib by jkomiyama

Multi-armed bandit simulation library

updated at May 8, 2024, 10:58 p.m.

C++

8 +0

136 +1

45 +0

GitHub
wav2letter by flashlight

Facebook AI Research's Automatic Speech Recognition Toolkit

updated at May 9, 2024, 6:22 a.m.

C++

245 +0

6,336 +3

1,015 +0

GitHub
ViZDoom by Farama-Foundation

Reinforcement Learning environments based on the 1993 game Doom godmode

updated at May 9, 2024, 10:32 a.m.

C++

50 -1

1,673 +3

395 +0

GitHub
MITIE by mit-nlp

MITIE: library and tools for information extraction

updated at May 10, 2024, 6:34 a.m.

C++

193 +0

2,905 +1

537 +0

GitHub
vowpal_wabbit by VowpalWabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

updated at May 10, 2024, 10:29 a.m.

C++

349 +0

8,409 +5

1,927 -1

GitHub
oneDAL by oneapi-src

oneAPI Data Analytics Library (oneDAL)

updated at May 10, 2024, 10:44 a.m.

C++

47 +0

593 +0

208 +0

GitHub
shogun by shogun-toolbox

Shōgun

updated at May 10, 2024, 8:25 p.m.

C++

217 +0

3,009 +3

1,036 +0

GitHub
oneDNN by oneapi-src

oneAPI Deep Neural Network Library (oneDNN)

updated at May 11, 2024, 7:11 a.m.

C++

184 -2

3,471 +4

955 +1

GitHub
infinity by infiniflow

The AI-native database built for LLM applications, providing incredibly fast full-text and vector search

updated at May 11, 2024, 7:39 a.m.

C++

23 +0

1,786 +82

147 +9

GitHub
thundersvm by Xtra-Computing

ThunderSVM: A Fast SVM Library on GPUs and CPUs

updated at May 11, 2024, 9:14 a.m.

C++

56 -1

1,537 -1

213 +0

GitHub
deepdetect by jolibrain

Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE

updated at May 11, 2024, 9:32 a.m.

C++

131 +0

2,500 +2

560 +0

GitHub
mxnet by apache

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

updated at May 11, 2024, 10:14 a.m.

C++

1,069 +0

20,713 -1

6,812 -2

GitHub