wiki2vec by idio

Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby

created at Feb. 10, 2015, 9:20 p.m.

Java

45 +0

600 +0

137 +0

GitHub
Multilingual-Latent-Dirichlet-Allocation-LDA by ArtificiAI

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

created at Sept. 3, 2018, 9:52 p.m.

Python

10 +0

80 +0

29 +0

GitHub
ixa-pipe-pos by ixa-ehu

IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes)

created at June 7, 2013, 9:43 a.m.

Java

9 +0

17 +0

15 +0

GitHub
LatinamericanTextResources by dav009

A collection of Latinamerican Corpora, dictionaries to serve as resources for Text processing and Text mining.

created at July 26, 2013, 9:21 p.m.

Unknown languages

4 +0

6 +0

4 +0

GitHub
estem by MaG21

Spanish stemming

created at May 21, 2012, 7:28 p.m.

Ruby

4 +0

3 +0

0 +0

GitHub