ruby-ngram by tkellen

Break words and phrases into ngrams.

created at Dec. 12, 2013, 3:07 p.m.

Ruby

4 +0

12 +0

1 +0

GitHub
stopwords-filter by brenes

Project for filtering stopwords

created at Aug. 12, 2012, 3:59 p.m.

Ruby

4 +0

75 +0

46 +0

GitHub
wlapi by arbox

Ruby based API for the project Wortschatz Leipzig.

created at Oct. 26, 2011, 1:37 p.m.

Ruby

4 +0

19 +0

8 +0

GitHub
CommonRegexRuby by talyssonoc

Find a lot of kinds of common information in a string. CommonRegex port for Ruby

created at Jan. 23, 2015, 12:39 p.m.

Ruby

4 +0

79 +0

5 +0

GitHub
open_nlp by hck

JRuby tools wrapper for Apache OpenNLP

created at Sept. 21, 2012, 9:36 a.m.

Ruby

4 +0

11 +0

2 +0

GitHub
nlp_toolz by LeFnord

wrapper for basic nlp tools

created at Dec. 7, 2012, 10:27 a.m.

Ruby

4 +0

2 +0

1 +0

GitHub
weka-jruby by paulgoetze

Machine Learning & Data Mining with JRuby

created at Dec. 2, 2015, 6:58 p.m.

Ruby

4 +0

66 +0

8 +0

GitHub
maxent_string_classifier by mccraigmccraig

a JRuby maximum entropy classifier for string data, based on the OpenNLP Maxent framework

created at April 22, 2009, 3:23 p.m.

Ruby

4 +0

9 +0

5 +0

GitHub
tickle by yb66

Natural language parser for recurring events

created at Jan. 12, 2012, 2:59 a.m.

Ruby

3 +0

78 +0

13 +0

GitHub
levenshtein-ffi by dbalatero

Fast string edit distance computation, using the Damerau-Levenshtein algorithm.

created at April 7, 2010, 5:30 p.m.

Ruby

3 +0

149 +0

24 +0

GitHub
linnaeus by djcp

A redis-backed Bayesian classifier

created at Oct. 27, 2012, 5:59 p.m.

Ruby

3 +0

37 +0

11 +0

GitHub
stimmung by pachacamac

Sentiment analysis for the German language

created at June 30, 2015, 1:27 a.m.

Ruby

3 +0

20 +0

5 +0

GitHub
TranslitKit by AnalyzePlatypus

Hebrew - English Transliteration Engine

created at Feb. 23, 2017, 1:13 a.m.

Ruby

3 +0

7 +0

2 +0

GitHub
numerizer by jduff

Parse numbers in natural language from strings (ex forty two).

created at Dec. 25, 2009, 4:48 a.m.

Ruby

3 +0

36 +0

13 +0

GitHub
pwrake by masa16

Parallel Workflow extension for Rake, runs on multicores, clusters, clouds.

created at April 13, 2012, 11 a.m.

Ruby

3 +0

57 +0

4 +0

GitHub
composable_operations by t6d

Composable Operations is a tool set for creating operations and assembling multiple of these operations in operation pipelines.

created at June 10, 2013, 10:04 a.m.

Ruby

3 +0

47 +0

7 +0

GitHub
iuliia-rb by adnikiforov

Russian transliteration using nalgeon/iuliia schemas

created at April 30, 2020, 10:08 a.m.

Ruby

3 +0

10 +0

1 +0

GitHub
textoken by manorie

Simple and customizable text tokenization gem.

created at Sept. 23, 2015, 1:34 p.m.

Ruby

3 +0

31 +0

3 +0

GitHub
uea-stemmer by ealdent

Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing

created at July 15, 2009, 1 p.m.

Ruby

3 +0

52 +0

5 +0

GitHub
nickel by iainbeeston

Nickel extracts date, time, and message information from naturally worded text.

created at June 28, 2013, 9:10 p.m.

Ruby

3 +0

112 +0

17 +0

GitHub