damerau-levenshtein by GlobalNamesArchitecture

Calculates edit distance using Damerau-Levenshtein algorithm

created at July 21, 2011, 7:54 p.m.

Ruby

7 +0

136 +1

18 +0

GitHub
tts by c2h2

A ruby gem for Text-To-Speech by using google translate service.

created at June 15, 2011, 8:38 a.m.

Ruby

7 +0

93 +1

27 +0

GitHub
unicode by blackwinter

Unicode normalization library. (Mirror of Yoshida-san's code base to maintain the RubyGem.)

created at March 1, 2010, 12:26 p.m.

C

6 +0

79 +0

10 +0

GitHub
Naive-Bayes by reddavis

Simple Naive Bayes classifier

created at Nov. 14, 2009, 8:32 p.m.

Ruby

6 +0

48 +0

8 +0

GitHub
pragmatic_tokenizer by diasks2

A multilingual tokenizer to split a string into tokens

created at Jan. 5, 2016, 7:30 a.m.

Ruby

6 +0

90 +0

11 +0

GitHub
tokenizer by arbox

A simple tokenizer in Ruby for NLP tasks.

created at Aug. 23, 2011, 3:38 p.m.

Ruby

5 +0

45 +0

11 +0

GitHub
kronic by xaviershay

A dirt simple library for parsing and formatting human readable dates

created at Sept. 19, 2010, 4:59 p.m.

Ruby

5 +0

151 +0

12 +0

GitHub
lda-ruby by ealdent

A Ruby wrapper for Latent Dirichlet Allocation (LDA).

created at Nov. 14, 2008, 8:54 p.m.

C

5 +0

133 +0

32 +0

GitHub
liblinear-ruby-swig by tomz

This is the Ruby interface to LIBLINEAR (much more efficient than LIBSVM for text classification and other large linear classifications)

created at Feb. 28, 2009, 7:17 p.m.

C++

5 +0

83 +0

12 +0

GitHub
microsoft_translator by ikayzo

Ruby client for the microsoft translator API

created at June 9, 2012, 5:40 p.m.

Ruby

5 +0

21 +0

10 +0

GitHub
ffi-hunspell by postmodern

Ruby FFI bindings for Hunspell.

created at Oct. 6, 2010, 5:41 a.m.

Ruby

5 +0

48 +0

24 +0

GitHub
hotwater by colinsurprenant

Fast Ruby FFI string edit distance algorithms

created at Feb. 25, 2013, 6:55 a.m.

Ruby

5 +0

81 +0

1 +0

GitHub
N-Gram by reddavis

N-Gram generator in Ruby - http://en.wikipedia.org/wiki/N-gram

created at Dec. 18, 2009, 3:30 p.m.

Ruby

5 +0

37 +0

6 +0

GitHub
Tactful_Tokenizer by zencephalon

Accurate Bayesian sentence tokenizer in Ruby.

created at March 10, 2010, 3:17 a.m.

Ruby

5 +0

79 +0

13 +0

GitHub
alchemyapi_ruby by AlchemyAPI

A sdk for AlchemyAPI using Ruby - Please note that this legacy AlchemyAPI SDK is no longer supported by IBM. Please use the Watson SDKs https://github.com/watson-developer-cloud?utf8=✓&query=sdk

created at Oct. 5, 2013, 1:11 p.m.

Ruby

5 +0

36 +0

28 +0

GitHub
re2 by mudge

Ruby bindings to RE2, a "fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python".

created at July 24, 2010, 7:22 p.m.

Ruby

5 +0

128 +1

14 +0

GitHub
iuliia by nalgeon

Transliterate Cyrillic → Latin in every possible way

created at April 27, 2020, 6:09 p.m.

Unknown languages

5 +0

68 +0

8 +0

GitHub
going_the_distance by schneems

Distance Measurements are Awesome!

created at Sept. 18, 2014, 7:38 a.m.

Ruby

5 +0

61 +0

6 +0

GitHub
ruby-ngram by tkellen

Break words and phrases into ngrams.

created at Dec. 12, 2013, 3:07 p.m.

Ruby

4 +0

12 +0

1 +0

GitHub
stopwords-filter by brenes

Project for filtering stopwords

created at Aug. 12, 2012, 3:59 p.m.

Ruby

4 +0

75 +0

46 +0

GitHub