going_the_distance by schneems

Distance Measurements are Awesome!

created at Sept. 18, 2014, 7:38 a.m.

Ruby

5 +0

61 +0

6 +0

GitHub
ffi-hunspell by postmodern

Ruby FFI bindings for Hunspell.

created at Oct. 6, 2010, 5:41 a.m.

Ruby

5 +0

48 +0

24 +0

GitHub
liblinear-ruby-swig by tomz

This is the Ruby interface to LIBLINEAR (much more efficient than LIBSVM for text classification and other large linear classifications)

created at Feb. 28, 2009, 7:17 p.m.

C++

5 +0

83 +0

12 +0

GitHub
iuliia by nalgeon

Transliterate Cyrillic → Latin in every possible way

created at April 27, 2020, 6:09 p.m.

Unknown languages

5 +0

69 +0

8 +0

GitHub
alchemyapi_ruby by AlchemyAPI

A sdk for AlchemyAPI using Ruby - Please note that this legacy AlchemyAPI SDK is no longer supported by IBM. Please use the Watson SDKs https://github.com/watson-developer-cloud?utf8=✓&query=sdk

created at Oct. 5, 2013, 1:11 p.m.

Ruby

5 +0

36 +0

28 +0

GitHub
microsoft_translator by ikayzo

Ruby client for the microsoft translator API

created at June 9, 2012, 5:40 p.m.

Ruby

5 +0

21 +0

10 +0

GitHub
Tactful_Tokenizer by zencephalon

Accurate Bayesian sentence tokenizer in Ruby.

created at March 10, 2010, 3:17 a.m.

Ruby

5 +0

80 +0

13 +0

GitHub
kronic by xaviershay

A dirt simple library for parsing and formatting human readable dates

created at Sept. 19, 2010, 4:59 p.m.

Ruby

5 +0

150 +0

12 +0

GitHub
lda-ruby by ealdent

A Ruby wrapper for Latent Dirichlet Allocation (LDA).

created at Nov. 14, 2008, 8:54 p.m.

C

5 +0

133 +0

32 +0

GitHub
N-Gram by reddavis

N-Gram generator in Ruby - http://en.wikipedia.org/wiki/N-gram

created at Dec. 18, 2009, 3:30 p.m.

Ruby

5 +0

37 +0

6 +0

GitHub
hotwater by colinsurprenant

Fast Ruby FFI string edit distance algorithms

created at Feb. 25, 2013, 6:55 a.m.

Ruby

5 +0

81 +0

1 +0

GitHub
unicode by blackwinter

Unicode normalization library. (Mirror of Yoshida-san's code base to maintain the RubyGem.)

created at March 1, 2010, 12:26 p.m.

C

5 +0

80 +0

13 +0

GitHub
re2 by mudge

Ruby bindings to RE2, a "fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python".

created at July 24, 2010, 7:22 p.m.

Ruby

5 +0

130 +0

13 +0

GitHub
tokenizer by arbox

A simple tokenizer in Ruby for NLP tasks.

created at Aug. 23, 2011, 3:38 p.m.

Ruby

5 +0

46 +0

11 +0

GitHub
Naive-Bayes by reddavis

Simple Naive Bayes classifier

created at Nov. 14, 2009, 8:32 p.m.

Ruby

6 +0

49 +0

8 +0

GitHub
pragmatic_tokenizer by diasks2

A multilingual tokenizer to split a string into tokens

created at Jan. 5, 2016, 7:30 a.m.

Ruby

6 +0

90 +0

11 +0

GitHub
raingrams by postmodern

A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.

created at March 8, 2009, 10:54 a.m.

Ruby

7 +0

69 +0

7 +0

GitHub
rwordnet by doches

A pure Ruby interface to the WordNet database

created at Nov. 10, 2008, 7:21 p.m.

Ruby

7 +0

89 +0

27 +0

GitHub
tts by c2h2

A ruby gem for Text-To-Speech by using google translate service.

created at June 15, 2011, 8:38 a.m.

Ruby

7 +0

93 +0

28 +0

GitHub
TF-IDF by reddavis

Term Frequency - Inverse Document Frequency in Ruby

created at Dec. 18, 2009, 3:23 p.m.

Ruby

7 +0

35 +0

6 +0

GitHub