re2 by mudge

Ruby bindings to RE2, a "fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python".

created at July 24, 2010, 7:22 p.m.

Ruby

5 +0

128 +1

14 +0

GitHub
numerizer by jduff

Parse numbers in natural language from strings (e.g., "forty two").

created at Dec. 25, 2009, 4:48 a.m.

Ruby

3 +0

36 +0

13 +0

GitHub
Tactful_Tokenizer by zencephalon

Accurate Bayesian sentence tokenizer in Ruby.

created at March 10, 2010, 3:17 a.m.

Ruby

5 +0

79 +0

13 +0

GitHub
tickle by yb66

Natural language parser for recurring events

created at Jan. 12, 2012, 2:59 a.m.

Ruby

3 +0

78 +0

13 +0

GitHub
liblinear-ruby-swig by tomz

This is the Ruby interface to LIBLINEAR (much more efficient than LIBSVM for text classification and other large linear classifications)

created at Feb. 28, 2009, 7:17 p.m.

C++

5 +0

83 +0

12 +0

GitHub
kronic by xaviershay

A dirt-simple library for parsing and formatting human-readable dates

created at Sept. 19, 2010, 4:59 p.m.

Ruby

5 +0

151 +0

12 +0

GitHub
pragmatic_tokenizer by diasks2

A multilingual tokenizer to split a string into tokens

created at Jan. 5, 2016, 7:30 a.m.

Ruby

6 +0

90 +0

11 +0

GitHub
linnaeus by djcp

A redis-backed Bayesian classifier

created at Oct. 27, 2012, 5:59 p.m.

Ruby

3 +0

37 +0

11 +0

GitHub
Hunspell by segabor

Ruby wrapper for the famous spell-checker library Hunspell.

created at May 4, 2011, 4:50 a.m.

C

4 +0

33 +0

11 +0

GitHub
tokenizer by arbox

A simple tokenizer in Ruby for NLP tasks.

created at Aug. 23, 2011, 3:38 p.m.

Ruby

5 +0

45 +0

11 +0

GitHub
open-nlp by louismullie

Ruby bindings to the OpenNLP Java toolkit.

created at Dec. 19, 2012, 2:44 a.m.

Ruby

9 +0

91 +0

11 +0

GitHub
unicode by blackwinter

Unicode normalization library. (Mirror of Yoshida-san's code base to maintain the RubyGem.)

created at March 1, 2010, 12:26 p.m.

C

6 +0

79 +0

10 +0

GitHub
punkt-segmenter by lfcipriani

Ruby port of the NLTK Punkt sentence segmentation algorithm

created at June 26, 2010, 2:17 a.m.

Ruby

2 +0

92 +0

10 +0

GitHub
microsoft_translator by ikayzo

Ruby client for the Microsoft Translator API

created at June 9, 2012, 5:40 p.m.

Ruby

5 +0

21 +0

10 +0

GitHub
scylla by hashwin

No description provided.

created at Aug. 26, 2011, 3:46 p.m.

Ruby

2 +0

36 +0

8 +0

GitHub
weka-jruby by paulgoetze

Machine Learning & Data Mining with JRuby

created at Dec. 2, 2015, 6:58 p.m.

Ruby

4 +0

66 +1

8 +0

GitHub
Naive-Bayes by reddavis

Simple Naive Bayes classifier

created at Nov. 14, 2009, 8:32 p.m.

Ruby

6 +0

48 +0

8 +0

GitHub
fuzzy_tools by brianhempel

Fuzzy document finding in Ruby

created at May 29, 2012, 4:49 p.m.

Ruby

2 +0

23 +0

8 +0

GitHub
wlapi by arbox

Ruby-based API for the Wortschatz Leipzig project.

created at Oct. 26, 2011, 1:37 p.m.

Ruby

4 +0

19 +0

8 +0

GitHub
iuliia by nalgeon

Transliterate Cyrillic → Latin in every possible way

created at April 27, 2020, 6:09 p.m.

Unknown languages

5 +0

68 +0

8 +0

GitHub