weka-jruby by paulgoetze

Machine Learning & Data Mining with JRuby

created at Dec. 2, 2015, 6:58 p.m.

Ruby

4 +0

66 +0

8 +0

GitHub
raingrams by postmodern

A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.

created at March 8, 2009, 10:54 a.m.

Ruby

7 +0

69 +0

7 +0

GitHub
stopwords-filter by brenes

Project for filtering stopwords

created at Aug. 12, 2012, 3:59 p.m.

Ruby

4 +0

75 +0

46 +0

GitHub
tickle by yb66

Natural language parser for recurring events

created at Jan. 12, 2012, 2:59 a.m.

Ruby

3 +0

78 +0

13 +0

GitHub
CommonRegexRuby by talyssonoc

Find a lot of kinds of common information in a string. CommonRegex port for Ruby

created at Jan. 23, 2015, 12:39 p.m.

Ruby

4 +0

79 +0

5 +0

GitHub
monkeylearn-ruby by monkeylearn

Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.

created at Oct. 16, 2015, 4:30 p.m.

Ruby

15 +0

80 +0

14 +0

GitHub
Tactful_Tokenizer by zencephalon

Accurate Bayesian sentence tokenizer in Ruby.

created at March 10, 2010, 3:17 a.m.

Ruby

5 +0

80 +1

13 +0

GitHub
hotwater by colinsurprenant

Fast Ruby FFI string edit distance algorithms

created at Feb. 25, 2013, 6:55 a.m.

Ruby

5 +0

81 +0

1 +0

GitHub
rwordnet by doches

A pure Ruby interface to the WordNet database

created at Nov. 10, 2008, 7:21 p.m.

Ruby

7 +0

88 +0

26 +0

GitHub
pragmatic_tokenizer by diasks2

A multilingual tokenizer to split a string into tokens

created at Jan. 5, 2016, 7:30 a.m.

Ruby

6 +0

90 +0

11 +0

GitHub
open-nlp by louismullie

Ruby bindings to the OpenNLP Java toolkit.

created at Dec. 19, 2012, 2:44 a.m.

Ruby

9 +0

91 +0

11 +0

GitHub
ruby-nlp by tiendung

Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer

created at Aug. 11, 2008, 10:50 a.m.

Ruby

11 +0

92 +0

14 +0

GitHub
punkt-segmenter by lfcipriani

Ruby port of the NLTK Punkt sentence segmentation algorithm

created at June 26, 2010, 2:17 a.m.

Ruby

2 +0

92 +0

10 +0

GitHub
tts by c2h2

A ruby gem for Text-To-Speech by using google translate service.

created at June 15, 2011, 8:38 a.m.

Ruby

7 +0

93 +0

27 +0

GitHub
rsyntaxtree by yohasebe

Syntax tree generator for linguistic research

created at Nov. 29, 2009, 2:14 p.m.

Ruby

7 +0

95 +0

16 +0

GitHub
lemmatizer by yohasebe

Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy

created at Oct. 27, 2012, 11:16 p.m.

Ruby

8 +0

108 +0

15 +0

GitHub
nickel by iainbeeston

Nickel extracts date, time, and message information from naturally worded text.

created at June 28, 2013, 9:10 p.m.

Ruby

3 +0

112 +0

17 +0

GitHub
re2 by mudge

Ruby bindings to RE2, a "fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python".

created at July 24, 2010, 7:22 p.m.

Ruby

5 +0

128 +0

14 +0

GitHub
damerau-levenshtein by GlobalNamesArchitecture

Calculates edit distance using Damerau-Levenshtein algorithm

created at July 21, 2011, 7:54 p.m.

Ruby

7 +0

136 +0

19 +0

GitHub
levenshtein-ffi by dbalatero

Fast string edit distance computation, using the Damerau-Levenshtein algorithm.

created at April 7, 2010, 5:30 p.m.

Ruby

3 +0

149 +0

24 +0

GitHub