Naive-Bayes by reddavis

Simple Naive Bayes classifier

created at Nov. 14, 2009, 8:32 p.m.

Ruby

6 +0

49 +0

8 +0

GitHub
scalpel by louismullie

A fast and accurate rule-based sentence segmentation tool for Ruby.

created at Aug. 15, 2012, 5:14 a.m.

Ruby

8 +0

51 +0

5 +0

GitHub
uea-stemmer by ealdent

Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing

created at July 15, 2009, 1 p.m.

Ruby

3 +0

53 +0

5 +0

GitHub
pwrake by masa16

Parallel Workflow extension for Rake, runs on multicores, clusters, clouds.

created at April 13, 2012, 11 a.m.

Ruby

3 +0

57 +0

4 +0

GitHub
going_the_distance by schneems

Distance Measurements are Awesome!

created at Sept. 18, 2014, 7:38 a.m.

Ruby

5 +0

61 +0

6 +0

GitHub
ruby-spacy by yohasebe

A wrapper module for using the spaCy natural language processing library from the Ruby programming language via PyCall

created at June 19, 2021, 2:04 a.m.

Ruby

8 +0

63 +0

5 +0

GitHub
weka-jruby by paulgoetze

Machine Learning & Data Mining with JRuby

created at Dec. 2, 2015, 6:58 p.m.

Ruby

4 +0

65 +0

8 +0

GitHub
iuliia by nalgeon

Transliterate Cyrillic → Latin in every possible way

created at April 27, 2020, 6:09 p.m.

Unknown languages

5 +0

69 +0

8 +0

GitHub
raingrams by postmodern

A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.

created at March 8, 2009, 10:54 a.m.

Ruby

7 +0

69 +0

7 +0

GitHub
stopwords-filter by brenes

Project for filtering stopwords

created at Aug. 12, 2012, 3:59 p.m.

Ruby

4 +0

77 +0

51 +1

GitHub
CommonRegexRuby by talyssonoc

Find many kinds of common information in a string. A CommonRegex port for Ruby.

created at Jan. 23, 2015, 12:39 p.m.

Ruby

4 +0

79 +0

5 +0

GitHub
Tactful_Tokenizer by zencephalon

Accurate Bayesian sentence tokenizer in Ruby.

created at March 10, 2010, 3:17 a.m.

Ruby

5 +0

80 +0

13 +0

GitHub
monkeylearn-ruby by monkeylearn

Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.

created at Oct. 16, 2015, 4:30 p.m.

Ruby

15 +0

80 +0

14 +0

GitHub
unicode by blackwinter

Unicode normalization library. (Mirror of Yoshida-san's code base to maintain the RubyGem.)

created at March 1, 2010, 12:26 p.m.

C

5 +0

80 +0

13 +0

GitHub
hotwater by colinsurprenant

Fast Ruby FFI string edit distance algorithms

created at Feb. 25, 2013, 6:55 a.m.

Ruby

5 +0

81 +0

1 +0

GitHub
tickle by yb66

Natural language parser for recurring events

created at Jan. 12, 2012, 2:59 a.m.

Ruby

3 +0

82 +1

13 +0

GitHub
liblinear-ruby-swig by tomz

This is the Ruby interface to LIBLINEAR (much more efficient than LIBSVM for text classification and other large linear classifications)

created at Feb. 28, 2009, 7:17 p.m.

C++

5 +0

83 +0

12 +0

GitHub
rwordnet by doches

A pure Ruby interface to the WordNet database

created at Nov. 10, 2008, 7:21 p.m.

Ruby

7 +0

89 +0

27 +0

GitHub
pragmatic_tokenizer by diasks2

A multilingual tokenizer to split a string into tokens

created at Jan. 5, 2016, 7:30 a.m.

Ruby

6 +0

90 +0

11 +0

GitHub
open-nlp by louismullie

Ruby bindings to the OpenNLP Java toolkit.

created at Dec. 19, 2012, 2:44 a.m.

Ruby

9 +0

91 +0

11 +0

GitHub