punkt-segmenter by lfcipriani

Ruby port of the NLTK Punkt sentence segmentation algorithm

created at June 26, 2010, 2:17 a.m.

Ruby

2 +0

92 +0

10 +0

GitHub
ruby-nlp by tiendung

Ruby binding for the Stanford POS Tagger and Named Entity Recognizer

created at Aug. 11, 2008, 10:50 a.m.

Ruby

11 +0

92 +0

14 +0

GitHub
open-nlp by louismullie

Ruby bindings to the OpenNLP Java toolkit.

created at Dec. 19, 2012, 2:44 a.m.

Ruby

9 +0

91 +0

11 +0

GitHub
pragmatic_tokenizer by diasks2

A multilingual tokenizer to split a string into tokens

created at Jan. 5, 2016, 7:30 a.m.

Ruby

6 +0

90 +0

11 +0

GitHub
rwordnet by doches

A pure Ruby interface to the WordNet database

created at Nov. 10, 2008, 7:21 p.m.

Ruby

7 +0

88 +0

26 +1

GitHub
hotwater by colinsurprenant

Fast Ruby FFI string edit distance algorithms

created at Feb. 25, 2013, 6:55 a.m.

Ruby

5 +0

81 +0

1 +0

GitHub
monkeylearn-ruby by monkeylearn

Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.

created at Oct. 16, 2015, 4:30 p.m.

Ruby

15 +0

80 +0

14 +0

GitHub
CommonRegexRuby by talyssonoc

Find many kinds of common information in a string. A Ruby port of CommonRegex.

created at Jan. 23, 2015, 12:39 p.m.

Ruby

4 +0

79 +0

5 +0

GitHub
Tactful_Tokenizer by zencephalon

Accurate Bayesian sentence tokenizer in Ruby.

created at March 10, 2010, 3:17 a.m.

Ruby

5 +0

79 +0

13 +0

GitHub
tickle by yb66

Natural language parser for recurring events

created at Jan. 12, 2012, 2:59 a.m.

Ruby

3 +0

78 +0

13 +0

GitHub
stopwords-filter by brenes

Project for filtering stopwords

created at Aug. 12, 2012, 3:59 p.m.

Ruby

4 +0

75 +0

46 +0

GitHub
raingrams by postmodern

A flexible and general-purpose ngrams library written in Ruby. Raingrams supports ngram sizes greater than 1, text/non-text grams, multiple parsing styles and open/closed vocabulary models.

created at March 8, 2009, 10:54 a.m.

Ruby

7 +0

69 +0

7 +0

GitHub
weka-jruby by paulgoetze

Machine Learning & Data Mining with JRuby

created at Dec. 2, 2015, 6:58 p.m.

Ruby

4 +0

66 +0

8 +0

GitHub
going_the_distance by schneems

Distance Measurements are Awesome!

created at Sept. 18, 2014, 7:38 a.m.

Ruby

5 +0

61 +0

6 +0

GitHub
pwrake by masa16

Parallel Workflow extension for Rake, runs on multicores, clusters, clouds.

created at April 13, 2012, 11 a.m.

Ruby

3 +0

57 +0

4 +0

GitHub
uea-stemmer by ealdent

Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing

created at July 15, 2009, 1 p.m.

Ruby

3 +0

52 +0

5 +0

GitHub
ruby-spacy by yohasebe

A wrapper module for using the spaCy natural language processing library from the Ruby programming language via PyCall

created at June 19, 2021, 2:04 a.m.

Ruby

8 +0

52 +0

4 +0

GitHub
scalpel by louismullie

A fast and accurate rule-based sentence segmentation tool for Ruby.

created at Aug. 15, 2012, 5:14 a.m.

Ruby

8 +0

50 +0

5 +0

GitHub
Naive-Bayes by reddavis

Simple Naive Bayes classifier

created at Nov. 14, 2009, 8:32 p.m.

Ruby

6 +0

48 +0

8 +0

GitHub
ffi-hunspell by postmodern

Ruby FFI bindings for Hunspell.

created at Oct. 6, 2010, 5:41 a.m.

Ruby

5 +0

48 +0

24 +0

GitHub