ruby-spark by ondra-m

Ruby wrapper for Apache Spark

created at Jan. 26, 2015, 8:07 p.m.

Ruby

16 +0

225 +0

29 +0

GitHub
pragmatic_segmenter by diasks2

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.

created at Jan. 4, 2015, 9 a.m.

Ruby

16 +0

537 +2

51 +1

GitHub
monkeylearn-ruby by monkeylearn

Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.

created at Oct. 16, 2015, 4:30 p.m.

Ruby

15 +0

80 +0

14 +0

GitHub
regexp-examples by tom-lord

Generate strings that match a given regular expression

created at Nov. 4, 2014, 11:46 p.m.

Ruby

15 +0

521 +0

31 +0

GitHub
espeak-ruby by dejan

Ruby wrapper for ‘espeak’ and ‘lame’ with sugar on top to create Text-To-Speech mp3 files.

created at Jan. 4, 2009, 1:47 a.m.

Ruby

15 +0

192 +0

22 +0

GitHub
fuzzy-string-match by kiyoka

fuzzy string matching library for ruby

created at Oct. 12, 2010, 2:30 a.m.

Ruby

13 +0

279 +0

39 +0

GitHub
chronic_duration by henrypoydar

A simple Ruby natural language parser for elapsed time

created at Jan. 11, 2009, 9 p.m.

Ruby

13 +0

351 +0

68 +0

GitHub
verbal_expressions by ryan-endacott

Make difficult regular expressions easy! Ruby port of the awesome VerbalExpressions repo - https://github.com/jehna/VerbalExpressions

created at July 22, 2013, 6:07 p.m.

Ruby

13 +0

571 -1

26 +0

GitHub
words_counted by abitdodgy

A Ruby natural language processor.

created at April 30, 2014, 3:07 a.m.

Ruby

12 +0

159 +0

29 +0

GitHub
nbayes by oasic

A robust, full-featured Ruby implementation of Naive Bayes

created at June 4, 2012, 8:57 p.m.

Ruby

12 +0

153 +0

33 +0

GitHub
yomu by yomurb

Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)

created at March 25, 2012, 10:03 a.m.

Ruby

12 +0

492 +0

122 +0

GitHub
fuzzy_match by seamusabshere

Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.

created at Jan. 13, 2012, 4:46 p.m.

Ruby

11 +0

668 +0

47 +0

GitHub
ruby-nlp by tiendung

Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer

created at Aug. 11, 2008, 10:50 a.m.

Ruby

11 +0

92 +0

14 +0

GitHub
phobos by phobos

Simplifying Kafka for ruby apps

created at Aug. 13, 2016, 6:14 p.m.

Ruby

9 +0

219 +0

38 -2

GitHub
open-nlp by louismullie

Ruby bindings to the OpenNLP Java toolkit.

created at Dec. 19, 2012, 2:44 a.m.

Ruby

9 +0

91 +0

11 +0

GitHub
ruby-spacy by yohasebe

A wrapper module for using spaCy natural language processing library from the Ruby programming language via PyCall

created at June 19, 2021, 2:04 a.m.

Ruby

8 +0

52 +0

4 +0

GitHub
lemmatizer by yohasebe

Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy

created at Oct. 27, 2012, 11:16 p.m.

Ruby

8 +0

108 +0

15 +0

GitHub
scalpel by louismullie

A fast and accurate rule-based sentence segmentation tool for Ruby.

created at Aug. 15, 2012, 5:14 a.m.

Ruby

8 +0

50 +0

5 +0

GitHub
att_speech by adhearsion

A Ruby library for consuming the AT&T Speech API for speech to text.

created at Aug. 15, 2012, 4:02 p.m.

Ruby

8 +0

20 +0

6 +0

GitHub
rwordnet by doches

A pure Ruby interface to the WordNet database

created at Nov. 10, 2008, 7:21 p.m.

Ruby

7 +0

88 +0

26 +0

GitHub