Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.
created at Jan. 13, 2012, 4:46 p.m.
Ruby bindings to the OpenNLP Java toolkit.
created at Dec. 19, 2012, 2:44 a.m.
Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy
created at Oct. 27, 2012, 11:16 p.m.
A Ruby library for consuming the AT&T Speech API for speech to text.
created at Aug. 15, 2012, 4:02 p.m.
Automatically exported from code.google.com/p/berkeleyparser
created at July 7, 2015, 8:35 a.m.
A wrapper module for using spaCy natural language processing library from the Ruby programming language via PyCall
created at June 19, 2021, 2:04 a.m.
A fast and accurate rule-based sentence segmentation tool for Ruby.
created at Aug. 15, 2012, 5:14 a.m.
Syntax tree generator for linguistic research
created at Nov. 29, 2009, 2:14 p.m.