Accurate Bayesian sentence tokenizer in Ruby.
updated at May 8, 2024, 10:27 a.m.
ID3-based implementation of the ML Decision Tree algorithm
updated at May 9, 2024, 4:15 a.m.
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
updated at May 10, 2024, 7:25 p.m.
Make difficult regular expressions easy! Ruby port of the awesome VerbalExpressions repo - https://github.com/jehna/VerbalExpressions
updated at May 11, 2024, 9:29 a.m.
Sphinx/Manticore plugin for ActiveRecord/Rails
updated at May 17, 2024, 9:05 a.m.
Elasticsearch integrations for ActiveModel/Record and Ruby on Rails
updated at May 17, 2024, 4:41 p.m.
Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.
updated at May 18, 2024, 5:39 p.m.