Tactful_Tokenizer by zencephalon

Accurate Bayesian sentence tokenizer in Ruby.

updated at May 8, 2024, 10:27 a.m.

Ruby

5 +0

80 +0

13 +0

GitHub
decisiontree by igrigorik

ID3-based implementation of the ML Decision Tree algorithm

updated at May 9, 2024, 4:15 a.m.

Ruby

40 +0

1,423 +0

131 +0

GitHub
re2 by mudge

Ruby bindings to RE2, a "fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python".

updated at May 10, 2024, 6:24 p.m.

Ruby

5 +0

128 +0

14 +0

GitHub
pragmatic_segmenter by diasks2

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.

updated at May 10, 2024, 7:25 p.m.

Ruby

16 +0

537 +0

51 +0

GitHub
verbal_expressions by ryan-endacott

Make difficult regular expressions easy! Ruby port of the awesome VerbalExpressions repo - https://github.com/jehna/VerbalExpressions

updated at May 11, 2024, 9:29 a.m.

Ruby

13 +0

571 +0

26 +0

GitHub
stopwords-filter by brenes

Project for filtering stopwords

updated at May 12, 2024, 4:46 p.m.

Ruby

4 +0

76 +1

46 +0

GitHub
parallel by grosser

Ruby: parallel processing made simple and fast

updated at May 14, 2024, 9:41 a.m.

Ruby

77 +0

4,119 +2

255 +1

GitHub
chronic by mojombo

Chronic is a pure Ruby natural language date parser.

updated at May 15, 2024, 9:38 p.m.

Ruby

70 +0

3,225 +1

437 +0

GitHub
thinking-sphinx by pat

Sphinx/Manticore plugin for ActiveRecord/Rails

updated at May 17, 2024, 9:05 a.m.

Ruby

31 +0

1,621 +0

467 +0

GitHub
iruby by SciRuby

Official gem repository: Ruby kernel for Jupyter/IPython Notebook

updated at May 17, 2024, 11:05 a.m.

Ruby

30 +0

830 +3

24 +1

GitHub
elasticsearch-rails by elastic

Elasticsearch integrations for ActiveModel/Record and Ruby on Rails

updated at May 17, 2024, 4:41 p.m.

Ruby

391 +1

3,053 -2

792 +1

GitHub
google-api-ruby-client by googleapis

REST client for Google APIs

updated at May 18, 2024, 4:04 p.m.

Ruby

115 +0

2,756 -1

866 -1

GitHub
fuzzy_match by seamusabshere

Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.

updated at May 18, 2024, 5:39 p.m.

Ruby

11 +0

669 +1

48 +1

GitHub