fuzzy_match by seamusabshere

Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.

created at Jan. 13, 2012, 4:46 p.m.

Ruby

11 +0

668 -1

48 +0

GitHub
rmagick by rmagick

Ruby bindings for ImageMagick

created at July 24, 2014, 2:02 p.m.

C++

18 +0

696 +2

140 +0

GitHub
tf-idf-similarity by jpmckinney

Ruby gem to calculate the similarity between texts using tf*idf

created at Sept. 10, 2012, 1:29 a.m.

Ruby

23 +0

729 +1

64 +0

GitHub
iruby by SciRuby

Official gem repository: Ruby kernel for Jupyter/IPython Notebook

created at March 3, 2015, 6:05 p.m.

Ruby

30 +0

831 +1

24 +0

GitHub
pycall.rb by mrkn

Calling Python functions from the Ruby language

created at Oct. 5, 2016, 3:57 p.m.

C

40 +0

1,032 +1

71 +0

GitHub
ruby-nlp by diasks2

A collection of links to Ruby Natural Language Processing (NLP) libraries, tools and software

created at March 23, 2015, 3:48 a.m.

Unknown languages

83 +0

1,263 +0

105 +0

GitHub
treat by louismullie

Natural language processing framework for Ruby.

created at Jan. 24, 2012, 2:07 a.m.

Ruby

68 +0

1,364 +0

128 +0

GitHub
decisiontree by igrigorik

ID3-based implementation of the ML Decision Tree algorithm

created at Feb. 23, 2009, 4:52 a.m.

Ruby

40 +0

1,423 +0

131 +0

GitHub
thinking-sphinx by pat

Sphinx/Manticore plugin for ActiveRecord/Rails

created at April 14, 2008, 1:28 a.m.

Ruby

31 +0

1,622 +1

467 +0

GitHub
lita by litaio

ChatOps for Ruby.

created at April 20, 2013, 10:35 a.m.

Ruby

40 +0

1,678 +0

178 +0

GitHub
awesome-ocr by kba

Links to awesome OCR projects

created at April 27, 2016, 4:54 p.m.

Unknown languages

128 +0

2,631 +7

344 +1

GitHub
google-api-ruby-client by googleapis

REST client for Google APIs

created at Jan. 26, 2012, 9:54 p.m.

Ruby

114 -1

2,758 +2

865 -1

GitHub
sunspot by sunspot

Solr-powered search for Ruby objects

created at Oct. 13, 2008, 3:46 p.m.

JavaScript

32 +0

2,968 +1

922 -1

GitHub
elasticsearch-rails by elastic

Elasticsearch integrations for ActiveModel/Record and Ruby on Rails

created at Nov. 8, 2013, 5 p.m.

Ruby

390 -1

3,054 +1

792 +0

GitHub
chronic by mojombo

Chronic is a pure Ruby natural language date parser.

created at Jan. 29, 2008, 6:48 a.m.

Ruby

70 +0

3,224 -1

437 +0

GitHub
parallel by grosser

Ruby: parallel processing made simple and fast

created at Aug. 11, 2009, 6:54 p.m.

Ruby

77 +0

4,119 +0

255 +0

GitHub
CoreNLP by stanfordnlp

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

created at June 27, 2013, 9:13 p.m.

Java

488 +0

9,501 +13

2,694 +0

GitHub
tesseract by tesseract-ocr

Tesseract Open Source OCR Engine (main repository)

created at Aug. 12, 2014, 6:04 p.m.

C++

1,682 -1

58,667 +161

9,123 +18

GitHub