ruby-spark by ondra-m

Ruby wrapper for Apache Spark

updated at April 21, 2024, 2:14 p.m.

Ruby

16 +0

225 +1

29 +0

GitHub
CoreNLP by stanfordnlp

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

updated at April 21, 2024, 1:06 p.m.

Java

489 -1

9,458 +14

2,694 +1

GitHub
google-api-ruby-client by googleapis

REST client for Google APIs

updated at April 21, 2024, 11 a.m.

Ruby

117 +3

2,753 +5

866 +0

GitHub
tesseract by tesseract-ocr

Tesseract Open Source OCR Engine (main repository)

updated at April 21, 2024, 10:22 a.m.

C++

1,682 +1

57,926 +146

9,072 +13

GitHub
tf-idf-similarity by jpmckinney

Ruby gem to calculate the similarity between texts using tf*idf

updated at April 21, 2024, 2:11 a.m.

Ruby

23 +0

726 +4

63 +0

GitHub
pycall.rb by mrkn

Calling Python functions from the Ruby language

updated at April 20, 2024, 4:52 p.m.

C

41 +0

1,028 +2

71 +0

GitHub
pragmatic_segmenter by diasks2

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.

updated at April 20, 2024, 4:45 p.m.

Ruby

16 +0

535 +1

49 +0

GitHub
rmagick by rmagick

Ruby bindings for ImageMagick

updated at April 20, 2024, 5:52 a.m.

C++

19 +0

692 +0

140 +0

GitHub
chronic by mojombo

Chronic is a pure Ruby natural language date parser.

updated at April 19, 2024, 7:07 p.m.

Ruby

70 +0

3,222 +0

437 -1

GitHub
re2 by mudge

Ruby bindings to RE2, a "fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python".

updated at April 19, 2024, 6:48 p.m.

Ruby

5 +0

127 +1

14 +0

GitHub
elasticsearch-rails by elastic

Elasticsearch integrations for ActiveModel/Record and Ruby on Rails

updated at April 19, 2024, 6:06 p.m.

Ruby

393 +1

3,056 +3

790 +0

GitHub
iruby by SciRuby

Official gem repository: Ruby kernel for Jupyter/IPython Notebook

updated at April 19, 2024, 5:05 p.m.

Ruby

30 +0

823 +0

23 +0

GitHub
parallel by grosser

Ruby: parallel processing made simple and fast

updated at April 19, 2024, 4:53 p.m.

Ruby

77 +0

4,111 +11

255 +7

GitHub
decisiontree by igrigorik

ID3-based implementation of the ML Decision Tree algorithm

updated at April 19, 2024, 9:46 a.m.

Ruby

40 +0

1,422 +3

130 +0

GitHub
awesome-ocr by kba

Links to awesome OCR projects

updated at April 19, 2024, 7:59 a.m.

Unknown languages

127 +1

2,586 +5

339 +0

GitHub
ruby-fann by tangledpath

Ruby library for interfacing with FANN (Fast Artificial Neural Network)

updated at April 18, 2024, 11:03 a.m.

C

30 +0

489 +2

42 +0

GitHub
classifier-reborn by jekyll

A general classifier module to allow Bayesian and other types of classifications. A fork of cardmagic/classifier.

updated at April 16, 2024, 11:46 p.m.

Ruby

20 +0

547 +2

108 +0

GitHub
treat by louismullie

Natural language processing framework for Ruby.

updated at April 15, 2024, 10:07 p.m.

Ruby

68 +0

1,364 +1

128 +0

GitHub
words_counted by abitdodgy

A Ruby natural language processor.

updated at April 15, 2024, 10:06 p.m.

Ruby

12 +0

159 +1

29 +0

GitHub
yomu by yomurb

Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)

updated at April 15, 2024, 10:05 p.m.

Ruby

12 +0

491 +1

122 +1

GitHub