tesseract by tesseract-ocr

Tesseract Open Source OCR Engine (main repository)

updated at Nov. 17, 2024, 11:50 a.m.

C++

1,692 +1

62,379 +148

9,520 +14

GitHub
chronic_between by jrobertson

A natural language parser for validating complex date ranges

updated at Nov. 16, 2024, 8:58 a.m.

Ruby

2 +0

28 +1

3 +0

GitHub
pragmatic_segmenter by diasks2

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.

updated at Nov. 16, 2024, 8:44 a.m.

Ruby

16 +0

552 +3

55 +0

GitHub
tickle by yb66

Natural language parser for recurring events

updated at Nov. 16, 2024, 6:17 a.m.

Ruby

3 +0

82 +1

13 +0

GitHub
chronic by mojombo

Chronic is a pure Ruby natural language date parser.

updated at Nov. 16, 2024, 3:56 a.m.

Ruby

70 +0

3,244 +0

465 +2

GitHub
lita by litaio

ChatOps for Ruby.

updated at Nov. 15, 2024, 10:59 p.m.

Ruby

40 +0

1,678 -1

179 +0

GitHub
awesome-ocr by kba

Links to awesome OCR projects

updated at Nov. 15, 2024, 9:03 p.m.

Unknown languages

128 +0

2,817 +11

349 +0

GitHub
pycall.rb by mrkn

Calling Python functions from the Ruby language

updated at Nov. 15, 2024, 5:40 p.m.

C

39 +0

1,057 +1

75 +0

GitHub
parallel by grosser

Ruby: parallel processing made simple and fast

updated at Nov. 15, 2024, 12:40 p.m.

Ruby

75 +0

4,168 +7

254 +0

GitHub
amatch by flori

Approximate String Matching library

updated at Nov. 15, 2024, 12:08 p.m.

C

9 +0

378 +2

35 +0

GitHub
tf-idf-similarity by jpmckinney

Ruby gem to calculate the similarity between texts using tf*idf

updated at Nov. 15, 2024, 12:04 p.m.

Ruby

23 +0

748 +1

64 +0

GitHub
ruby-fann by tangledpath

Ruby library for interfacing with FANN (Fast Artificial Neural Network)

updated at Nov. 15, 2024, 6:23 a.m.

C

30 +0

497 +2

42 +0

GitHub
google-api-ruby-client by googleapis

REST client for Google APIs

updated at Nov. 14, 2024, 2:29 p.m.

Ruby

114 +0

2,805 +3

871 +0

GitHub
rsolr by rsolr

A Ruby client for Apache Solr

updated at Nov. 14, 2024, 10:09 a.m.

Ruby

20 +0

421 +0

142 +0

GitHub
CoreNLP by stanfordnlp

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

updated at Nov. 14, 2024, 3:51 a.m.

Java

489 -1

9,702 +9

2,703 +0

GitHub
iruby by SciRuby

Official gem repository: Ruby kernel for Jupyter/IPython Notebook

updated at Nov. 14, 2024, 12:37 a.m.

Ruby

30 +0

901 +2

29 +1

GitHub
stanford-core-nlp by louismullie

Ruby bindings to the Stanford Core NLP tools (English, French, German).

updated at Nov. 13, 2024, 10:27 a.m.

Ruby

34 +0

432 -1

70 +0

GitHub
yomu by yomurb

Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf)

updated at Nov. 12, 2024, 5:54 p.m.

Ruby

12 +0

499 +1

125 +0

GitHub
fuzzy_match by seamusabshere

Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.

updated at Nov. 12, 2024, 3:42 p.m.

Ruby

10 +0

676 +1

46 +0

GitHub
nmt-list by jonsafari

A list of Neural MT implementations

updated at Nov. 11, 2024, 9:14 a.m.

Unknown languages

33 +0

359 +1

69 +0

GitHub