REST client for Google APIs
updated at May 26, 2024, 10:38 a.m.
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
updated at May 26, 2024, 8:25 a.m.
Tesseract Open Source OCR Engine (main repository)
updated at May 25, 2024, 9:39 p.m.
Syntax tree generator for linguistic research
updated at May 25, 2024, 8:43 p.m.
Sphinx/Manticore plugin for ActiveRecord/Rails
updated at May 24, 2024, 1:29 a.m.
Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
updated at May 23, 2024, 5:54 a.m.
Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.
updated at May 22, 2024, 12:29 p.m.
Ruby gem to calculate the similarity between texts using tf*idf
updated at May 20, 2024, 9:06 p.m.
Generate strings that match a given regular expression
updated at May 20, 2024, 6:43 p.m.
Elasticsearch integrations for ActiveModel/Record and Ruby on Rails
updated at May 20, 2024, 4:19 p.m.
ID3-based implementation of the ML Decision Tree algorithm
updated at May 20, 2024, 11:18 a.m.