fuzzy_match in arbox/nlp-with-ruby

Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.

updated at Nov. 12, 2024, 3:42 p.m.

Ruby

10 +0

676 +1

46 +0

GitHub
data_miner in markets/awesome-ruby

Download, unpack from a ZIP/TAR/GZ/BZ2 archive, parse, correct, convert units and import Google Spreadsheets, XLS, ODS, XML, CSV, HTML, etc. into your ActiveRecord models. Uses RemoteTable gem internally.

updated at Aug. 30, 2024, 1:48 p.m.

Ruby

14 +0

302 +0

18 +0

GitHub
upsert in markets/awesome-ruby

Upsert on MySQL, PostgreSQL, and SQLite3. Transparently creates functions (UDF) for MySQL and PostgreSQL; on SQLite3, uses INSERT OR IGNORE.

updated at July 19, 2024, 9:28 p.m.

Ruby

10 +0

652 +0

77 +0

GitHub