fuzzy_match in arbox/nlp-with-ruby

Find a needle (a document or record) in a haystack using string similarity and (optionally) regular expression rules. Uses Dice's Coefficient (aka Pair Similiarity) and Levenshtein Distance internally.

created at Jan. 13, 2012, 4:46 p.m.

Ruby

10 +0

676 -1

46 +0

GitHub
upsert in markets/awesome-ruby

Upsert on MySQL, PostgreSQL, and SQLite3. Transparently creates functions (UDF) for MySQL and PostgreSQL; on SQLite3, uses INSERT OR IGNORE.

created at June 13, 2012, 3:15 a.m.

Ruby

10 +0

651 +0

77 +0

GitHub
data_miner in markets/awesome-ruby

Download, unpack from a ZIP/TAR/GZ/BZ2 archive, parse, correct, convert units and import Google Spreadsheets, XLS, ODS, XML, CSV, HTML, etc. into your ActiveRecord models. Uses RemoteTable gem internally.

created at Aug. 19, 2009, 12:46 p.m.

Ruby

14 +0

302 +0

18 +0

GitHub