pragmatic_tokenizer in arbox/nlp-with-ruby

A multilingual tokenizer that splits a string into tokens.

updated at Feb. 22, 2024, 4:04 p.m.

Ruby

Watchers: 6 (+0)

Stars: 90 (+0)

Forks: 11 (+0)

GitHub
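
To make "split a string into tokens" concrete, here is a minimal sketch in plain Ruby. It is not pragmatic_tokenizer's implementation or API: `simple_tokenize` is a hypothetical helper that only relies on Ruby's Unicode-aware regular expressions, so it separates words from punctuation in any script but applies none of the language-specific rules a real multilingual tokenizer would need.

```ruby
# Hypothetical helper for illustration; NOT part of pragmatic_tokenizer.
# Returns word tokens and single punctuation/symbol tokens, in order.
def simple_tokenize(text)
  # First alternative: a run of letters (\p{L}), digits (\p{N}) and
  # combining marks (\p{M}), optionally continued by an apostrophe
  # followed by more letters (so "C'est" stays one token).
  # Second alternative: any single character that is neither whitespace
  # nor part of a word, i.e. punctuation or a symbol.
  text.scan(/[\p{L}\p{N}\p{M}]+(?:'[\p{L}\p{M}]+)*|[^\s\p{L}\p{N}\p{M}]/)
end

simple_tokenize("Hello, world!")
# => ["Hello", ",", "world", "!"]
```

The sketch keeps case and punctuation as-is; whether to downcase, drop punctuation, or expand contractions is a design choice that a full tokenizer would typically expose as options.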
pragmatic_segmenter in markets/awesome-ruby, arbox/nlp-with-ruby

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out of the box across many languages.

updated at May 29, 2024, 12:43 p.m.

Ruby

Watchers: 16 (+0)

Stars: 539 (+1)

Forks: 51 (+0)

GitHub
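
As a rough illustration of what "rule-based sentence boundary detection" means, the sketch below splits text at sentence-ending punctuation unless the period belongs to a known abbreviation. It is not Pragmatic Segmenter's code or API: `naive_segment` and the tiny English-only `ABBREVIATIONS` list are assumptions made for this example, and a production rule set covers far more cases (initials, numbers, quotations, other languages).

```ruby
# Hypothetical abbreviation list for illustration; a real rule set is far larger.
ABBREVIATIONS = %w[mr mrs ms dr prof st vs etc].freeze

# Hypothetical helper; NOT part of pragmatic_segmenter.
def naive_segment(text)
  sentences = []
  current = +""
  text.split.each do |word|
    current << " " unless current.empty?
    current << word
    # Rule 1: only words ending in . ! or ? (optionally followed by a
    # closing quote or bracket) can end a sentence.
    next unless word.match?(/[.!?]["')\]]*\z/)
    # Rule 2: a period right after a known abbreviation is not a boundary.
    stem = word.downcase.sub(/\.\z/, "")
    next if word.end_with?(".") && ABBREVIATIONS.include?(stem)
    sentences << current
    current = +""
  end
  # Keep any trailing text that lacks terminal punctuation.
  sentences << current unless current.empty?
  sentences
end

naive_segment("Dr. Smith arrived. He was late!")
# => ["Dr. Smith arrived.", "He was late!"]
```

A plain split on `.` would break the example after "Dr."; the abbreviation rule is what keeps the first sentence intact, which is the general idea behind rule-based segmenters.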