python-ucto by proycon

This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).

created at May 21, 2014, 5:28 p.m.

Cython

4 +0

29 +0

5 +0

GitHub
python-frog by proycon

Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)

created at Sept. 7, 2014, 8:32 p.m.

Cython

6 +0

47 +0

10 +0

GitHub