ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for Chinese and English. Currently supports word segmentation, POS tagging, dependency parsing, and phrase-structure parsing.
created at June 30, 2015, 1:55 p.m.
A Python wrapper around the ZPar parser for English.
created at Sept. 8, 2014, 1:41 p.m.
Python bindings to the Dutch NLP tool Frog (POS tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser)
created at Sept. 7, 2014, 8:32 p.m.
This is a Python binding to the tokeniser Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the Ucto tokeniser available to Python. Ucto itself is a regular-expression-based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
created at May 21, 2014, 5:28 p.m.
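To illustrate what regular-expression-based tokenisation means in general, here is a minimal sketch using only Python's standard `re` module. It does not use Ucto or its Python binding, and the pattern and the `tokenise` helper are hypothetical, far simpler than Ucto's actual rule sets.

```python
import re

# Hypothetical pattern, not taken from Ucto: a word (optionally with one
# internal apostrophe part, e.g. "don't"), or a single punctuation character.
TOKEN_PATTERN = re.compile(r"\w+(?:'\w+)?|[^\w\s]")

def tokenise(text):
    """Split text into word and punctuation tokens with one regex."""
    return TOKEN_PATTERN.findall(text)

tokens = tokenise("Don't panic, it's fine.")
```

Even this small pattern has to decide how to treat apostrophes and punctuation, which hints at why tokenisation is less trivial than it appears.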
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language models. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP-specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
created at July 6, 2010, 11:42 a.m.
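As a rough illustration of the basic tasks mentioned above (n-gram extraction and frequency lists), the following sketch uses only Python's standard library. It does not call PyNLPl; the `ngrams` and `frequency_list` helpers are hypothetical names chosen for this example and should not be read as PyNLPl's API.

```python
from collections import Counter

def ngrams(tokens, n):
    """Yield each run of n consecutive tokens as a tuple."""
    for i in range(len(tokens) - n + 1):
        yield tuple(tokens[i:i + n])

def frequency_list(tokens, n=1):
    """Count how often each n-gram occurs in the token sequence."""
    return Counter(ngrams(tokens, n))

tokens = "to be or not to be".split()
bigram_counts = frequency_list(tokens, 2)
```

Counts like `bigram_counts` are the raw material for a simple language model: dividing a bigram's count by the count of its first word gives an estimate of the conditional probability of the second word.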
Tools, wrappers, etc. for data science, with a focus on text processing
created at Nov. 3, 2013, 4:13 p.m.
A Python framework to transform natural language questions into queries in a database query language.
created at Dec. 3, 2012, 3:46 p.m.
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
created at Oct. 5, 2017, 5:32 p.m.
The world's simplest facial recognition API for Python and the command line
created at March 3, 2017, 9:52 p.m.
A machine learning library that makes predictions using linear regression.
created at June 8, 2015, 4:52 p.m.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
created at Jan. 23, 2015, 7:38 p.m.