pylearn2 by lisa-lab

Warning: This project does not have any current developer. See bellow.

created at Nov. 22, 2010, 6 p.m.

Python

267 +0

2,752 +0

1,091 -1

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

543 +0

8,726 +4

1,577 -1

GitHub
simpleai by simpleai-team

simple artificial intelligence utilities

created at July 24, 2012, 11:12 p.m.

Python

106 +0

964 +0

248 +0

GitHub
featureforge by machinalis

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

created at Feb. 17, 2014, 5:21 p.m.

Python

34 +0

382 +0

77 +0

GitHub
xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

909 +0

26,140 +20

8,706 +2

GitHub
machine-learning by jeff1evesque

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

created at Aug. 13, 2014, 4:50 p.m.

JavaScript

22 +0

256 +0

85 +0

GitHub
chainer by chainer

A flexible framework of neural networks for deep learning

created at June 5, 2015, 5:50 a.m.

Python

285 -1

5,886 +1

1,368 +0

GitHub
auto_ml by ClimbsRocks

[UNMAINTAINED] Automated machine learning for analytics & production

created at Aug. 7, 2016, 9:35 p.m.

Python

97 +0

1,641 -1

310 +0

GitHub
dedupe by dedupeio

id A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

created at April 20, 2012, 2:57 p.m.

Python

120 +0

4,106 +5

550 +1

GitHub
DrQA by facebookresearch

Reading Wikipedia to Answer Open-Domain Questions

created at July 7, 2017, 10:44 p.m.

Python

158 +0

4,474 -1

899 +0

GitHub
polyglot by aboSamoor

Multilingual text (NLP) processing toolkit

created at June 30, 2014, 2:07 a.m.

Python

77 +0

2,309 +1

337 +0

GitHub
yase by PPACI

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

created at April 12, 2017, 6:45 p.m.

Python

2 +0

13 +0

1 +0

GitHub
cltk by cltk

The Classical Language Toolkit

created at Jan. 11, 2014, 11:59 p.m.

Python

65 +0

834 +0

328 +0

GitHub
stanford-corenlp-python by dasmith

Python wrapper for Stanford CoreNLP tools v3.4.1

created at Feb. 26, 2011, 6:20 p.m.

Python

41 +0

611 +0

229 +0

GitHub
textacy by chartbeat-labs

NLP, before and after spaCy

created at Feb. 3, 2016, 4:52 p.m.

Python

87 +0

2,208 +1

249 +0

GitHub
jellyfish by jamesturk

🪼 a python library for doing approximate and phonetic matching of strings.

created at July 9, 2010, 8:41 p.m.

Jupyter Notebook

42 +0

2,045 +3

158 +1

GitHub
fuzzywuzzy by seatgeek

Fuzzy String Matching in Python

created at July 8, 2011, 7:32 p.m.

Python

259 +0

9,219 +5

875 +1

GitHub
distance by doukremt

Levenshtein and Hamming distance computation

created at Nov. 3, 2013, 2:02 p.m.

C

5 +0

117 +0

17 +0

GitHub
PyStanfordDependencies by dmcc

Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies

created at Dec. 13, 2014, 6:38 p.m.

Python

4 +0

68 +0

17 +0

GitHub
data-science-ipython-notebooks by donnemartin

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

created at Jan. 23, 2015, 7:38 p.m.

Python

1,615 -1

27,231 +78

7,846 +9

GitHub