pylearn2 by lisa-lab

Warning: This project does not have any current developer. See bellow.

created at Nov. 22, 2010, 6 p.m.

Python

267 +0

2,757 +3

1,090 +0

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

543 +0

8,751 +7

1,577 +0

GitHub
simpleai by simpleai-team

simple artificial intelligence utilities

created at July 24, 2012, 11:12 p.m.

Python

106 +0

968 +0

250 +0

GitHub
featureforge by machinalis

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

created at Feb. 17, 2014, 5:21 p.m.

Python

34 +0

382 +0

77 +0

GitHub
xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

908 +0

26,303 +26

8,729 +3

GitHub
machine-learning by jeff1evesque

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

created at Aug. 13, 2014, 4:50 p.m.

JavaScript

22 +0

257 +0

85 +0

GitHub
chainer by chainer

A flexible framework of neural networks for deep learning

created at June 5, 2015, 5:50 a.m.

Python

282 +0

5,892 +2

1,367 +0

GitHub
auto_ml by ClimbsRocks

[UNMAINTAINED] Automated machine learning for analytics & production

created at Aug. 7, 2016, 9:35 p.m.

Python

97 +0

1,643 -1

310 +0

GitHub
dedupe by dedupeio

id A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

created at April 20, 2012, 2:57 p.m.

Python

120 +0

4,145 +3

551 +1

GitHub
DrQA by facebookresearch

Reading Wikipedia to Answer Open-Domain Questions

created at July 7, 2017, 10:44 p.m.

Python

158 +0

4,478 +2

896 +0

GitHub
polyglot by aboSamoor

Multilingual text (NLP) processing toolkit

created at June 30, 2014, 2:07 a.m.

Python

77 +0

2,315 +2

337 +0

GitHub
yase by PPACI

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

created at April 12, 2017, 6:45 p.m.

Python

2 +0

13 +0

1 +0

GitHub
cltk by cltk

The Classical Language Toolkit

created at Jan. 11, 2014, 11:59 p.m.

Python

65 +0

840 +2

330 +0

GitHub
stanford-corenlp-python by dasmith

Python wrapper for Stanford CoreNLP tools v3.4.1

created at Feb. 26, 2011, 6:20 p.m.

Python

41 +0

612 +0

228 -1

GitHub
textacy by chartbeat-labs

NLP, before and after spaCy

created at Feb. 3, 2016, 4:52 p.m.

Python

87 +0

2,217 +2

250 +0

GitHub
jellyfish by jamesturk

🪼 a python library for doing approximate and phonetic matching of strings.

created at July 9, 2010, 8:41 p.m.

Jupyter Notebook

42 +0

2,066 -1

158 -2

GitHub
fuzzywuzzy by seatgeek

Fuzzy String Matching in Python

created at July 8, 2011, 7:32 p.m.

Python

259 +1

9,231 +4

876 +1

GitHub
distance by doukremt

Levenshtein and Hamming distance computation

created at Nov. 3, 2013, 2:02 p.m.

C

5 +0

117 +0

17 +0

GitHub
PyStanfordDependencies by dmcc

Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies

created at Dec. 13, 2014, 6:38 p.m.

Python

4 +0

68 +0

17 +0

GitHub
data-science-ipython-notebooks by donnemartin

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

created at Jan. 23, 2015, 7:38 p.m.

Python

1,616 +0

27,477 +31

7,882 +2

GitHub