pylearn2 by lisa-lab

Warning: This project does not have any current developer. See bellow.

created at Nov. 22, 2010, 6 p.m.

Python

268 +0

2,754 +0

1,099 +1

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

545 +0

8,687 +9

1,574 +0

GitHub
simpleai by simpleai-team

simple artificial intelligence utilities

created at July 24, 2012, 11:12 p.m.

Python

107 +0

956 +1

250 +0

GitHub
featureforge by machinalis

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

created at Feb. 17, 2014, 5:21 p.m.

Python

34 +0

382 +0

77 +0

GitHub
xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

912 +1

25,661 +25

8,669 +0

GitHub
machine-learning by jeff1evesque

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

created at Aug. 13, 2014, 4:50 p.m.

JavaScript

22 +0

258 +0

86 +1

GitHub
chainer by chainer

A flexible framework of neural networks for deep learning

created at June 5, 2015, 5:50 a.m.

Python

288 +0

5,868 +1

1,366 -1

GitHub
auto_ml by ClimbsRocks

[UNMAINTAINED] Automated machine learning for analytics & production

created at Aug. 7, 2016, 9:35 p.m.

Python

98 +0

1,638 +0

313 +0

GitHub
dedupe by dedupeio

id A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

created at April 20, 2012, 2:57 p.m.

Python

120 +0

3,990 +1

540 +0

GitHub
DrQA by facebookresearch

Reading Wikipedia to Answer Open-Domain Questions

created at July 7, 2017, 10:44 p.m.

Python

157 +0

4,465 +0

900 +1

GitHub
polyglot by aboSamoor

Multilingual text (NLP) processing toolkit

created at June 30, 2014, 2:07 a.m.

Python

77 +0

2,272 +3

336 +0

GitHub
yase by PPACI

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

created at April 12, 2017, 6:45 p.m.

Python

2 +0

13 +0

1 +0

GitHub
cltk by cltk

The Classical Language Toolkit

created at Jan. 11, 2014, 11:59 p.m.

Python

65 +0

822 +2

325 +0

GitHub
stanford-corenlp-python by dasmith

Python wrapper for Stanford CoreNLP tools v3.4.1

created at Feb. 26, 2011, 6:20 p.m.

Python

41 +0

610 +0

229 +0

GitHub
textacy by chartbeat-labs

NLP, before and after spaCy

created at Feb. 3, 2016, 4:52 p.m.

Python

89 +0

2,180 +1

247 +0

GitHub
jellyfish by jamesturk

🪼 a python library for doing approximate and phonetic matching of strings.

created at July 9, 2010, 8:41 p.m.

Jupyter Notebook

42 +0

1,998 +1

156 +0

GitHub
fuzzywuzzy by seatgeek

Fuzzy String Matching in Python

created at July 8, 2011, 7:32 p.m.

Python

259 +0

9,137 +2

878 +0

GitHub
distance by doukremt

Levenshtein and Hamming distance computation

created at Nov. 3, 2013, 2:02 p.m.

C

5 +0

115 +0

17 +0

GitHub
PyStanfordDependencies by dmcc

Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies

created at Dec. 13, 2014, 6:38 p.m.

Python

4 +0

68 +0

17 +0

GitHub
data-science-ipython-notebooks by donnemartin

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

created at Jan. 23, 2015, 7:38 p.m.

Python

1,619 +0

26,549 +5

7,740 +7

GitHub