xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow

created at Feb. 6, 2014, 5:28 p.m.

C++

Watchers: 908 +0

Stars: 26,303 +26

Forks: 8,729 +3

GitHub

hadoop by apache

Apache Hadoop

created at Aug. 28, 2014, 7 a.m.

Java

Watchers: 987 -1

Stars: 14,777 +7

Forks: 8,866 +2

tesseract by tesseract-ocr

Tesseract Open Source OCR Engine (main repository)

created at Aug. 12, 2014, 6:04 p.m.

C++

Watchers: 1,692 +1

Stars: 62,379 +148

Forks: 9,520 +14

handson-ml by ageron

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

created at Feb. 16, 2016, 7:48 p.m.

Jupyter Notebook

Watchers: 1,085 +0

Stars: 25,200 +5

Forks: 12,913 +4

face_recognition by ageitgey

The world's simplest facial recognition api for Python and the command line

created at March 3, 2017, 9:52 p.m.

Python

Watchers: 1,567 +1

Stars: 53,455 +73

Forks: 13,492 +10

superset by apache

Apache Superset is a Data Visualization and Data Exploration Platform

created at July 21, 2015, 6:55 p.m.

TypeScript

Watchers: 1,519 +2

Stars: 62,806 +158

Forks: 13,874 +46

pydata-book by wesm

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

created at June 30, 2012, 6:39 p.m.

Jupyter Notebook

Watchers: 1,485 +0

Stars: 22,245 +29

Forks: 15,185 +14

caffe by BVLC

Caffe: a fast open framework for deep learning.

created at Sept. 12, 2013, 6:39 p.m.

C++

Watchers: 2,093 +0

Stars: 34,127 +10

Forks: 18,681 -6

keras by keras-team

Deep Learning for humans

created at March 28, 2015, 12:35 a.m.

Python

Watchers: 1,915 +1

Stars: 62,055 +74

Forks: 19,481 +6

darknet by pjreddie

Convolutional Neural Networks

created at April 11, 2014, 7:59 a.m.

C

Watchers: 911 +1

Stars: 25,843 +22

Forks: 21,330 +4

pytorch by pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

created at Aug. 13, 2016, 5:26 a.m.

Python

Watchers: 1,743 +1

Stars: 83,990 +192

Forks: 22,642 +50

transformers by huggingface

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

created at Oct. 29, 2018, 1:56 p.m.

Python

Watchers: 1,125 +1

Stars: 135,055 +359

Forks: 27,021 +85

spark by apache

Apache Spark - A unified analytics engine for large-scale data processing

created at Feb. 25, 2014, 8 a.m.

Scala

Watchers: 2,023 +1

Stars: 39,929 +83

Forks: 28,314 +8

opencv by opencv

Open Source Computer Vision Library

created at July 19, 2012, 9:40 a.m.

C++

Watchers: 2,652 -2

Stars: 79,137 +133

Forks: 55,831 +20

tensorflow by tensorflow

An Open Source Machine Learning Framework for Everyone

created at Nov. 7, 2015, 1:19 a.m.

C++

Watchers: 7,573 -5

Stars: 186,403 +124

Forks: 74,304 -1
