vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

143 +0

8,179 +8

588 +0

GitHub
altair by vega

Declarative statistical visualization library for Python

created at Sept. 19, 2015, 3:14 a.m.

Python

141 +0

8,954 +17

764 +1

GitHub
dvc by iterative

🦉 ML Experiments and Data Management with Git

created at March 4, 2017, 8:16 a.m.

Python

140 +0

13,174 +27

1,142 +3

GitHub
snips-nlu by snipsco

Snips Python library to extract meaning from text

created at Feb. 8, 2017, 4:16 p.m.

Python

135 +0

3,868 +1

516 -1

GitHub
deepface by serengil

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

created at Feb. 8, 2020, 8:42 p.m.

Python

134 +0

10,198 +98

1,828 +14

GitHub
python-recsys by ocelma

A python library for implementing a recommender system

created at Oct. 10, 2011, 4:17 a.m.

Python

134 +0

1,472 +1

439 +0

GitHub
pycascading by twitter-archive

A Python wrapper for Cascading

created at Dec. 9, 2011, 11:56 p.m.

Python

130 +0

223 +0

40 +0

GitHub
albumentations by albumentations-team

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

created at June 6, 2018, 3:10 a.m.

Python

128 +0

13,472 +23

1,594 +0

GitHub
coach by IntelLabs

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

created at Oct. 1, 2017, 7:27 p.m.

Python

127 +0

2,309 +0

460 +0

GitHub
ggpy by yhat

ggplot port for python

created at Oct. 7, 2013, 1:30 p.m.

Python

126 +0

3,689 +2

571 +0

GitHub
haystack by deepset-ai

mag LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

created at Nov. 14, 2019, 9:05 a.m.

Python

126 +0

13,805 +71

1,641 +5

GitHub
dedupe by dedupeio

id A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

created at April 20, 2012, 2:57 p.m.

Python

120 +0

3,986 +3

539 +0

GitHub
mining by mining

Business Intelligence (BI) in Python, OLAP

created at Jan. 23, 2014, 7:20 p.m.

Python

118 +0

1,264 +0

231 +0

GitHub
optuna by optuna

A hyperparameter optimization framework

created at Feb. 21, 2018, 6:12 a.m.

Python

117 -1

9,728 +32

956 +4

GitHub
mlxtend by rasbt

A library of extension and helper modules for Python's data analysis and machine learning libraries.

created at Aug. 14, 2014, 1:56 a.m.

Python

117 +0

4,774 +5

846 +2

GitHub
vispy by vispy

Main repository for Vispy

created at March 21, 2013, 6:43 p.m.

Python

116 +0

3,226 +2

616 +0

GitHub
pytesseract by madmaze

A Python wrapper for Google Tesseract

created at Oct. 27, 2010, 11:02 p.m.

Python

108 +0

5,545 +15

696 +1

GitHub
simpleai by simpleai-team

simple artificial intelligence utilities

created at July 24, 2012, 11:12 p.m.

Python

107 +0

954 +1

249 +0

GitHub
sktime by sktime

A unified framework for machine learning with time series

created at Nov. 6, 2018, 3:08 p.m.

Python

102 +0

7,443 +26

1,285 +1

GitHub
brainstorm by IDSIA

Fast, flexible and fun neural networks.

created at Oct. 25, 2014, 10:20 a.m.

Python

99 -1

1,303 +0

153 +0

GitHub