txtai by neuml

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

created at Aug. 9, 2020, 7:14 p.m.

Python

92 +0

9,369 +118

602 +5

GitHub
altair by vega

Declarative statistical visualization library for Python

created at Sept. 19, 2015, 3:14 a.m.

Python

140 +0

9,365 +16

793 -1

GitHub
fuzzywuzzy by seatgeek

Fuzzy String Matching in Python

created at July 8, 2011, 7:32 p.m.

Python

259 +1

9,231 +4

876 +1

GitHub
stable-baselines3 by DLR-RM

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

created at May 5, 2020, 5:52 a.m.

Python

64 -1

9,142 +53

1,704 +6

GitHub
pattern by clips

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

created at May 3, 2011, 3:29 p.m.

Python

543 +0

8,751 +7

1,577 +0

GitHub
pymc by pymc-devs

Bayesian Modeling and Probabilistic Programming in Python

created at Feb. 20, 2015, 5:12 p.m.

Python

226 +1

8,720 +6

2,015 +4

GitHub
pyod by yzhao062

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

created at Oct. 3, 2017, 8:29 p.m.

Python

145 +0

8,586 +24

1,369 +2

GitHub
einops by arogozhnikov

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

created at Sept. 22, 2018, 12:45 a.m.

Python

69 +1

8,519 +18

350 +0

GitHub
espnet by espnet

End-to-End Speech Processing Toolkit

created at Dec. 13, 2017, 12:45 a.m.

Python

NEW!

181 +0

8,505 +0

2,185 +0

GitHub
vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

144 +0

8,297 +7

590 +0

GitHub
deeplake by activeloopai

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

created at Aug. 9, 2019, 6:17 a.m.

Python

90 +0

8,176 +18

627 +5

GitHub
autogluon by autogluon

Fast and Accurate ML in 3 Lines of Code

created at July 29, 2019, 6:51 p.m.

Python

97 +0

8,042 +23

928 +1

GitHub
sktime by sktime

A unified framework for machine learning with time series

created at Nov. 6, 2018, 3:08 p.m.

Python

108 +0

7,947 +19

1,375 +6

GitHub
Gymnasium by Farama-Foundation

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

created at Sept. 8, 2022, 1:58 a.m.

Python

42 +0

7,372 +67

827 +14

GitHub
featuretools by alteryx

An open source python library for automated feature engineering

created at Sept. 8, 2017, 10:15 p.m.

Python

158 +0

7,270 +11

878 -1

GitHub
BentoML by bentoml

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

created at April 2, 2019, 1:39 a.m.

Python

77 +0

7,154 +13

792 +1

GitHub
SerpentAI by SerpentAI

Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!

created at April 16, 2017, 9:48 p.m.

Python

338 +0

6,779 +6

786 +0

GitHub
pkuseg-python by lancopku

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

created at Aug. 5, 2018, 6:41 a.m.

Python

208 +0

6,541 +3

986 +0

GitHub
snownlp by isnowfy

Python library for processing Chinese text

created at Nov. 26, 2013, 11:46 a.m.

Python

350 +0

6,439 +9

1,367 +2

GitHub
nupic-legacy by numenta

Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

created at April 5, 2013, 11:14 p.m.

Python

627 +0

6,336 +1

1,555 +0

GitHub