onyx by onyx-platform

Distributed, masterless, high performance, fault tolerant data processing

created at Dec. 2, 2013, 1:21 a.m.

Clojure

122 +0

2,050 +0

205 +0

GitHub
ggpy by yhat

ggplot port for python

created at Oct. 7, 2013, 1:30 p.m.

Python

125 +0

3,700 +0

572 +1

GitHub
coach by IntelLabs

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

created at Oct. 1, 2017, 7:27 p.m.

Python

126 +0

2,331 +1

461 +0

GitHub
qdrant by qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

created at May 30, 2020, 9:37 p.m.

Rust

126 +0

20,689 +82

1,410 +10

GitHub
weaviate by semi-technologies

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

created at March 30, 2016, 3:03 p.m.

Go

126 +0

11,587 +87

803 +6

GitHub
albumentations by albumentations-team

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

created at June 6, 2018, 3:10 a.m.

Python

127 -2

14,297 +39

1,648 -1

GitHub
ThinkBayes by AllenDowney

Code repository for Think Bayes.

created at July 8, 2013, 2:30 p.m.

TeX

128 +0

1,651 +3

1,938 +0

GitHub
pycascading by twitter-archive

A Python wrapper for Cascading

created at Dec. 9, 2011, 11:56 p.m.

Python

129 +0

222 +0

37 +0

GitHub
deepdetect by jolibrain

Deep Learning API and Server in C++14 support for PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE

created at May 22, 2015, 2:45 p.m.

C++

132 +0

2,519 +0

561 +0

GitHub
python-recsys by ocelma

A python library for implementing a recommender system

created at Oct. 10, 2011, 4:17 a.m.

Python

133 +0

1,475 +0

436 +0

GitHub
snips-nlu by snipsco

Snips Python library to extract meaning from text

created at Feb. 8, 2017, 4:16 p.m.

Python

134 +0

3,897 +0

512 -1

GitHub
dvc by iterative

🦉 Data Versioning and ML Experiments

created at March 4, 2017, 8:16 a.m.

Python

135 +0

13,946 +44

1,191 +4

GitHub
pycaret by pycaret

An open-source, low-code machine learning library in Python

created at Nov. 23, 2019, 6:40 p.m.

Jupyter Notebook

136 +0

8,964 +9

1,776 +2

GitHub
datumbox-framework by datumbox

Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.

created at Oct. 18, 2014, 6:10 p.m.

Java

137 +0

1,085 +0

282 +0

GitHub
altair by vega

Declarative statistical visualization library for Python

created at Sept. 19, 2015, 3:14 a.m.

Python

140 +0

9,393 +28

795 +2

GitHub
HLearn by mikeizbicki

Homomorphic machine learning

created at July 18, 2012, 12:08 a.m.

Haskell

141 +0

1,623 +1

135 +0

GitHub
deeppy by andersbll

Deep learning in Python

created at Sept. 18, 2014, 6:18 a.m.

Python

144 +0

1,380 +0

307 +0

GitHub
vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

144 +0

8,300 +3

591 +1

GitHub
haystack by deepset-ai

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

created at Nov. 14, 2019, 9:05 a.m.

Python

144 +0

17,834 +106

1,926 +9

GitHub
cortex by cortexlabs

Production infrastructure for machine learning at scale

created at Jan. 24, 2019, 4:43 a.m.

Go

145 +0

8,022 +2

607 +0

GitHub