vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

143 +0

8,203 +7

589 +0

GitHub
deeplake by activeloopai

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

created at Aug. 9, 2019, 6:17 a.m.

Python

87 +0

7,828 +18

599 -1

GitHub
swift by tensorflow

Swift for TensorFlow

created at April 24, 2018, 7:18 p.m.

Jupyter Notebook

261 +0

6,128 +7

606 +0

GitHub
cortex by cortexlabs

Production infrastructure for machine learning at scale

created at Jan. 24, 2019, 4:43 a.m.

Go

146 +1

7,999 +1

608 +0

GitHub
nlp.js by axa-group

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and more

created at July 30, 2018, 5:46 p.m.

JavaScript

106 +0

6,128 +5

617 +1

GitHub
vispy by vispy

Main repository for Vispy

created at March 21, 2013, 6:43 p.m.

Python

116 +0

3,248 +7

618 +2

GitHub
djl by deepjavalibrary

An Engine-Agnostic Deep Learning Framework in Java

created at Oct. 29, 2019, 10:38 p.m.

Java

105 +0

3,926 +1

622 +0

GitHub
compromise by spencermountain

modest natural-language processing

created at July 5, 2011, 9:04 a.m.

JavaScript

165 +0

11,272 +10

643 +0

GitHub
CompreFace by exadel-inc

Leading free and open-source face recognition system

created at July 6, 2020, 8:29 a.m.

Java

78 +0

4,792 +52

655 +5

GitHub
synaptic by cazala

architecture-free neural network library for node.js and the browser

created at Sept. 30, 2014, 6:07 p.m.

JavaScript

282 +0

6,915 +0

665 +0

GitHub
PCV by jesolem

Open source Python module for computer vision

created at March 30, 2012, 5:31 a.m.

Python

166 +0

1,916 +1

674 +0

GitHub
weaviate by semi-technologies

Weaviate is an open-source vector database that stores both objects and vectors, combining vector search with structured filtering and the fault tolerance and scalability of a cloud-native database.

created at March 30, 2016, 3:03 p.m.

Go

112 +1

10,018 +64

685 +7

GitHub
cleanlab by cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

created at May 11, 2018, 1:55 a.m.

Python

85 +0

8,917 +14

688 +3

GitHub
breeze by scalanlp

Breeze is a numerical processing library for Scala.

created at July 8, 2009, 11:22 p.m.

Scala

208 +0

3,437 +0

693 +1

GitHub
pgmpy by pgmpy

Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.

created at Sept. 20, 2013, 8:18 a.m.

Python

74 +0

2,637 +2

694 +1

GitHub
scalding by twitter

A Scala API for Cascading

created at Jan. 10, 2012, 4:22 p.m.

Scala

323 +0

3,479 +1

703 +0

GitHub
Gymnasium by Farama-Foundation

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

created at Sept. 8, 2022, 1:58 a.m.

Python

40 +0

6,081 +43

703 +8

GitHub
spark-nlp by JohnSnowLabs

State of the Art Natural Language Processing

created at Sept. 24, 2017, 7:36 p.m.

Scala

99 +0

3,740 +10

704 +0

GitHub
pytesseract by madmaze

A Python wrapper for Google Tesseract

created at Oct. 27, 2010, 11:02 p.m.

Python

109 +1

5,614 +11

706 +1

GitHub
dynet by clab

DyNet: The Dynamic Neural Network Toolkit

created at Feb. 8, 2015, 11:09 p.m.

C++

185 +0

3,414 +2

707 +0

GitHub