vaex by vaexio

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

created at Sept. 27, 2014, 9:44 a.m.

Python

143 +0

8,215 +12

589 +0

GitHub
deeplake by activeloopai

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

created at Aug. 9, 2019, 6:17 a.m.

Python

87 +0

7,840 +12

602 +3

GitHub
swift by tensorflow

Swift for TensorFlow

created at April 24, 2018, 7:18 p.m.

Jupyter Notebook

261 +0

6,129 +1

606 +0

GitHub
cortex by cortexlabs

Production infrastructure for machine learning at scale

created at Jan. 24, 2019, 4:43 a.m.

Go

146 +0

8,003 +4

611 +3

GitHub
vispy by vispy

Main repository for Vispy

created at March 21, 2013, 6:43 p.m.

Python

116 +0

3,249 +1

618 +0

GitHub
nlp.js by axa-group

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more

created at July 30, 2018, 5:46 p.m.

JavaScript

106 +0

6,139 +11

618 +1

GitHub
djl by deepjavalibrary

An Engine-Agnostic Deep Learning Framework in Java

created at Oct. 29, 2019, 10:38 p.m.

Java

105 +0

3,932 +6

623 +1

GitHub
compromise by spencermountain

modest natural-language processing

created at July 5, 2011, 9:04 a.m.

JavaScript

165 +0

11,281 +9

642 -1

GitHub
CompreFace by exadel-inc

Leading free and open-source face recognition system

created at July 6, 2020, 8:29 a.m.

Java

80 +2

4,847 +55

657 +2

GitHub
synaptic by cazala

architecture-free neural network library for node.js and the browser

created at Sept. 30, 2014, 6:07 p.m.

JavaScript

282 +0

6,913 -2

665 +0

GitHub
PCV by jesolem

Open source Python module for computer vision

created at March 30, 2012, 5:31 a.m.

Python

166 +0

1,915 -1

674 +0

GitHub
cleanlab by cleanlab

The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

created at May 11, 2018, 1:55 a.m.

Python

85 +0

8,938 +21

688 +0

GitHub
weaviate by semi-technologies

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

created at March 30, 2016, 3:03 p.m.

Go

113 +1

10,085 +67

688 +3

GitHub
breeze by scalanlp

Breeze is a numerical processing library for Scala.

created at July 8, 2009, 11:22 p.m.

Scala

208 +0

3,437 +0

692 -1

GitHub
pgmpy by pgmpy

Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.

created at Sept. 20, 2013, 8:18 a.m.

Python

74 +0

2,639 +2

695 +1

GitHub
scalding by twitter

A Scala API for Cascading

created at Jan. 10, 2012, 4:22 p.m.

Scala

323 +0

3,479 +0

703 +0

GitHub
spark-nlp by JohnSnowLabs

State of the Art Natural Language Processing

created at Sept. 24, 2017, 7:36 p.m.

Scala

99 +0

3,748 +8

704 +0

GitHub
pytesseract by madmaze

A Python wrapper for Google Tesseract

created at Oct. 27, 2010, 11:02 p.m.

Python

109 +0

5,625 +11

706 +0

GitHub
dynet by clab

DyNet: The Dynamic Neural Network Toolkit

created at Feb. 8, 2015, 11:09 p.m.

C++

185 +0

3,414 +0

707 +0

GitHub
Gymnasium by Farama-Foundation

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

created at Sept. 8, 2022, 1:58 a.m.

Python

40 +0

6,128 +47

710 +7

GitHub