milvus by milvus-io

A cloud-native vector database, storage for next generation AI applications

updated at May 26, 2024, 10:07 a.m.

Go

273 +0

27,424 +190

2,640 +13

GitHub
handson-ml by ageron

⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

updated at May 26, 2024, 10:06 a.m.

Jupyter Notebook

1,087 +0

25,108 +7

12,932 +3

GitHub
albumentations by albumentations-team

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

updated at May 26, 2024, 9:46 a.m.

Python

129 +0

13,542 +35

1,600 +2

GitHub
pytorch-lightning by PyTorchLightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

updated at May 26, 2024, 9:42 a.m.

Python

245 -1

27,143 +73

3,266 +3

GitHub
deepface by serengil

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

updated at May 26, 2024, 9:35 a.m.

Python

135 +0

10,356 +77

1,846 +6

GitHub
jieba by fxsjy

结巴中文分词

updated at May 26, 2024, 9:21 a.m.

Python

1,285 +0

32,562 +32

6,706 +1

GitHub
mlx by ml-explore

MLX: An array framework for Apple silicon

updated at May 26, 2024, 9:08 a.m.

C++

135 +0

14,914 +81

846 +1

GitHub
sylvester by jcoglan

Vector, matrix and geometry math JavaScript

updated at May 26, 2024, 8:57 a.m.

JavaScript

48 +0

1,152 +1

124 +0

GitHub
streamlit by streamlit

Streamlit — A faster way to build and share data apps.

updated at May 26, 2024, 8:57 a.m.

Python

316 +2

32,374 +173

2,818 +12

GitHub
ColossalAI by hpcaitech

Making large AI models cheaper, faster and more accessible

updated at May 26, 2024, 8:46 a.m.

Python

379 +1

38,071 +50

4,265 +5

GitHub
roboschool by openai

DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.

updated at May 26, 2024, 8:34 a.m.

Python

282 +0

2,103 +1

489 +1

GitHub
CoreNLP by stanfordnlp

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

updated at May 26, 2024, 8:25 a.m.

Java

488 +0

9,501 +13

2,694 +0

GitHub
neuraltalk2 by karpathy

Efficient Image Captioning code in Torch, runs on GPU

updated at May 26, 2024, 8:23 a.m.

Jupyter Notebook

274 +0

5,462 -4

1,263 +1

GitHub
Gymnasium by Farama-Foundation

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

updated at May 26, 2024, 8:15 a.m.

Python

39 +1

5,940 +42

689 +6

GitHub
pydata-book by wesm

Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

updated at May 26, 2024, 8:13 a.m.

Jupyter Notebook

1,474 +1

21,438 +37

14,884 +17

GitHub
introduction_to_ml_with_python by amueller

Notebooks and code for the book "Introduction to Machine Learning with Python"

updated at May 26, 2024, 8:12 a.m.

Jupyter Notebook

369 +0

7,229 +6

4,493 +2

GitHub
tokenizers by huggingface

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

updated at May 26, 2024, 8:12 a.m.

Rust

120 +0

8,541 +24

736 +4

GitHub
xgboost by dmlc

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

updated at May 26, 2024, 8:10 a.m.

C++

912 +1

25,661 +25

8,669 +0

GitHub
spark-nlp by JohnSnowLabs

State of the Art Natural Language Processing

updated at May 26, 2024, 8:02 a.m.

Scala

100 +0

3,720 +4

704 +2

GitHub
LightGBM by Microsoft

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

updated at May 26, 2024, 7:54 a.m.

C++

435 +0

16,148 +26

3,783 +5

GitHub