pandas-cookbook by jvns

Recipes for using Python's pandas library

updated at Nov. 17, 2024, 2:24 a.m.

Jupyter Notebook

305 +0

6,664 +7

2,317 +1

GitHub
opencv by opencv

Open Source Computer Vision Library

updated at Nov. 17, 2024, 2:22 a.m.

C++

2,652 -2

79,137 +133

55,831 +20

GitHub
jax by jax-ml

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

updated at Nov. 17, 2024, 2:12 a.m.

Python

337 +3

30,510 +74

2,802 +7

GitHub
pytorch-lightning by PyTorchLightning

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

updated at Nov. 17, 2024, 2:09 a.m.

Python

250 +1

28,392 +45

3,384 +1

GitHub
LightGBM by Microsoft

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

updated at Nov. 17, 2024, 2:09 a.m.

C++

434 +0

16,698 +19

3,834 +1

GitHub
ray by ray-project

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

updated at Nov. 17, 2024, 1:45 a.m.

Python

474 +0

34,015 +115

5,780 +19

GitHub
albumentations by albumentations-team

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

updated at Nov. 17, 2024, 1:30 a.m.

Python

129 +1

14,258 +30

1,649 +2

GitHub
fuzzywuzzy by seatgeek

Fuzzy String Matching in Python

updated at Nov. 17, 2024, 1:14 a.m.

Python

259 +1

9,231 +4

876 +1

GitHub
pycaret by pycaret

An open-source, low-code machine learning library in Python

updated at Nov. 17, 2024, 1:09 a.m.

Jupyter Notebook

136 +1

8,955 +26

1,774 +3

GitHub
rapaio by padreati

statistics, data mining and machine learning toolbox

updated at Nov. 17, 2024, 12:48 a.m.

Java

10 +0

69 +0

12 +0

GitHub
mindsdb by mindsdb

Platform for building AI that can learn and answer questions over federated data.

updated at Nov. 17, 2024, 12:40 a.m.

Python

398 -1

26,805 +47

4,887 +12

GitHub
spark by apache

Apache Spark - A unified analytics engine for large-scale data processing

updated at Nov. 17, 2024, 12:26 a.m.

Scala

2,023 +1

39,929 +83

28,314 +8

GitHub
pytorch_geometric by pyg-team

Graph Neural Network Library for PyTorch

updated at Nov. 17, 2024, 12:20 a.m.

Python

252 +0

21,387 +49

3,670 +12

GitHub
superset by apache

Apache Superset is a Data Visualization and Data Exploration Platform

updated at Nov. 17, 2024, 12:12 a.m.

TypeScript

1,519 +2

62,806 +158

13,874 +46

GitHub
tensorflow by tensorflow

An Open Source Machine Learning Framework for Everyone

updated at Nov. 17, 2024, 12:04 a.m.

C++

7,573 -5

186,403 +124

74,304 -1

GitHub
dash by plotly

Data Apps & Dashboards for Python. No JavaScript Required.

updated at Nov. 16, 2024, 11:47 p.m.

Python

426 +0

21,486 +33

2,070 +2

GitHub
weaviate by semi-technologies

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.

updated at Nov. 16, 2024, 11:47 p.m.

Go

126 -2

11,500 +90

797 +4

GitHub
stellargraph by stellargraph

StellarGraph - Machine Learning on Graphs

updated at Nov. 16, 2024, 11:47 p.m.

Python

62 +0

2,948 +1

431 +0

GitHub
polyglot by aboSamoor

Multilingual text (NLP) processing toolkit

updated at Nov. 16, 2024, 11:40 p.m.

Python

77 +0

2,315 +2

337 +0

GitHub
pytorch by pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

updated at Nov. 16, 2024, 11:30 p.m.

Python

1,743 +1

83,990 +192

22,642 +50

GitHub