nlp-datasets by niderhoff

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

created at March 24, 2016, 2:14 p.m.

Unknown languages

231 +0

5,648 +2

955 -2

GitHub
Awesome-Chinese-NLP by crownpku

A curated list of resources for Chinese NLP 中文自然语言处理相关资料

created at July 14, 2017, 4:07 a.m.

Unknown languages

389 +0

7,682 +9

1,706 +1

GitHub
sentence-transformers by UKPLab

Multilingual Sentence & Image Embeddings with BERT

created at July 24, 2019, 10:53 a.m.

Python

132 +0

13,922 +78

2,342 +6

GitHub
awesome-nlp by keon

book A curated list of resources dedicated to Natural Language Processing (NLP)

created at Dec. 1, 2015, 11:11 a.m.

Unknown languages

609 +0

16,065 +36

2,557 -1

GitHub
NLP-progress by sebastianruder

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

created at June 22, 2018, 5:43 p.m.

Python

1,282 +1

22,356 +26

3,608 +2

GitHub