NLP-progress by sebastianruder

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

created at June 22, 2018, 5:43 p.m.

Python

1,266 +1

22,738 -9

3,622 -1

GitHub
awesome-nlp by keon

book A curated list of resources dedicated to Natural Language Processing (NLP)

created at Dec. 1, 2015, 11:11 a.m.

Unknown languages

609 -1

16,826 +10

2,590 +3

GitHub
sentence-transformers by UKPLab

State-of-the-Art Text Embeddings

created at July 24, 2019, 10:53 a.m.

Python

143 +0

15,565 +53

2,505 +5

GitHub
natural by NaturalNode

general natural language facilities for node

created at May 7, 2011, 2:35 a.m.

JavaScript

243 +0

10,669 +11

857 -2

GitHub
Awesome-Chinese-NLP by crownpku

A curated list of resources for Chinese NLP 中文自然语言处理相关资料

created at July 14, 2017, 4:07 a.m.

Unknown languages

389 +0

7,824 +3

1,710 +0

GitHub
nlp-datasets by niderhoff

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

created at March 24, 2016, 2:14 p.m.

Unknown languages

234 +0

5,804 +12

965 +0

GitHub
low-resource-languages by RichardLitt

Resources for conservation, development, and documentation of low resource (human) languages.

created at July 23, 2014, 3:31 a.m.

TeX

35 +0

392 -1

56 +0

GitHub
awesome-nlp-polish by ksopyla

A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.

created at Dec. 28, 2019, 11:26 a.m.

Unknown languages

28 +0

293 -1

34 +0

GitHub
id-nlp-resource by kmkurn

A list of Indonesian NLP resources.

created at April 7, 2018, 4:13 a.m.

Unknown languages

15 +0

279 +0

48 +0

GitHub
awesome-hungarian-nlp by oroszgy

A curated list of NLP resources for Hungarian

created at April 16, 2017, 6:53 p.m.

Unknown languages

20 +0

227 +0

18 +0

GitHub
awesome-community-curated-nlp by alvations

Community Curated NLP List

created at March 16, 2017, 1:58 p.m.

Unknown languages

20 +0

197 +0

33 -1

GitHub
norwegian-nlp-resources by web64

Norwegian NLP Resources

created at Dec. 20, 2016, 12:04 p.m.

Unknown languages

21 +0

178 +1

15 +0

GitHub
awesome-danish by fnielsen

A curated list of awesome resources for Danish language technology

created at Aug. 31, 2018, 12:52 p.m.

Unknown languages

16 +0

169 +0

18 +0

GitHub
berts by dbmdz

DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models

created at Sept. 21, 2019, 1:58 p.m.

Unknown languages

15 +0

155 +0

12 +0

GitHub
textblob-de by markuskiller

German language support for TextBlob.

created at July 8, 2014, 10:24 a.m.

Python

5 +0

104 +0

12 +0

GitHub
uralicNLP by mikahama

An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English. LLMs, FSTs and More!

created at Dec. 7, 2017, 1:18 p.m.

Python

7 +0

71 +0

7 +0

GitHub
german-elmo-model by t-systems-on-site-services-gmbh

This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.

created at June 30, 2019, 6:24 a.m.

Unknown languages

3 +0

28 +0

1 +0

GitHub
haxe-linguistics by sexybiggetje

Linguistical analysis and natural language processing library for Haxe.

created at Aug. 7, 2014, 6 p.m.

Haxe

3 +0

26 +0

2 +0

GitHub
german-transformer-training by German-NLP-Group

Plan and train German transformer models.

created at June 27, 2020, 5:59 p.m.

Python

6 +0

23 +0

2 +0

GitHub
EuroRomCom by kirkins

🇪🇺 Resources and Learning Games for European Romance Language Communication

created at March 10, 2017, 5:09 p.m.

JavaScript

5 +0

20 +0

1 +0

GitHub