Multilingual-BERT by tonianelope

Investigating multilingual language models (BERT) by using them for NER in German and English

updated at April 22, 2022, 1:05 p.m.

Jupyter Notebook

3 +0

14 +0

0 +0

GitHub
german-elmo-model by t-systems-on-site-services-gmbh

This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.

updated at Jan. 27, 2023, 11:56 a.m.

Unknown languages

3 +0

28 +0

1 +0

GitHub
haxe-linguistics by sexybiggetje

Linguistical analysis and natural language processing library for Haxe.

updated at Jan. 27, 2023, 10:41 p.m.

Haxe

3 +0

26 +0

2 +0

GitHub
gevalm by DFKI-NLP

Code and data for the paper "Evaluating German Transformer Language Models with Syntactic Agreement Tests" (Zaczynska et al., 2020)

updated at Aug. 30, 2023, 9:10 a.m.

Python

6 +0

7 +0

2 +0

GitHub
OpinionSpam by hdaSprachtechnologie

German Opionion Spam Corpus

updated at Oct. 22, 2023, 10:31 a.m.

Unknown languages

0 +0

2 +0

0 +0

GitHub
DysListGerman by Rauschii

DysList, a list of dyslexic errors annotated with linguistic, phonetic and visual features. Presented 2016 at the LREC conference: Rauschenberger, Maria; Rello, Luz; Füchse, Silke & Thomaschewski, Jörg. 2016. A Language Resource of German Errors Written by Children with Dyslexia. [In Press] Proc. LREC 2016. Portorož (Slovenia), 23-28, May. The Resource and further information are also available at the Web Research Group from UPF at http://grupoweb.upf.es/WRG/DysWebxia.php?lang=#resources

updated at Jan. 31, 2024, 9:57 p.m.

CSS

1 +0

5 +0

0 +0

GitHub
EuroRomCom by kirkins

🇪🇺 Resources and Learning Games for European Romance Language Communication

updated at June 10, 2024, 2 p.m.

JavaScript

5 +0

20 +0

1 +0

GitHub
berts by dbmdz

DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models

updated at Oct. 1, 2024, 5:54 p.m.

Unknown languages

15 +0

155 +0

12 +0

GitHub
textblob-de by markuskiller

German language support for TextBlob.

updated at Oct. 13, 2024, 3:03 a.m.

Python

5 +0

104 +0

12 +0

GitHub
german-transformer-training by German-NLP-Group

Plan and train German transformer models.

updated at Oct. 19, 2024, 9:58 a.m.

Python

6 +0

23 +0

2 +0

GitHub
gerpt2 by bminixhofer

German small and large versions of GPT2.

updated at Oct. 22, 2024, 7:42 p.m.

Python

1 +0

20 +0

0 +0

GitHub
awesome-community-curated-nlp by alvations

Community Curated NLP List

updated at Oct. 27, 2024, 7:25 a.m.

Unknown languages

20 +0

196 +0

33 +0

GitHub
norwegian-nlp-resources by web64

Norwegian NLP Resources

updated at Oct. 28, 2024, 10:39 a.m.

Unknown languages

21 +0

177 +0

15 +0

GitHub
id-nlp-resource by kmkurn

A list of Indonesian NLP resources.

updated at Nov. 16, 2024, 9:44 a.m.

Unknown languages

15 +0

279 +0

48 +0

GitHub
awesome-nlp-polish by ksopyla

A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.

updated at Nov. 19, 2024, 8:19 p.m.

Unknown languages

28 +0

294 +3

34 +0

GitHub
uralicNLP by mikahama

An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English

updated at Nov. 21, 2024, 5:42 p.m.

Python

7 +0

70 +0

7 +0

GitHub
Awesome-Chinese-NLP by crownpku

A curated list of resources for Chinese NLP 中文自然语言处理相关资料

updated at Nov. 22, 2024, 7:20 a.m.

Unknown languages

389 +0

7,810 +3

1,714 +2

GitHub
nlp-datasets by niderhoff

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

updated at Nov. 23, 2024, 2:37 p.m.

Unknown languages

234 +0

5,784 +6

963 +1

GitHub
natural by NaturalNode

general natural language facilities for node

updated at Nov. 23, 2024, 11:29 p.m.

JavaScript

243 -1

10,638 +15

859 -1

GitHub
low-resource-languages by RichardLitt

Resources for conservation, development, and documentation of low resource (human) languages.

updated at Nov. 24, 2024, 12:08 a.m.

TeX

35 +0

391 +1

56 +0

GitHub