nlp-datasets in theimpossibleastronaut/awesome-linguistics

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

updated at Sept. 21, 2024, 3 p.m.

Unknown languages

233 +0

5,728 +4

961 -1

GitHub