nlp-datasets in theimpossibleastronaut/awesome-linguistics

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP).

Updated on June 2, 2024, 11:46 a.m.

Unknown languages

