nlp-datasets in theimpossibleastronaut/awesome-linguistics

Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)

updated at Nov. 23, 2024, 2:37 p.m.

Unknown languages

234 +0

5,784 +6

963 +1

GitHub