Search in 1019 Corpus-Based Monolingual Dictionaries for 291 Languages.

The corpus sin_wikipedia_2021 is a Sinhalese Wikipedia corpus based on material from 2021. It contains 277,930 sentences and 4,167,674 tokens.


Download parts of this corpus.
More details about this corpus are available on our corpus and language statistics page.