Search in 1042 Corpus-Based Monolingual Dictionaries for 290 Languages.

Selected language: Sinhala Wikipedia 2021

The corpus sin_wikipedia_2021 is a Sinhalese Wikipedia corpus based on material from 2021. It contains 277,930 sentences and 4,167,674 tokens. Details


Download parts of this corpus.
More details about this corpus on our corpus and language statistics page.