Search in 1021 Corpus-Based Monolingual Dictionaries for 292 Languages.

The corpus sin_wikipedia_2011 is a Sinhalese Wikipedia corpus based on material from 2011. It contains 115,651 sentences and 1,813,432 tokens.


Download parts of this corpus.
More details about this corpus are available on our corpus and language statistics page.