Search in 1018 Corpus-Based Monolingual Dictionaries for 290 Languages.

The corpus mal_wikipedia_2021 is a Malayalam Wikipedia corpus based on material from 2021. It contains 777,441 sentences and 7,269,090 tokens. Details


Download parts of this corpus.
More details about this corpus on our corpus and language statistics page.