Search in 1018 Corpus-Based Monolingual Dictionaries for 291 Languages.

Selected language: Somali Wikipedia 2021

Search suggestions:  dhacda  ·  joogaan  ·  horreysa  ·  Jowhar  ·  gaadhaa

The corpus som_wikipedia_2021 is a Somali Wikipedia corpus based on material from 2021. It contains 23,268 sentences and 435,154 tokens. Details

Download parts of this corpus.
More details about this corpus on our corpus and language statistics page.