Description
Swahili (individual language) Wikipedia corpus based on material from 2011
Details
Name |
swh_wikipedia_2011 |
Sentences |
81,593 |
Language |
Swahili
()
|
Types |
117,667 |
Genre |
Wikipedia |
Tokens |
1,420,477 |
Year |
2011 |
Link to the corpus
https://corpora.uni-leipzig.de?corpusId=swh_wikipedia_2011
We want to thank
Wikipedia: Data
Annotations
coocSim
POS (TreeTagger - unknown)
wordsLevenshteinSim
Cite this corpus
Leipzig Corpora Collection: Swahili (individual language) Wikipedia corpus based on material from 2011. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=swh_wikipedia_2011.
BibTeX
@misc{swh_wikipedia_2011,
author = {Leipzig Corpora Collection},
title = {Swahili (individual language) Wikipedia corpus based on material from 2011},
howpublished = {https://corpora.uni-leipzig.de?corpusId=swh_wikipedia_2011},
note = {Accessed: 2024-12-05}
}