The corpus xho-za_web_2019 is a Xhosa Web text corpus (South Africa) based on material from 2019.
It contains 7,648 sentences and 111,009 tokens.
Details
Leipzig Corpora Collection: Xhosa Web text corpus (South Africa) based on material from 2019. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=xho-za_web_2019.
BibTeX
@misc{xho-za_web_2019,
author = {Leipzig Corpora Collection},
title = {Xhosa Web text corpus (South Africa) based on material from 2019},
howpublished = {https://corpora.uni-leipzig.de?corpusId=xho-za_web_2019},
note = {Accessed: 2023-03-27}
}