The corpus kor-kr_web_2019 is a Korean Web text corpus (South Korea) based on material from 2019.
It contains 32,138,987 sentences and 510,229,996 tokens.
Details
Leipzig Corpora Collection: Korean Web text corpus (South Korea) based on material from 2019. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=kor-kr_web_2019.
BibTeX
@misc{kor-kr_web_2019,
author = {Leipzig Corpora Collection},
title = {Korean Web text corpus (South Korea) based on material from 2019},
howpublished = {https://corpora.uni-leipzig.de?corpusId=kor-kr_web_2019},
note = {Accessed: 2025-03-24}
}