Description
Vietnamese news corpus based on material from 2022
Details
Name |
vie_news_2022 |
Sentences |
1,695,819 |
Language |
Vietnamese
()
|
Types |
449,626 |
Genre |
News |
Tokens |
31,056,977 |
Year |
2022 |
Link to the corpus
https://corpora.uni-leipzig.de?corpusId=vie_news_2022
Annotations
coocSim
POS (RDRPOSTaggerMASTER - https://github.com/datquocnguyen/RDRPOSTagger/archive/master.zip)
Cite this corpus
Leipzig Corpora Collection: Vietnamese news corpus based on material from 2022. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=vie_news_2022.
BibTeX
@misc{vie_news_2022,
author = {Leipzig Corpora Collection},
title = {Vietnamese news corpus based on material from 2022},
howpublished = {https://corpora.uni-leipzig.de?corpusId=vie_news_2022},
note = {Accessed: 2024-12-08}
}