Leipzig Corpora Collection

Search in 1019 Corpus-Based Monolingual Dictionaries for 291 Languages.

Selected language: Vietnamese Newscrawl 2013

Search suggestions: chủ động · dễ · Trung tâm · tăng lên · Lãnh đạo

More information about: Vietnamese Newscrawl 2013 Change corpus

The corpus vie_newscrawl_2013_1M is a Vietnamese news subcorpus based on material crawled in 2013 (1,000,000 sentences). It contains 1,000,000 sentences and 18,037,085 tokens. Details

DOWNLOADS

Download parts of this corpus.

STATISTICS

More details about this corpus on our corpus and language statistics page.

Description

Vietnamese news subcorpus based on material crawled in 2013 (1,000,000 sentences)

Details

Name	vie_newscrawl_2013_1M	Sentences	1,000,000
Language	Vietnamese ()	Types	391,670
Genre	Newscrawl	Tokens	18,037,085
Year	2013

Link to the corpus

https://corpora.uni-leipzig.de?corpusId=vie_newscrawl_2013_1M

Cite this corpus

Leipzig Corpora Collection: Vietnamese news subcorpus based on material crawled in 2013 (1,000,000 sentences). Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=vie_newscrawl_2013_1M. BibTeX

@misc{vie_newscrawl_2013_1M,
    author = {Leipzig Corpora Collection},
    title = {Vietnamese news subcorpus based on material crawled in 2013 (1,000,000 sentences)},
    howpublished = {https://corpora.uni-leipzig.de?corpusId=vie_newscrawl_2013_1M},
    note = {Accessed: 2024-04-24}
}