Beschreibung
Gujarātī Nachrichten-Teilkorpus basierend auf Texten gecrawlt 2014 (1.000.000 Sätze)
Details
Name |
guj_newscrawl_2014_1M |
Sätze |
1.000.000 |
Sprache |
Gujarātī
()
|
Types |
583.642 |
Genre |
Newscrawl |
Tokens |
13.988.086 |
Jahr |
2014 |
Link zum Korpus
https://corpora.uni-leipzig.de?corpusId=guj_newscrawl_2014_1M
Zitieren Sie dieses Korpus
Leipzig Corpora Collection: Gujarati news subcorpus based on material crawled in 2014 (1,000,000 sentences). Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=guj_newscrawl_2014_1M.
BibTeX
@misc{guj_newscrawl_2014_1M,
author = {Leipzig Corpora Collection},
title = {Gujarati news subcorpus based on material crawled in 2014 (1,000,000 sentences)},
howpublished = {https://corpora.uni-leipzig.de?corpusId=guj_newscrawl_2014_1M},
note = {Accessed: 2025-03-27}
}