Beschreibung
Arabisches Nachrichten-Teilkorpus basierend auf Texten gecrawlt 2013 (1.000.000 Sätze)
Details
Name |
ara_newscrawl_2013_1M |
Sätze |
1.000.000 |
Sprache |
Arabisch
()
|
Types |
871.269 |
Genre |
Newscrawl |
Tokens |
20.759.565 |
Jahr |
2013 |
Link zum Korpus
https://corpora.uni-leipzig.de?corpusId=ara_newscrawl_2013_1M
Annotationen
wordsLevenshteinSim
Zitieren Sie dieses Korpus
Leipzig Corpora Collection: Arabic news subcorpus based on material crawled in 2013 (1,000,000 sentences). Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=ara_newscrawl_2013_1M.
BibTeX
@misc{ara_newscrawl_2013_1M,
author = {Leipzig Corpora Collection},
title = {Arabic news subcorpus based on material crawled in 2013 (1,000,000 sentences)},
howpublished = {https://corpora.uni-leipzig.de?corpusId=ara_newscrawl_2013_1M},
note = {Accessed: 2024-12-08}
}