Description
Arabic news subcorpus based on material crawled in 2013 (1,000,000 sentences)
Details
Name | ara_newscrawl_2013_1M |
Sentences | 1,000,000 |
Language | Arabic |
Types | 871,269 |
Genre | Newscrawl |
Tokens | 20,759,565 |
Year | 2013 |
Corpus link
https://corpora.uni-leipzig.de?corpusId=ara_newscrawl_2013_1M
Annotations
wordsLevenshteinSim
Cite this corpus
Leipzig Corpora Collection: Arabic news subcorpus based on material crawled in 2013 (1,000,000 sentences). Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=ara_newscrawl_2013_1M.
BibTeX
@misc{ara_newscrawl_2013_1M,
author = {Leipzig Corpora Collection},
title = {Arabic news subcorpus based on material crawled in 2013 (1,000,000 sentences)},
howpublished = {https://corpora.uni-leipzig.de?corpusId=ara_newscrawl_2013_1M},
note = {Accessed: 2024-04-24}
}