The corpus rus-tj_web_2019 is a Russian Web text corpus (Tajikistan) based on material from 2019.
It contains 3,442,116 sentences and 54,697,933 tokens.
Details
Leipzig Corpora Collection: Russian Web text corpus (Tajikistan) based on material from 2019. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=rus-tj_web_2019.
BibTeX
@misc{rus-tj_web_2019,
author = {Leipzig Corpora Collection},
title = {Russian Web text corpus (Tajikistan) based on material from 2019},
howpublished = {https://corpora.uni-leipzig.de?corpusId=rus-tj_web_2019},
note = {Accessed: 2024-12-09}
}