Description
Finnish Web text corpus based on material from 2002
Details
Name |
fin_web_2002 |
Sentences |
4,737,045 |
Language |
Finnish
()
|
Types |
3,590,142 |
Genre |
Web |
Tokens |
55,173,465 |
Year |
2002 |
Link to the corpus
https://corpora.uni-leipzig.de?corpusId=fin_web_2002
Annotations
coocSim
GDEX
POS (TreeTagger - unknown)
wordsLevenshteinSim
Cite this corpus
Leipzig Corpora Collection: Finnish Web text corpus based on material from 2002. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=fin_web_2002.
BibTeX
@misc{fin_web_2002,
author = {Leipzig Corpora Collection},
title = {Finnish Web text corpus based on material from 2002},
howpublished = {https://corpora.uni-leipzig.de?corpusId=fin_web_2002},
note = {Accessed: 2024-09-08}
}