Description
English Web text corpus (United Kingdom) based on material from 2002
Details
Name |
eng-uk_web_2002 |
Sentences |
49,628,893 |
Language |
English
()
|
Types |
4,785,862 |
Genre |
Web |
Tokens |
926,766,504 |
Year |
2002 |
Location |
United Kingdom of Great Britain and Northern Ireland |
Link to the corpus
https://corpora.uni-leipzig.de?corpusId=eng-uk_web_2002
Annotations
coocSim
GDEX
POS (TreeTagger - /disk/users/wortschatz/postagger/TreeTagger/parfiles/eng.par)
wordsLevenshteinSim
Cite this corpus
Leipzig Corpora Collection: English Web text corpus (United Kingdom) based on material from 2002. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=eng-uk_web_2002.
BibTeX
@misc{eng-uk_web_2002,
author = {Leipzig Corpora Collection},
title = {English Web text corpus (United Kingdom) based on material from 2002},
howpublished = {https://corpora.uni-leipzig.de?corpusId=eng-uk_web_2002},
note = {Accessed: 2025-03-23}
}