Description
English Web text corpus (United Kingdom) based on material from 2012
Details
Name |
eng-uk_web_2012 |
Sentences |
6,683,819 |
Language |
English
()
|
Types |
1,248,835 |
Genre |
Web |
Tokens |
125,923,368 |
Year |
2012 |
Location |
United Kingdom of Great Britain and Northern Ireland |
Link to the corpus
https://corpora.uni-leipzig.de?corpusId=eng-uk_web_2012
Annotations
coocSim
GDEX
POS (TreeTagger - unknown)
wordsLevenshteinSim
Cite this corpus
Leipzig Corpora Collection: English Web text corpus (United Kingdom) based on material from 2012. Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=eng-uk_web_2012.
BibTeX
@misc{eng-uk_web_2012,
author = {Leipzig Corpora Collection},
title = {English Web text corpus (United Kingdom) based on material from 2012},
howpublished = {https://corpora.uni-leipzig.de?corpusId=eng-uk_web_2012},
note = {Accessed: 2024-10-06}
}