Leipzig Corpora Collection

Search in 1018 Corpus-Based Monolingual Dictionaries for 290 Languages.

Selected language: German News 2010

Search suggestions: 120 · Berlin · uppdrag · skola · timmar

More information about: German News 2010 Change corpus

The corpus deu_news_2010_1M is a German news subcorpus based on material from 2010 (1,000,000 sentences). It contains 1,000,000 sentences and 17,052,446 tokens. Details

DOWNLOADS

Download parts of this corpus.

STATISTICS

More details about this corpus on our corpus and language statistics page.

Further services:

Description

German news subcorpus based on material from 2010 (1,000,000 sentences)

Details

Name	deu_news_2010_1M	Sentences	1,000,000
Language	German ()	Types	844,797
Genre	News	Tokens	17,052,446
Year	2010

Link to the corpus

https://corpora.uni-leipzig.de?corpusId=deu_news_2010_1M

Annotations

coocSim
wordsLevenshteinSim

Cite this corpus

Leipzig Corpora Collection: German news subcorpus based on material from 2010 (1,000,000 sentences). Leipzig Corpora Collection. Dataset. https://corpora.uni-leipzig.de?corpusId=deu_news_2010_1M. BibTeX

@misc{deu_news_2010_1M,
    author = {Leipzig Corpora Collection},
    title = {German news subcorpus based on material from 2010 (1,000,000 sentences)},
    howpublished = {https://corpora.uni-leipzig.de?corpusId=deu_news_2010_1M},
    note = {Accessed: 2024-07-27}
}