deSkELL: German corpus for SkELL
The German corpus for SkELL (deSkELL) is a German corpus made up of texts collected from the Internet. The texts come from the deTenTen corpus 2013 according to the choice of Egon W. Stemle from Eurac Research. The corpus was specially built up in order to provide the best sentence examples.
SkELL is an abbreviation of Sketch Engine for Language Learning. It is a freely available web interface suitable for German learning.
Good sentence examples
The corpus consists of only sentences (adjoining sentences does not have to relate to each other) which were sorted according to their text quality. This quality is computed by GDEX system that adds a score to each sentence. The score is mainly based on the sentence length (minimum and maximum length) and a word frequency of particular words which occur in the sentence. The sentences are sorted in the way that the sentences with the highest score are displayed as first results of a concordance.
The deSkELL corpus was tagged by RFTagger using this POS tagset.