We have improved our tools for processing Danish. Tokenization, lemmatization and part-of-speech tagging of Danish corpora have all been improved.

The largest Danish corpus, the Danish Web 2014 corpus (daTenTen14), has already been reprocessed. The improved tools are also available for all user corpora, but existing Danish user corpora must be recompiled to benefit from the improvements.

