Nynorskkorpuset: Norwegian corpus
The Nynorskkorpuset corpus is a Norwegian corpus made up of texts written in Nynorsk, one of the written standards of the Norwegian language. It consists of fiction, newspaper texts, journal articles, textbook texts, religious texts and texts from official public language use. The texts cover the period from the 1870s to the present day, with the main emphasis on the last fifty years.
More information about the corpus is available at http://no2014.uio.no/korpuset/ (in Norwegian).
The Nynorskkorpuset corpus is tagged with the Brill tagger, using the following part-of-speech (PoS) tagset for Nynorsk.
This Nynorsk corpus is available to users with a regular subscription.