CzechParl: Corpus of Stenographic Protocols from Czech Parliament
The Corpus of Stenographic Protocols from Czech Parliament (CzechParl) is a language corpus built from stenographic protocols recorded during plenary meetings of the Czech Parliament in its modern era, from 1993 to 2012.
The corpus captures the language used by politicians at regular meetings of the Parliament of the Czech Republic. Users can restrict a search to the texts of a specific member of parliament, to a year or date, or to a particular speaker role.
The CzechParl corpus was annotated by the morphological analyser MAJKA using the following POS tagset legend. The output was then disambiguated with the DESAMB disambiguator.