A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
The simplified Korean HanNanum part-of-speech tagset is available in Korean corpora annotated with HanNanum, a Korean morphological analyzer and POS tagger.
Tags are assigned to words using the 22-tag configuration of the HanNanum demo Java class. Custom pre- and post-processing scripts were used to improve lemmatization, add sentence markup, and add English tag names.
An example of a tag query in the CQL concordance search box: [tag="V"] finds all verbs, e.g. 하는. (Note: make sure that you use straight double quotation marks.)
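The query pattern above can be sketched in a few lines of Python. This is only an illustration: the helper names `tag_query` and `straighten_quotes` are hypothetical and are not part of Sketch Engine. The sketch builds a `[tag="..."]` query string and converts typographic double quotes to straight ones.

```python
# Hypothetical helpers for illustration only; they are not a Sketch Engine API.

# Map typographic double quotes to the straight double quote.
_CURLY_TO_STRAIGHT = str.maketrans({"\u201c": '"', "\u201d": '"'})


def tag_query(tag: str) -> str:
    """Build a CQL token query matching one simplified tag, e.g. [tag="V"]."""
    return f'[tag="{tag}"]'


def straighten_quotes(query: str) -> str:
    """Replace typographic double quotes with straight double quotes."""
    return query.translate(_CURLY_TO_STRAIGHT)


if __name__ == "__main__":
    print(tag_query("V"))
    print(straighten_quotes("[tag=\u201cV\u201d]"))
```

Running a pasted query through `straighten_quotes` first is one way to guard against word processors that replace straight quotes automatically.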
Tagset
| ORIGINAL | SIMPLIFIED (in Sketch Engine) | DESCRIPTION | EXAMPLE |
|---|---|---|---|
| NOMINALS | | | |
| NC | N | noun | |
| NQ | NP | proper noun | |
| NB | N | noun | |
| NP | Pron | pronoun | |
| NN | Num | numeral | |
| PREDICATES | | | |
| PV | V | verb | |
| PA | Adj | adjective | |
| PX | VAux | auxiliary verb | |
| MODIFIERS | | | |
| MM | Det | determiner | |
| MA | Adv | adverb | |
| INTERJECTION | | | |
| II | Interj | interjection | |
| RELATIONAL SUFFIX | | | |
| JC | Suff | | |
| JX | Suff | | |
| JP | Suff | | |
| ENDING | | | |
| EP | Suff | | |
| EC | Suff | | |
| ET | Suff | | |
| EF | Suff | | |
| AFFIXES | | | |
| XP | Pref | prefix | |
| XS | Suff | suffix | |
| SYMBOLS | | | |
| SP | Sym | comma | |
| SL | Sym | opening quotation marks and brackets | |
| SD | Sym | hyphen or dash | |
| SU | Sym | unit symbol | |
| SF | Sym | full stop | |
| SR | Sym | closing quotation marks and brackets | |
| SE | Sym | ellipsis | |
| SY | Sym | other symbols | |
| FOREIGN WORDS | | | |
| F | X | foreign word | |
Source: https://www.sketchengine.eu/wp-content/uploads/Original-HanNanum-manual.pdf
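
The table above can also be expressed as a lookup, which may be handy when converting between the original and simplified tags. The following is a minimal sketch that copies the tag pairs from the table; the names `ORIGINAL_TO_SIMPLIFIED`, `simplify` and `originals_for` are hypothetical and do not come from HanNanum or Sketch Engine.

```python
# Tag pairs copied from the table above (original HanNanum tag -> simplified tag).
# The variable and function names are illustrative assumptions only.
ORIGINAL_TO_SIMPLIFIED = {
    # Nominals
    "NC": "N", "NQ": "NP", "NB": "N", "NP": "Pron", "NN": "Num",
    # Predicates
    "PV": "V", "PA": "Adj", "PX": "VAux",
    # Modifiers
    "MM": "Det", "MA": "Adv",
    # Interjection
    "II": "Interj",
    # Relational suffixes
    "JC": "Suff", "JX": "Suff", "JP": "Suff",
    # Endings
    "EP": "Suff", "EC": "Suff", "ET": "Suff", "EF": "Suff",
    # Affixes
    "XP": "Pref", "XS": "Suff",
    # Symbols
    "SP": "Sym", "SL": "Sym", "SD": "Sym", "SU": "Sym",
    "SF": "Sym", "SR": "Sym", "SE": "Sym", "SY": "Sym",
    # Foreign words
    "F": "X",
}


def simplify(original_tag: str) -> str:
    """Return the simplified tag for an original HanNanum tag."""
    return ORIGINAL_TO_SIMPLIFIED[original_tag]


def originals_for(simplified_tag: str) -> list[str]:
    """Return all original tags that collapse into the given simplified tag."""
    return sorted(
        original
        for original, simplified in ORIGINAL_TO_SIMPLIFIED.items()
        if simplified == simplified_tag
    )


if __name__ == "__main__":
    print(simplify("PV"))
    print(originals_for("N"))
```

Note that `NP` appears in both columns with different meanings: the original `NP` is a pronoun, while the simplified `NP` is a proper noun (original `NQ`). Keeping the direction of the mapping explicit, as in the sketch, avoids mixing the two up.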