A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense, etc.) of each token in a text corpus.
This page lists the Marathi part-of-speech tagset developed by IIIT Hyderabad (International Institute of Information Technology, Hyderabad).
Example of a tag used in the CQL concordance search box: [tag="NN.*|NST"] finds all nouns, e.g. चांगुलपणा. (Note: make sure that you use straight double quotation marks.)
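
To see which tags a pattern such as `NN.*|NST` selects, the following minimal sketch checks it against the tags listed in the table on this page. It assumes that CQL compares the regular expression with the whole tag value, which is approximated here with Python's `re.fullmatch`; the names `TAGSET` and `matching_tags` are hypothetical helpers, not part of any corpus tool.

```python
import re

# Tags copied from the table on this page (DMQ is listed there as "DMQ (WQ)").
TAGSET = [
    "NN", "NST", "NNP", "PRP", "DEM", "VM", "VAUX", "JJ", "RB", "PSP",
    "RP", "QTF", "QTC", "QTO", "CC", "DMQ", "INTF", "INJ", "NEG", "XC",
    "RDP", "UNK", "SYM",
]


def matching_tags(pattern, tags=TAGSET):
    """Return the tags that the pattern matches in full (assumed CQL behaviour)."""
    compiled = re.compile(pattern)
    return [tag for tag in tags if compiled.fullmatch(tag)]


print(matching_tags("NN.*|NST"))  # ['NN', 'NST', 'NNP']
```

Under that assumption, the pattern covers common nouns (NN), proper nouns (NNP) and spatial or temporal nouns (NST), but not NEG, which only shares the initial letter.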
Tagset
| PoS Tag | Description | Note/Example |
|---|---|---|
| NN | Common Nouns | चांगुलपणा, मंडळी |
| NST | Noun Denoting Spatial and Temporal Expressions | मागे, पुढे, अगोदर |
| NNP | Proper Nouns (name of person) | मोहन, राम, गझलेमधील |
| PRP | Pronoun | मी, आपण, माझा |
| DEM | Demonstrative | तो, ती, हा |
| VM | Verb Main (Finite or Non-Finite) | बसणे, केली |
| VAUX | Verb Auxiliary | नये, लागली |
| JJ | Adjective (Modifier of Noun) | प्रभावी, तसा |
| RB | Adverb (Modifier of Verb) | आता, आज, देखील |
| PSP | Postposition | पर्यंतची, चा, पासून |
| RP | Particle | असेपर्यंत, म्हणजे |
| QTF | Quantifiers | अनेकदा, प्रत्येक, अन्य |
| QTC | Cardinals | एक, एकच, एकाच |
| QTO | Ordinals | दुसर्, दुसऱ्या, प्रथम |
| CC | Conjuncts (Coordinating and Subordinating) | व |
| DMQ (WQ) | Question Word | कोणत्या, कुठल्याच, किती |
| INTF | Intensifier | अतिशय, सर्वस्वी |
| INJ | Interjection | अणू, हां |
| NEG | Negative | न, ना |
| XC | Compounds | – |
| RDP | Reduplication | सुध्दा, पासूनच |
| UNK | Unknown | फारच |
| SYM | Symbol | >>, +, = |
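
When several tags from the table need to be searched at once, the CQL expression can be assembled from a list of tags. The sketch below is only an illustration: `build_tag_query` is a hypothetical helper, and it assumes the attribute is called `tag`, as in the example query on this page.

```python
def build_tag_query(tags):
    """Join tags with "|" into one CQL token query, using straight double quotes."""
    if not tags:
        raise ValueError("at least one tag is required")
    return '[tag="' + "|".join(tags) + '"]'


# All verbs: main verbs (VM) and auxiliaries (VAUX).
print(build_tag_query(["VM", "VAUX"]))  # [tag="VM|VAUX"]
```

Building the string in code avoids pasting typographic quotation marks into the query by accident.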




