A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Bulgarian part-of-speech tagset is available in Bulgarian corpora. Tagset was developed by The Department of Computational Linguistics.

An Example of a tag in the CQL concordance search box[tag="N.*"] finds all nouns, e.g. България, време (note: please make sure that you use straight double quotation marks)

Tagset

Categories of the Lemma

PoS Tag
Noun N
Verb V
Adjective A
Pronoun P
Adverb D
Numeral C
Preposition R
Conjunction J
Participle T
Interjection I
PoS Attribute Value Tag Position
Noun Type Common C 2nd
Noun Type Proper P 2nd
Noun Gender Masculine M 3rd
Noun Gender Feminine F 3rd
Noun Gender Neutral N 3rd
Noun Gender Masculine/Neutral S 3rd
Noun Gender Masculine/Feminine E 3rd
Noun Type Family name A 3rd
Noun Number Singularia tantum S 4th
Noun Number Pluralia tantum P 4th
Verb Type Personal 2 2nd
Verb Type Impersonal M 2nd
Verb Transitivity Transitive T 3rd
Verb Transitivity Intransitive I 3rd
Verb Aspect Perfective P 4th
Verb Aspect Imperfective I 4th
Pronoun Type Personal P 2nd
Pronoun Type Possessive L 2nd
Pronoun Type Reflexive H 2nd
Pronoun Type General B 2nd
Pronoun Type Relative G 2nd
Pronoun Type Indefinite I 2nd
Pronoun Type Negative W 2nd
Pronoun Type Indefinite I 2nd
Pronoun Type For persons and objects B 3rd
Pronoun Type For attributes C 3rd
Pronoun Type For quantity D 3rd
Pronoun Type For possession F 3rd
Numeral Type Ordinal masculine K 2nd
Numeral Type Ordinal approximate O 2nd
Numeral Type Ordinal one Q 2nd
Numeral Type Ordinal two S 2nd
Numeral Type Ordinal fraction U 2nd
Numeral Type Cardinal V 2nd

Categories of the Word Form

PoS Attribute Value Tag
Noun, Verb, Adjective, Pronoun, Numeral Number Singular s
Noun, Verb, Adjective, Pronoun, Numeral Number Plural p
Noun Number Countable c
Noun Case Vocative v
Noun, Verb, Adjective, Pronoun, Numeral Definiteness Indefinite article 0
Noun, Verb, Adjective, Pronoun, Numeral Definiteness Definite article d
Noun, Verb, Adjective, Pronoun, Numeral Definiteness Short definite article h
Noun, Verb, Adjective, Pronoun, Numeral Definiteness Long definite article l
Noun, Verb, Adjective, Pronoun, Numeral Gender Masculine m
Noun, Verb, Adjective, Pronoun, Numeral Gender Feminine f
Noun, Verb, Adjective, Pronoun, Numeral Gender Neuter n
Verb Tense Present R
Verb Tense Aorist E
Verb Tense Imperfect D
Verb Mood Imperative I
Verb Participle Present Y
Verb Participle Past perfective X
Verb Participle Past imperfective W
Verb Participle Verbal adverb Z
Verb, Pronoun Person First 1
Verb, Pronoun Person Second 2
Verb, Pronoun Person Third 3
Numeral Form Counting b
Numeral Form Masculine personal b
Numeral Form Approximate x
Pronoun Case Accusative a
Pronoun Case Dative d
Pronoun Value Description Masculine M
Pronoun Value Description Feminine F
Pronoun Value Description Neuter N
Pronoun Possessor’s Number Singular S
Pronoun Possessor’s Number Plural P
Pronoun Possessor’s person First 1
Pronoun Possessor’s person Second 2
Pronoun Possessor’s person Third 3
Pronoun Form Clitic C

Source: http://dcl.bas.bg/en

Bulgarian corpora

Sketch Engine offers dozens Bulgarian language corpora.

or