A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

MULTEXT-East Morphosyntactic Romanian Specification is available in Romanian corpora. The MULTEXT-East resources are a multilingual dataset for language engineering research and development.

An example of a tag as used in the CQL concordance search: [tag="N.m.*"] finds all masculine nouns, e.g. alineatul, membru (note: please make sure that you use straight double quotation marks)

Tagset

Part-of-speech tagset categories

PoS-en Code
Noun N.*
Verb V.*
Adjective A.*
Pronoun P.*
Determiner D.*
Article T.*
Adverb R.*
Adposition S.*
Conjunction C.*
 Numeral M.*
 Interjections I.*
  Residual X.*
Abbreviation Y.*
 Particle Q.*

Noun (N)

 Position Attribute Value Tag
1. Part-of-Speech Noun N
2. Type common c
proper p
3. Gender masculine m
feminine f
neuter n
4. Number singular s
plural p
5. Case direct r
oblique o
vocative v
6. Definiteness yes y
no n
7. Clitic yes y
no n

Verb (V)

Position Attribute Value Tag
1. Part-of-Speech verb V
2. Type main m
auxiliary a
modal o
copulative c
3. VForm indicative i
subjunctive s
imperative m
infinitive n
participle p
gerund g
4. Tense present p
imperfect i
past s
pluperfect l
5. Person first 1
second 2
third 3
6. Number singular s
plural p
7. Gender masculine m
feminine f
neuter n
8. Voice
9. Negation
10. Definiteness
11. Clitic yes y
no n

Adjective (A)

Position Attribute Value Tag
1. Part-of-speech adjective A
2. Type qualificative f
3. Degree positive p
comparative c
superlative s
4. Gender masculine m
feminine f
neuter n
5. Number singular s
plural p
6. Case direct r
oblique o
vocative v
7. Definiteness yes y
no n
8- Clitic yes y
no n

Pronoun (P)

Position Attribute Value Tag
1. Part-of-speech pronoun P
2. Type demonstrative d
indefinite i
possesive s
int_rel w
personal p
reflexive x
negative z
emphatic h
3. Person first 1
second 2
third 3
4. Gender masculine m
feminine f
neuter n
5. Number singular s
plural p
6. Case nominative n
genitive g
dative d
accusative a
vocative v
direct r
oblique o
7. Owner_Number singular s
plural p
8. Owner_Gender
9. Clitic yes y
no n
10. Referent_Type
11. Syntactic_Type
12. Definiteness
13. Animate
14. Clitic_s
15. Pronoun_Form strong s
weak w

Determiner (D)

Positionn Attribute Value Tag
1. Part-of-speech determiner D
2. Type demonstrative d
indefinite i
possesive s
int_rel w
negative z
emphatic h
3. Person first 1
second 2
third 3
4. Gender masculine m
feminine f
neuter n
5. Number singular s
plural p
6. Case direct r
oblique o
7. Owner_Number singular s
plural p
8. Owner_Gender
9. Clitic yes y
no n
10. Modific_Type prenominal e
postnominal o

Article (T)

Position Attribute Value Tag
1. Part-of-speech article T
2. Type definite f
indefinite i
possessive s
demonstrative d
3. Gender masculine m
feminine f
neuter n
4. Number singular s
plural p
5. Case direct r
oblique o
6. Clitic yes y
no n

Adverb (R)

Position Attribute Value Tag
1. Part-of-speech adverb R
2. Type general g
particle p
negative z
modifier m
int_rel w
portmanteau c
3. Degree positive p
comparative c
superlative s
4. Clitic yes y
no n

Adposition (S)

Position Attribute Value Tag
1. Part-of-speech adposition S
2. Type preposition p
3. Formation simple s
compound c
4. Case genitive g
dative d
accusative a
5. Clitic yes y
no n

Conjunction (C)

Position Attribute Value Tag
1. Part-of-speech conjunction C
2. Type coordinating c
subordinating s
portmanteau r
3. Formation simple s
compound c
4. Coord_Type simple s
repetit r
correlat c
5. Sub_Type negative z
positive p
6. Clitic yes y
no n

Numeral (M)

Position Attribute Value Tag
1. Part-of-speech numeral M
2. Type cardinal c
ordinal o
fractal f
multiple m
collect l
3. Gender masculine m
feminine f
neuter n
4. Number singular s
plural p
5. Case direct r
oblique o
6. Form digit d
letter l
both b
roman r
7. Definiteness yes y
no n
8. Clitic yes y
no n

Interjections (I)

Position Attribute Value Tag
1. Part-of-speech interjections I

Residual (X)

Position Attribute Value Tag
1. Part-of-speech residual X

Abbreviation (Y)

Position Attribute Value Tag
1. Part-of-speech abbreviation Y
2. Syntactic_Type nominal n
verbal v
adjectival a
adverbial r
pronominal p
3. Gender masculine m
feminine f
neuter n
4. Number singular s
plural p
5. Case direct r
oblique o
6. Definiteness yes y
no n

Particle (Q)

Position Attribute Value Tag
1. Part-of-speech particle Q
2. Type negation z
infinitive n
subjunctive s
aspect
future
3. Formation
4. Clitic yes y
no n

Source: http://nl.ijs.si/ME/V3/msd/html/

Romanian text corpora in Sketch Engine

Sketch Engine offers dozens of Romanian language corpora.

or