A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Hebrew YAP part-of-speech tagset

Hebrew corpora annotated with YAP (Yet Another (natural language) Parser) are tagged with this part-of-speech scheme. YAP is part of the ONLP lab toolkit.

The following table shows the Hebrew part-of-speech tagset.

An example of a tag in the CQL concordance search box: [tag="NNP"] finds all proper nouns. (Note: make sure that you use straight double quotation marks.)
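Curly quotation marks often creep in when a query is copied from a word processor or a web page. The following Python sketch builds a tag query in the form shown above and replaces curly double quotes with straight ones. The names `tag_query` and `straighten_quotes` are hypothetical helpers written for this illustration; they are not part of Sketch Engine or YAP.

```python
# Hypothetical helpers for illustration; not part of Sketch Engine or YAP.

# Map common curly double quotation marks to the straight ASCII quote.
CURLY_TO_STRAIGHT = str.maketrans({
    "\u201c": '"',  # left double quotation mark
    "\u201d": '"',  # right double quotation mark
    "\u201e": '"',  # double low-9 quotation mark
})


def straighten_quotes(query: str) -> str:
    """Replace curly double quotes in a query with straight ones."""
    return query.translate(CURLY_TO_STRAIGHT)


def tag_query(tag: str) -> str:
    """Build a CQL expression of the form [tag="..."] for a single tag."""
    return f'[tag="{tag}"]'


print(tag_query("NNP"))
print(straighten_quotes("[tag=\u201cNNP\u201d]"))
```

Both calls print `[tag="NNP"]`, which can be pasted into the CQL search box.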

POS Tag Description Example
ABVERB The word כ before numerals  
AT Accusative marker את which appears as a separate word in written Hebrew  
BN Participle (בינוני)  
BNN Participle (בינוני) in construct state form  
CC Conjunction  
CD Numeral  
CDT Numeral in the construct state  
CONJ Coordinating conjunction ו  
COP Copula  
COP_TOINFINITIVE The infinitive form of the verb היה used as a copula  
DEF A special tag assigned to the definiteness marker ה, which appears with nouns, adjectives and numerals  
DT Used in the treebank only for the determiner כל with a pronominal suffix  
DTT Determiner  
DUMMY_AT Accusative marker את when used with a pronominal suffix  
EX Existential marker יש or אין  
IN Preposition  
INTJ Interjection  
JJ Adjective  
JJT Adjective in the construct state  
MD Modal predicates  
NN Noun  
NN_S_PP Noun with a pronominal suffix  
NNP Proper noun  
NNT Construct state noun  
P Prefix written as a separate word (אי, בלתי, אנטי etc.)  
POS Possessive preposition של and accusative marker את with a pronominal suffix  
PREPOSITION Inseparable preposition  
PRP Personal pronoun  
QW Question word  
S_PRN Personal pronoun attached to a preposition as a pronominal suffix  
TEMP Subordinating conjunction introducing time clauses  
VB Verb  
VB_TOINFINITIVE A verb in its infinitive form  
yy.* Various symbols  
yyCLN Colon :
yyCM Comma ,
yyDASH Hyphen or dash - or –
yyDOT Period .
yyELPS Ellipsis (…)
yyEXCL Exclamation point !
yyLRB Left parenthesis (
yyQM Question mark ?
yyQUOT Quotation mark
yyRRB Right parenthesis )
yySCLN Semicolon ;
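The entry yy.* in the table reads like a regular expression that covers every punctuation tag beginning with yy. The sketch below illustrates how such a pattern selects tags. It uses Python's `re` module as a stand-in and assumes that a pattern must match the whole tag value; the regular expression dialect used in CQL may differ in details. `matching_tags` is a hypothetical helper, and `TAGS` is only a subset of the tags listed above.

```python
import re

# A subset of the tags from the table above, for illustration only.
TAGS = [
    "NN", "NNP", "NNT", "NN_S_PP",
    "VB", "VB_TOINFINITIVE",
    "yyCLN", "yyCM", "yyDOT", "yyQM",
]


def matching_tags(pattern: str, tags=TAGS) -> list[str]:
    """Return the tags that the pattern matches in full.

    Assumption: the pattern has to match the entire tag value,
    so "NN" matches only NN and not NNP or NNT.
    """
    compiled = re.compile(pattern)
    return [tag for tag in tags if compiled.fullmatch(tag)]


print(matching_tags("yy.*"))  # all punctuation tags in TAGS
print(matching_tags("NN.*"))  # NN and every tag that starts with NN
print(matching_tags("NN"))    # NN only
```

Under the same assumption, a query such as [tag="NN.*"] would retrieve plain nouns together with proper nouns, construct state nouns and nouns with a pronominal suffix, while [tag="NN"] would retrieve plain nouns only.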

Reference

https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.22.8581&rep=rep1&type=pdf (Hebrew tagset can be found on page 7 and following)

Hebrew corpora

Sketch Engine provides access to Hebrew corpora.
