Stanford Arabic parser tagset

A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.

Stanford Arabic parser tagset is available in Arabic corpora processed by the Stanford Arabic Parser. This tool is developed by The Stanford Natural Language Processing Group at Stanford University.

Arabic tagsets

used in Sketch Engine

An Example of a tag in the CQL concordance search box: [tag="VBD"] finds all verb past tenses, e.g. كان

Tagset summary

Basic notation

noun	(DT)?NN.*
verb	VB.*
adjective	(DT)?JJ.*
adverb	W?RB
conjunction	CC
preposition	IN
pronoun	PRP.?
cardinal number	CD

Complete notation

ADJ	adj
CC	Coordinating conjunction
CD	Cardinal number
DT	determiner
DTJJ	adjective with the determiner “Al” (ال)
DTJJR	adjective, comparative with the determiner “Al” (ال)
DTNN	noun, singular or mass with the determiner “Al” (ال)
DTNNP	Proper noun, singular with the determiner “Al” (ال)
DTNNPS	Proper noun, plural with the determiner “Al” (ال)
DTNNS	noun, plural with the determiner “Al” (ال)
IN	Preposition or subordinating conjunction
JJ	adjective
JJR	Adjective, comparative
NN	noun, singular or mass
NNP	Proper noun, singular
NNPS	Proper noun, plural
NNS	noun, plural
NOUN	noun
PRP	Personal pronoun
PRP$	Possessive pronoun
PUNC	punctuation
RB	adverb
RP	particle
UH	interjection
VB	verb, base form
VBD	Verb, past tense
VBG	verb, gerund or present participle
VBN	verb, past participle
VBP	Verb, non-3rd person singular present
VN	verb, past participle
WP	Wh-pronoun
WRB	Wh-adverb

Source: http://nlp.stanford.edu/software/parser-arabic-faq.shtml#d

Arabic text corpora in Sketch Engine

Sketch Engine offers dozens of Arabic language corpora.

available corpora

Other text corpora in Sketch Engine

Sketch Engine offers 700+ language corpora.

corpora in Sketch Engine

Tagset summary

Arabic text corpora in Sketch Engine

Other text corpora in Sketch Engine

for learners of languages

A Course in Lexicography and Lexical Computing

term extraction

learn sketch engine