A tagset is a list of part-of-speech tags (POS tags for short), i.e. labels used to indicate the part of speech and sometimes also other grammatical categories (case, tense etc.) of each token in a text corpus.
When creating user corpora, the recommended tagset is always preselected. Using a different tagset is only recommended for advanced users. Tagsets cannot be normally changed for preloaded corpora.
Since Word Sketches, thesaurus, term extraction and trends make use of POS tagging, their respective settings (e.g. Word Sketch grammar, term grammar) must be based on the same tagset as the one used in the corpus.
Arabic corpora in Sketch Engine can have these tagsets:
(to check the tagset used, go to Corpus Statistics and details page)