-
parallel corpus [ corpus-types ]
A parallel corpus consists of the same text translated into one or more languages. The texts are aligned (matching segments, usual sentences, are linked). The corpus allows searches in one or both languages to look up or compare translations.
-
PoS
part of speech, some typical examples of parts of speech are: noun, adjective, verb, adverb etc. -
POS tag [ attribute ]
A POS tag is the same as tag. -
POS tagger
POS (part of speech) tagging is a process of annotating each token with a tag carrying information about the part of speech and often also morphological and grammatical information such as number, gender, case, tense etc. The automatic tagging tool is called a tagger or POS tagger. See also lemmatization stemming -
positional attribute
A positional attribute is information added to each token in a corpus, e.g. its lemma (basic form of a word) or part of speech. Attributes differ between corpora and even between corpora in the same language. For example, the attributes of the plural of the English word dog might look like this.
Attributes available in the corpus are listed on the corpus statistics and detail page see also list of positional attributesword lemma tag lempos dogs dog n dog-n -
preloaded corpus [ corpus-types ]
a ready-to-use corpus included in Sketch Engine subscription or Trial access, not created by a user, e.g. British National Corpus



