• parallel corpus [ corpus-types ]

    A parallel corpus consists of the same text translated into one or more languages. The texts are aligned (matching segments, usual sentences, are linked). The corpus allows searches in one or both languages to look up or compare translations.  
  • PoS

    part of speech, some typical examples of parts of speech are: noun, adjective, verb, adverb etc.
  • POS tag [ attribute ]

    A POS tag is the same as tag.
  • POS tagger

    POS (part of speech) tagging is a process of annotating each token with a tag carrying information about the part of speech and often also morphological and grammatical information such as number, gender, case, tense etc. The automatic tagging tool is called a tagger or POS tagger. See also lemmatization stemming
  • positional attribute

    A positional attribute is information added to each token in a corpus, typically its lemma or tag. (more…)
  • preloaded corpus [ corpus-types ]

    a ready-to-use corpus included in Sketch Engine subscription or Trial access, not created by a user, e.g. British National Corpus
  • prevertical file

    A prevertical file is a pain text file that contains the corpus text and structures. Usually, it is a source file for creating vertical files which are created by the tokenization process from the prevertical. (more…)