• focus corpus

    In keyword and term extraction, the focus corpus is the corpus from which keywords and terms are extracted. Compare reference corpus.
  • freq/mill – frequency per million [ statistics ]

    a number of occurrences (hits) of an item normalised per million, also called as i.p.m. (instances per million). It is used to compare frequencies between corpora of different sizes. number of hits : corpus size in millions of tokens = frequency per million Example: A token found 10 times in a corpus of 1 million tokens will have a frequency per million equal to 10. A token found 100 times in a corpus of 100 million tokens will have a frequency per million equal to 1. The second token is less frequent. see also Statistics in Sketch Engine Frequency per million Average Reduced Frequency
  • frequency [ statistics ]

    Frequency (also absolute frequency) refers to the number of occurrences or hits. If a word, phrase, tag etc. has a frequency of 10, it means it was found 10 times or it exists 10 times. It is an absolute figure. It is not calculated using a specific formula. compare frequency per million see also ARF document frequency Statistics used in Sketch Engine