Keywords are words (single-token items), that appear more frequently in the focus corpus than in the reference corpus. They can be used to identify what is specific to one corpus (focus corpus) in comparison with another corpus (reference corpus). Keywords can be extracted using the Keywords & Terms tool in Sketch Engine. Typically, the largest corpus in the language will be selected as the reference corpus. The user can set a different corpus as the reference corpus.

see also


term extraction