Package description
===================

This package contains data for two languages: English and Czech with
corresponding suffixes _en and _cz respectively.

There were 3 annotators for English and 4 for Czech.

--------------------------------------------------------------------------------

dataset*
========

contains main data in several columns delimited with TAB:
headword[TAB]wordclass[TAB]frequency_band[TAB]collocate[TAB]judge01[TAB]judge02[TAB]...[TAB]judgeNN[TAB]rank

where

wordclass = n|v|j (noun, verb, adjective)
freqency_band = common|mid|low
judgeXX = 0|1 (bad or good label from annotator XX)
rank = 

stoplist*
=========

List of words which were used for filtering out candidate collocations

goldset*
========

List of good collocation pairs based on annotations. A collocate is good if at least
N-1 annotators labeled it as 'good'. In format

headword[TAB]collocate

README
======

This file describing the package.

--------------------------------------------------------------------------------

You may ask questions about this package via email address
support@sketchengine.co.uk
