Note: This entry is for the type of token. For the positional attribute, see word form.
A word is a type of token. Words are tokens which begin with a letter of the alphabet. All tokens in a corpus are divided into two groups: words and nonwords.
The regular expression Sketch Engine users to identify words is [[:alpha:]].*
Compare to nonword.