The Project Gutenberg English corpus is a corpus made up of all English e-books available in the Gutenberg database in October 2014.
The Project Gutenberg English corpus was tagged by TreeTagger using Penn TreeBank tagset.
A complete set of tools is available to work with this Gutenberg corpus to generate:
Sketch Engine offers a range of tools to work with this Gutenberg corpus.
Sketch Engine offers 450+ language corpora.
Generate collocations, frequency lists, examples in contexts, n-grams or extract terms. Use our Quick Start Guide to learn it in minutes.