BAWE – British Academic Written English corpus

The British Academic Written English (BAWE) is a British Academic corpus of academic works written at universities in the UK. It represents a pattern of British Academic English with fairly evenly distributed disciplinary areas (Arts and Humanities, Social Sciences, Life Sciences and Physical Sciences) and levels of study (undergraduate and taught masters level).

The whole corpus consists of 2761 pieces of proficient assessed student writing with length in the range 500–5000 words.

Part-of-speech tagset

This English corpus contains POS tags used in the CLAWS tagger version 7.

Tools work with the British English corpus

A complete set of Sketch Engine tools is available to work with this BAWE corpus to generate:

  • word sketch – English collocations categorized by grammatical relations
  • thesaurus – synonyms and similar words for every word
  • keywords – terminology extraction of one-word and multi-word units
  • word lists – lists of English nouns, verbs, adjectives etc. organized by frequency
  • n-grams – frequency list of multi-word units
  • concordance – examples in context

For more information about citing the BAWE corpus, please visit

A list of all corpus metadata is available in the document BAWE Corpus Holdings.

Search the BAWE corpus

Sketch Engine offers a range of tools to work with the BAWE corpus.

Concordance from the BAWE corpus

Try a 30-day free trial


Other English corpora

Explore our largest Timestamped English corpus with 50+ billion words.

Use Sketch Engine in minutes

Generating collocations, frequency lists, examples in contexts, n-grams or extracting terms is easy with Sketch Engine. Use our Quick Start Guide to learn it in minutes.