The Oxford English Corpus (OEC) consisted mainly of websites chosen in the way of presenting all types of English, from literary novels to everyday newspapers and the language of blogs and even social media. Besides UK and US English there are Englishes from Ireland, Australia, New Zealand, the Caribbean, Canada, India, Singapore, and South Africa. The last version of this corpus contains nearly 2.1 billion words (almost 2.5 billion tokens).

For more information visit Oxford Dictionaries’s website.

The corpus is supplied by Oxford University Press.

OED Access Policy

Access is restricted unless special permission is granted.

Permission from Oxford University Press is required to get access to the corpus. Researchers may contact . You need to include a brief summary of your research project. Please add a note you would like to access the corpus in Sketch Engine, including your user name in Sketch Engine. (This is a manual process that may take several days.)

Tools to work with the Oxford English Corpus

A complete set of tools is available to work with this English corpus to generate:

  • word sketch – English collocations categorized by grammatical relations
  • thesaurus – synonyms and similar words for every word
  • keywords – terminology extraction of one-word and multi-word units
  • word lists – lists of English nouns, verbs, adjectives etc. organized by frequency
  • n-grams – frequency list of multi-word units
  • concordance – examples in context
  • trends – diachronic analysis automatically identifies neologisms and changes in use
  • text type analysis – statistics of metadata in the corpus

v3 (February 2012)

  • “OEC + Biwec build v2” – size 2.073 billion words

Updates:

  • 2012-03-08 encoded, word sketches
  • 2011-04-05 doc.wordcount

v2 (January 2011)

  • size 2.008 billion words

Updates:

  • 2010-11-02 encoded, word sketches
  • 2011-03-05 doc.wordcount

v1 (2009)

  • size 1.736 billion words

Updates:

  • 2010-03-15 encoded
  • 2010-04-01 word sketches
  • 2011-03-05 doc.wordcount

Search the Oxford English Corpus

Sketch Engine offers a range of tools to work with this English Corpus.

concordance from Oxford English Corpus

or

Other English corpora

Explore our largest Timestamped English corpus with 70+ billion words.

Use Sketch Engine in minutes

Generating collocations, frequency lists, examples in contexts, n-grams or extracting terms. Use our Quick Start Guide to learn it in minutes.