European Literary Text Collection Corpora — ELTeC

The European Literary Text Collection (ELTeC) is a collection of corpora consisting of novels created under the COST Action “Distant Reading for European Literary History” (CA16204). It aims to provide comparable collections of around 100 novels per language, originally published between 1840 and 1920, in a growing number of European languages. Each language-specific corpus follows unified but flexible selection criteria. The ELTeC corpus is especially valuable for literary scholars, linguists, digital humanists, and anyone interested in comparative literary studies, diachronic language variation, or historical text mining.

More information at: https://zenodo.org/communities/eltec/records

Licence: CC BY 4.0

Search the European Literary Text Collection Corpora

Sketch Engine offers a range of tools to work with these corpora.

Tools to work with the European Literary Text Collection Corpora

A complete set of Sketch Engine tools is available to work with these corpora to generate:

  • word sketchcollocations categorized by grammatical relations
  • thesaurus – synonyms and similar words for every word
  • keywordsterminology extraction of one-word and multi-word units
  • word lists – lists of nouns, verbs, adjectives etc. organized by frequency
  • n-grams – frequency list of multi-word units
  • concordance – examples in context
  • text type analysis – statistics of metadata in the corpus

ELTeC (August 1)

  • in 14 languages
  • Licence: CC BY 4.0

European Literary Text Collection (ELTeC),
version 1.1.0, April 2021,
edited by Carolin Odebrecht,
Lou Burnard and Christof Schöch.
COST Action Distant Reading for European Literary History (CA16204).
DOI: doi.org/10.5281/zenodo.4662444.

Other English corpora

Explore our largest English Trends corpus with 85+ billion words.

Use Sketch Engine in minutes

Generate collocations, frequency lists, examples in contexts, n-grams or extract terms. Use our Quick Start Guide to learn it in minutes.