ESLORA: Corpus of the spoken Spanish

The ESLORA is a Spanish corpus consisting interviews and conversations with speakers from Galicia, recorded between 2001 and 2015. The corpus is suitable for analyzing the spoken part of the language.

The corpus is available in Sketch Engine thanks to University of Santiago de Compostela through the ESLORA projects.

More information about the corpus is available at the ESLORA project website: https://eslora.usc.es/

Part-of-speech tagset and lemmatization

The corpus uses XIADA POS tagging and lemmatization. Thereore, it uses different tags and is lemmatized differently than most other  Spanish corpora in Sketch Engine, namely the TenTen and Trends corpora. More information on the tagset is available at this website: https://eslora.usc.es/guide_tags

ESLORA corpus sizes

Number of words 750+ thousand
Number of tokens 910+ thousand
Number of sentences 60+ thousand
Number of documents 83

Search the ESLORA

Sketch Engine offers a range of tools to work with this Spanish corpus.

Tools to work with the ESLORA corpus from the web

A complete set of Sketch Engine tools is available to work with this Spanish corpus to generate:

  • word sketch – Spanish collocations categorized by grammatical relations
  • thesaurus – synonyms and similar words for every word
  • keywordsterminology extraction of one-word units
  • word lists – lists of Spanish nouns, verbs, adjectives etc. organized by frequency
  • n-grams – frequency list of multi-word units
  • concordance – examples in context
  • text type analysis – statistics of metadata in the corpus

ESLORA

  • published in Sketch Engine in February 2026

ESLORA: Corpus para el estudio del español oral >, versión 2.4 de noviembre de 2025, ISSN: 2444-1430.

Other Spanish corpora

Sketch Engine offers dozens of Spanish language corpora.

Use Sketch Engine in minutes

Generate collocations, frequency lists, examples in contexts, n-grams or extract terms. Use our Quick Start Guide to learn it in minutes.