ESLORA: Corpus of Spoken Spanish
ESLORA is a corpus of spoken Spanish consisting of interviews and conversations with speakers from Galicia, recorded between 2001 and 2015. The corpus is suitable for analyzing the spoken variety of the language.
The corpus is available in Sketch Engine thanks to the University of Santiago de Compostela, through the ESLORA project.
More information about the corpus is available at the ESLORA project website: https://eslora.usc.es/
Part-of-speech tagset and lemmatization
The corpus uses XIADA POS tagging and lemmatization. Therefore, its tags and lemmas differ from those of most other Spanish corpora in Sketch Engine, namely the TenTen and Trends corpora. More information on the tagset is available at: https://eslora.usc.es/guide_tags
ESLORA corpus sizes
| Number of words | 750+ thousand |
| Number of tokens | 910+ thousand |
| Number of sentences | 60+ thousand |
| Number of documents | 83 |
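The gap between the word count and the token count in the table is typical of tokenized corpora, where punctuation marks are usually counted as tokens but not as words. The following is a minimal sketch of that distinction, assuming a word is any token containing at least one letter; the sample tokens and the function name are hypothetical, and Sketch Engine's exact counting rules may differ.

```python
# Hypothetical, already-tokenized sample; not taken from ESLORA.
SAMPLE_TOKENS = ["bueno", ",", "pues", "no", "sé", "."]


def count_tokens_and_words(tokens):
    """Return (token_count, word_count) for a list of tokens.

    Assumption: a "word" is a token with at least one alphabetic
    character, so punctuation-only tokens are excluded.
    """
    words = [t for t in tokens if any(ch.isalpha() for ch in t)]
    return len(tokens), len(words)


token_count, word_count = count_tokens_and_words(SAMPLE_TOKENS)
print(f"tokens={token_count} words={word_count}")
```

In this toy sample the two punctuation marks account for the whole difference between the two counts.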
Search the ESLORA corpus
Sketch Engine offers a range of tools to work with this Spanish corpus.
Tools to work with the ESLORA corpus from the web
A complete set of Sketch Engine tools is available to work with this Spanish corpus to generate:
- word sketch – Spanish collocations categorized by grammatical relations
- thesaurus – synonyms and similar words for every word
- keywords – terminology extraction of one-word units
- word lists – lists of Spanish nouns, verbs, adjectives etc. organized by frequency
- n-grams – frequency list of multi-word units
- concordance – examples in context
- text type analysis – statistics of metadata in the corpus
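To make two of the tools listed above more concrete, here is a minimal Python sketch of what an n-gram frequency list and a concordance (keyword in context) compute. It is an illustration only, not Sketch Engine's implementation or API; the sample sentence, the function names, and the `width` parameter are all hypothetical.

```python
from collections import Counter

# Hypothetical, already-tokenized sample; not taken from ESLORA.
SAMPLE = "pues no sé , yo creo que sí , yo creo que no".split()


def ngram_frequencies(tokens, n):
    """Count every run of n consecutive tokens."""
    grams = zip(*(tokens[i:] for i in range(n)))
    return Counter(" ".join(gram) for gram in grams)


def concordance(tokens, keyword, width=3):
    """Return (left context, keyword, right context) for each hit."""
    hits = []
    for i, token in enumerate(tokens):
        if token == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, token, right))
    return hits


# Most frequent three-token sequences in the sample.
for gram, freq in ngram_frequencies(SAMPLE, 3).most_common(3):
    print(freq, gram)

# Every occurrence of "creo" with two tokens of context on each side.
for left, keyword, right in concordance(SAMPLE, "creo", width=2):
    print(f"{left:>10} [{keyword}] {right}")
```

The other tools in the list (word sketch, thesaurus, keywords) rely on grammatical annotation and statistical scoring that a short sketch like this does not attempt to reproduce.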
Changelog
ESLORA
- published in Sketch Engine in February 2026
Bibliography
ESLORA: Corpus para el estudio del español oral




