Corpus info page – corpus statistics and details
The corpus info page contains information and detailed statistics about the whole corpus.
To display the corpus info page:
- click the info_outline icon next to the name of the corpus at the top center of each screen
OR
- go to the corpus dashboard storage
- click CORPUS INFO

The name of the corpus.
Technical name (unique identifier), only needed when using the API and in Lexonomy. Also useful to distinguish corpora with identical names.
Corpus description.
Information about the language of the corpus, the links to the webpage with the corpus information, tagset, sketch and term grammars.
The total numbers found in the corpus. Available information differs between corpora – a corpus without paragraphs has no info about them.
The number of unique items in the corpus. Each is counted only once even if it appeared in the corpus many times.
Shows the number of structures in the corpus and structural attributes (metadata).
A list of some part-of-speech tags used in the corpus.
Lempos suffixes used in the corpus.
A list of subcorpora (user or preloaded) available in the corpus with information about their sizes. The sizes are estimates only.
Aligned languages table lists corpora that can be used in parallel with the selected corpus. This only applies for parallel corpora.