
Corpus info page — corpus statistics and details

The corpus info page contains information and detailed statistics about the whole corpus.

To reach the corpus info page, follow these steps:

  • select the corpus if it is not selected already
  • go to the corpus dashboard
  • click CORPUS INFO
    You can also click the info icon next to the name of the corpus at the top of each screen.

[Screenshot of the corpus info page with numbered areas 1 to 9, described below]

1

The name of the corpus.

2

Corpus description.

3

The language of the corpus, together with links to a web page with more information about the corpus, to the tagset, and to the sketch grammar.

4

The total counts of items in the corpus. The available information differs between corpora: for example, a corpus without paragraphs shows no information about paragraphs.

5

A list of some part-of-speech tags used in the corpus.

6

Lempos suffixes used in the corpus.
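As a rough illustration of what a lempos suffix is, the sketch below assumes that a lempos is a lemma joined to a short part-of-speech suffix by a hyphen (for example, "house-n"). The function name split_lempos and the sample values are hypothetical and only serve the example; the suffixes actually used are the ones listed for the corpus.

```python
def split_lempos(lempos: str) -> tuple[str, str]:
    """Split a lempos into (lemma, suffix), assuming a 'lemma-suffix' shape."""
    # Split on the LAST hyphen so that hyphenated lemmas stay intact.
    lemma, separator, suffix = lempos.rpartition("-")
    if not separator:
        # No hyphen at all: treat the whole string as a lemma without a suffix.
        return lempos, ""
    return lemma, suffix


# Hypothetical sample values, not taken from any real corpus.
for item in ["house-n", "house-v", "well-known-j"]:
    print(item, "->", split_lempos(item))
```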

7

The number of unique items in the corpus. Each item is counted only once, even if it appears in the corpus many times.
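The difference between a total count and a unique count can be sketched as follows. This is a minimal illustration on a hypothetical, already tokenized toy text; it is not how the corpus info page computes its figures.

```python
from collections import Counter

# Hypothetical toy text, already split into items; not from any real corpus.
tokens = ["the", "cat", "sat", "on", "the", "mat", "the", "end"]

frequencies = Counter(tokens)

total_items = len(tokens)        # every occurrence is counted
unique_items = len(frequencies)  # each distinct item is counted once

print("total:", total_items)
print("unique:", unique_items)
```

Here "the" occurs three times, so it adds three to the total count but only one to the unique count.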

8

Shows the number of structures in the corpus and structural attributes (metadata).

9

A list of subcorpora (user or preloaded) available in the corpus with information about their sizes. The sizes are estimates only.