prevertical file

A prevertical file is a pain text file that contains the corpus text and structures. Usually, it is a source file for creating vertical files which are created by the tokenization process from the prevertical.

An example of a prevertical file with corpus structures for documents, chapters, and paragraphs:

<doc genre="fiction" title="1984" author="G. Orwell">
<chapter no="1.1">
<p>
It was a bright cold day in April, and the clocks were striking thirteen. Winston Smith, his chin nuzzled into his breast in an effort to escape the vile wind, slipped quickly through the glass doors of Victory Mansions, though not quickly enough to prevent a swirl of gritty dust from entering along with him.
</p>
<p>
The hallway smelt of boiled cabbage and old rag mats. 
</p>
</chapter>
</doc>

Related topic: prepare a text for the vertical format.