Sketch Engine allows users to build corpora from their own documents. It is not uncommon that users have their corpus data in multiple files and want to upload all of them at the same time. Unfortunately, this is not supported. There are a few possible solutions:
- To upload multiple documents in an archive file (.zip, .tar, .tar.gz, and .tar.bz2).
- It is also possible to upload them one by one.
- Otherwise, convert all documents to plain text, concatenating them to a single file and upload only the concatenated file. Structural XML-like mark-up is supported for uploaded plain text files. This can be used for marking document boundaries and/or adding metadata about various parts of the text. Example: