The corpus was prepared using the Corpus Factory method. It contains 288 million words, is encoded in UTF-8, and has not yet been tagged.