ruSkELL: Russian Corpus for SkELL
Russian Corpus for SkELL is a Russian corpus specially built up for Rusian SkELL interface (ruSkELL) available at http://ruskell.sketchengine.co.uk/run.cgi/skell. The corpus does not contain whole documents but only sentences sorted according to their text quality. This score was computed by the GDEX system.
This corpus consists of texts (99.8 %) come from the Russian top-level domain .ru, the most frequent web domains are kontrolnaja.ru, news.yandex.ru, alterauto.ru, pressarchive.ru and com.sibpress.ru covering just 0.09 % off all corpus documents.
These sources provide a good example of how Russian is used in everyday, standard, formal and professional context almost 1 billion words in more than 68 million sentences.