The Yiddish Wikipedia Corpus (yiwiki) is a Yiddish corpus made up of texts collected from Yiddish internet encyclopedia Wikipedia in December 2018. The corpus consists of 2 million words.
A complete set of tools is available to work with this Wikipedia Yiddish corpus to generate:
Sketch Engine offers a range of tools to work with this Yiddish corpus from Wikipedia.
English Wikipedia corpus
Chinese Wikipedia corpus
Error corpus from English Wikipedia
Georgian Wikipedia corpus
Afrikaans Wikipedia corpus
We can build a Wikipedia corpus in any language for you. Please contact us.
Generate collocations, frequency lists, examples in contexts, n-grams or extract terms. Use our Quick Start Guide to learn it in minutes.