Building a Multilingual Outlier Detection Dataset for the Evaluation of Distributional Thesauri and Word Embeddings