We continue improving tools for processing languages. Greek corpora now have lemmatization and part-of-speech tagging available and they are tokenized better.
Author Archive for: michal
About Michal Cukr
This author has yet to write their bio.Meanwhile lets just say that we are proud Michal Cukr contributed a whooping 49 entries.
Entries by Michal Cukr
We have improved tools for processing Danish. Danish corpora are lemmatized, part-of-speech tagged and tokenized better.
The latest data until September 2017 have been added to The Timestamped JSI web corpora. Data in all 18 languages are updated with new data monthly and the smaller parts of corpora daily.
Find out how the writing in the main English newspapers has changed over the past two decades. Use diachronic analysis in the SiBol: English Broadsheet Newspapers 1993–2013 corpus.
A new corpus of academic English is now available in Sketch Engine. The corpus was collected from the database of open access journals, the Directory of Open Access Journals (DOAJ), and is comprised of 2.6 billion words.
The LexiCon Research Group at the University of Granada developed and provided their highly specialised English EcoLexicon corpus built up of environmental texts. The corpus is hosted as an open corpus and is freely accessible even without a Sketch Engine account.
We have doubled the size of the SiBol corpus, a 650-million-word collection of English Broadsheet Newspapers 1993–2013 documenting the language of English journalism.
A new Brexit corpus has been added to Sketch Engine. It is a collection of texts about the UK referendum on the withdrawal the United Kingdom from the European Union.
Discovering English with Sketch Engine by James Thomas has had an update.
Do you want to become a professional lexicographer or to learn about new trends in lexicography or meet other enthusiastic lexicographers from all over the world or all of these? Then join us in our masterclass workshop Lexicom, this year both in USA (June) and Europe (July). Register now at www.lexmasterclass.com!