The Cundeelee Wangka Stories is a Cundeelee Wangka – English parallel corpus made up of texts provided by Goldfields Aboriginal Language Centre in Kalgoorlie, Australia. The corpus was created for the purpose of Lexicom workshop 2018. The Cundeelee Wangka language belongs to the aboriginal languages.
The English part of the corpus was tagged by TreeTagger using Penn TreeBank tagset with Sketch Engine modifications.
To get access, please contact access Sue Hanson from Kalgoorlie <email@example.com> and provide a brief description of the purpose of your work. If you request will be accepted, please contact us at firstname.lastname@example.org with including the confirmation from Sue Hanson and your username so that we could grant you access to this corpus.
Tools to work with the Cundeelee Wangka Stories corpus
A complete set of tools is available to work with this Cundeelee Wangka – English parallel corpus to generate:
keywords – terminology extraction of one-word and multi-word units
word lists – lists of nouns, verbs, adjectives etc. organized by frequency