This blog post defines what POS tags are, explains manual and automatic tagging and points readers to Sketch Engine where they can have their texts tagged automatically in many languages. What are POS tags? POS tags (or part-of-speech tags) are special labels assigned to each token (word) in a text corpus to indicate the part […]
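To make the idea of one label per token concrete, here is a minimal sketch in Python. The sentence, the hand-assigned tags and the helper name `tokens_with_tag` are all assumptions made for illustration. The labels follow the Universal POS naming style, which is not necessarily the tagset Sketch Engine uses for a given language.

```python
# Each token is paired with exactly one part-of-speech label.
# Tags are assigned by hand here; an automatic tagger would produce them instead.
tagged = [
    ("The", "DET"),
    ("cat", "NOUN"),
    ("sleeps", "VERB"),
    (".", "PUNCT"),
]

def tokens_with_tag(tagged_tokens, tag):
    """Return the tokens that carry the given POS tag (hypothetical helper)."""
    return [token for token, t in tagged_tokens if t == tag]

nouns = tokens_with_tag(tagged, "NOUN")
```

Once a corpus is stored this way, searching by part of speech becomes a simple filter over the tags rather than over the word forms.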
Term extraction, or terminology extraction, is an automatic method of analysing text to identify phrases that fulfil the criteria for terms. It is used in translation and terminology management, and also in text analytics, where it supports topic modelling, data mining and information retrieval from unstructured text.
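As a rough illustration of what "identifying phrases which fulfil the criteria for terms" can mean, the sketch below counts repeated word pairs that contain no stopwords. This is a deliberately naive frequency heuristic, not the method Sketch Engine uses. The stopword list, the function name `candidate_terms` and its parameters are assumptions.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption, far from complete).
STOPWORDS = {"the", "a", "an", "of", "in", "is", "and", "to", "for"}

def candidate_terms(text, n=2, min_count=2):
    """Return n-word phrases without stopwords that occur at least min_count times."""
    words = re.findall(r"[a-z]+", text.lower())
    # Build overlapping n-grams by zipping the word list against shifted copies of itself.
    ngrams = zip(*(words[i:] for i in range(n)))
    counts = Counter(g for g in ngrams if not any(w in STOPWORDS for w in g))
    return [" ".join(g) for g, c in counts.most_common() if c >= min_count]

sample = "Term extraction is useful. Term extraction helps translation."
terms = candidate_terms(sample)
```

Real term extractors typically add grammatical criteria (for example, patterns over POS tags) and compare frequencies against a general reference corpus. The pure frequency filter above only shows the basic shape of the task.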
The best term extraction (Michal Cukr, 2018-01-16 17:34:14, 2018-02-12 17:12:39)
A corpus is a very large collection of text that is used, together with suitable corpus management software such as Sketch Engine, to learn how language is used. It has become an indispensable tool for modern linguists and lexicographers. A text corpus can consist of only one very long […]