text type analysis

Text type analysis – statistics of metadata in the corpus

The Text type analysis tool in Sketch Engine shows breakdown by metadata. For example, you can see how many documents, tokens or words there are in the corpus in texts downloaded from each website, written by each author or published in each year. The choice of options depends on the metadata available in the corpus.

The tool allows switching between sizes in the number of structures, tokens or words.

The filter allows to only display values starting, finishing or containing certain characters.

The tool is available from the corpus dashboard dashboard or by using the shortcut g + a (Go to Text type Analysis).

This screenshot shows the statistics of websites. These metadata are attached to the structure, i.e. documents. The sizes are shown in words and only websites containing ‘blog’ are shown.

The number of values in the chart can be controlled in settings and the pie chart can be downloaded.

The list of values (here websites) and their sizes contain local menus more_horiz to quickly jump to the concordance related to the website.

.av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-c865ede747bf61937831d7bad7e12d75{ top:21.7%; left:25.4%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-c865ede747bf61937831d7bad7e12d75 .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-c865ede747bf61937831d7bad7e12d75 .av-image-hotspot-pulse{ background-color:#b02b2c; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-6-665a56419ec2db24f93e85fbe72bd9dd{ top:50.6%; left:23.1%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-6-665a56419ec2db24f93e85fbe72bd9dd .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-6-665a56419ec2db24f93e85fbe72bd9dd .av-image-hotspot-pulse{ background-color:#b02b2c; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-5-7421f01e8b1ee8e041fb0695866d83d7{ top:57.4%; left:23.2%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-5-7421f01e8b1ee8e041fb0695866d83d7 .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-5-7421f01e8b1ee8e041fb0695866d83d7 .av-image-hotspot-pulse{ background-color:#b02b2c; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-4-62a824c5c7b24a48349a11977fecf3bc{ top:65%; left:23.1%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-4-62a824c5c7b24a48349a11977fecf3bc .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-4-62a824c5c7b24a48349a11977fecf3bc .av-image-hotspot-pulse{ background-color:#b02b2c; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-1-6333bb11f4bd12ad727f9da3e11bac0a{ top:71.9%; left:24.7%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-1-6333bb11f4bd12ad727f9da3e11bac0a .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-1-6333bb11f4bd12ad727f9da3e11bac0a .av-image-hotspot-pulse{ background-color:#b02b2c; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-3-538b06172ea32884ad5d429987777a9c{ top:6.7%; left:92.4%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-3-538b06172ea32884ad5d429987777a9c .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-3-538b06172ea32884ad5d429987777a9c .av-image-hotspot-pulse{ background-color:#b02b2c; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-2-de0dde37737af3ce8de742b370a93da8{ top:6.7%; left:96%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-2-de0dde37737af3ce8de742b370a93da8 .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-2-de0dde37737af3ce8de742b370a93da8 .av-image-hotspot-pulse{ background-color:#b02b2c; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-1-1-ef80295bf934cf29cf2608ffd1990e1f{ top:95.7%; left:52.5%; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-1-1-ef80295bf934cf29cf2608ffd1990e1f .av-image-hotspot_inner{ background-color:#edae44; color:#000000; } .av-hotspot-image-container .av-image-hotspot.av-km4p2hgb-1-1-ef80295bf934cf29cf2608ffd1990e1f .av-image-hotspot-pulse{ background-color:#b02b2c; }
<div class='av-image-hotspot av-km4p2hgb-c865ede747bf61937831d7bad7e12d75 av-image-hotspot-1 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='left' data-avia-tooltip-class='av-tt-xlarge-width av-tt-pos-above av-tt-align-left av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

Available metadata and corpus structures to which the metadata are attached.

'>
1
<div class='av-image-hotspot av-km4p2hgb-6-665a56419ec2db24f93e85fbe72bd9dd av-image-hotspot-2 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='left' data-avia-tooltip-class='av-tt-xlarge-width av-tt-pos-above av-tt-align-left av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

Sizes can be shown as the number of structures or the number of tokens.

'>
2
<div class='av-image-hotspot av-km4p2hgb-5-7421f01e8b1ee8e041fb0695866d83d7 av-image-hotspot-3 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='left' data-avia-tooltip-class='av-tt-xlarge-width av-tt-pos-above av-tt-align-left av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

Statistics can be displayed from the whole corpus or only one subcorpus.

'>
3
<div class='av-image-hotspot av-km4p2hgb-4-62a824c5c7b24a48349a11977fecf3bc av-image-hotspot-4 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='left' data-avia-tooltip-class='av-tt-xlarge-width av-tt-pos-above av-tt-align-left av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

Results can be filtered by characters they contain.

'>
4
<div class='av-image-hotspot av-km4p2hgb-1-6333bb11f4bd12ad727f9da3e11bac0a av-image-hotspot-5 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='left' data-avia-tooltip-class='av-tt-xlarge-width av-tt-pos-above av-tt-align-left av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

The link to the Concordance frequency tool.

'>
5
<div class='av-image-hotspot av-km4p2hgb-3-538b06172ea32884ad5d429987777a9c av-image-hotspot-6 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='right' data-avia-tooltip-class='av-tt-large-width av-tt-pos-above av-tt-align-right av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

Chart options.

'>
6
<div class='av-image-hotspot av-km4p2hgb-2-de0dde37737af3ce8de742b370a93da8 av-image-hotspot-7 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='right' data-avia-tooltip-class='av-tt-large-width av-tt-pos-above av-tt-align-right av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

Download chart.

'>
7
<div class='av-image-hotspot av-km4p2hgb-1-1-ef80295bf934cf29cf2608ffd1990e1f av-image-hotspot-8 ' data-avia-tooltip-position='top' data-avia-tooltip-alignment='left' data-avia-tooltip-class='av-tt-xlarge-width av-tt-pos-above av-tt-align-left av-mobile-fallback-active main_color av-tt-hotspot' data-avia-tooltip='

This local menu displays the related concordance.

'>
8