Do many Sketch Engine users create their own corpora?

How popular is it to create your own corpus in Sketch Engine? And which languages are most popular among Sketch Engine users? Find out from this infographic.

We added up the sizes of all user corpora in Sketch Engine, and here are the results.

Hover your mouse over the infographic to see more information. Scroll your mouse wheel to enlarge it.

Create your own corpus from the web now!

More concordance context switch
CQL builder for corpus queries

CQL builder

Quick Tip
Announcement of adding Belarusian corpus to Sketch Engine
Screenshot of word sketch from frTenTen French corpus
Amharic corpus

Better Danish

Brexit Corpus

news: parallel corpora

N’ko corpus

Sketch Engine CQL calendar

Calendar 2017

Audio recordings for the British National Corpus (BNC)

BNC audio

improved functionality for Bulgarian text
improved Thai support
Logo of SDL – Sketch Engine SDL Trados Studio plugin
Logo of Sketch Engine – a tool for discovering how language works

Prices for Academic Individual Users

map_period = {"year" : 12, "quarter" :…

Dutch Web Corpus

This corpus was created within the Corpus Factory project as…

CLAWS tagset – mapping file

C8 to C7 mapping file. NS 2011-5-14. APPGE -> APPGE: possessive…

Feed Corpus Project

FCP corpus aims to be a million word per day collection of POS-tagged…

The New Corpus for Ireland | Nua-Chorpas na hÉireann

The New Corpus for Ireland – user’s guide Welcome…

Icelandic sample corpus

This is a small corpus of Icelandic texts prepared for the Sketch…

Renaming Sketch Grammar relations

CD to directory which contains the compiled corpus files. cd…

Adding sentence boundaries to a compiled corpus

This document explains how structures, such as documents, paragraph,…

Compatibility Matrix

This page provides a compatibility matrix of Sketch Engine components…

Sketch Engine API for IntelliWebSearch

Sketch Engine is a corpus manager tool offering many corpus linguistics…

Building word sketches from parsed corpora

Introduction Sketch Engine usually generates word sketches using…

Word Sketches definition files

The following files can be used for building word sketches in…

Word Sketch Index Format

This page is a brief overview of the development of the word…

Highlight Only Part of a Complex Query

I want to align a concordance according to a part of the query.…

Sketch Engine Localisation

The Sketch Engine interface can be translated into any other…

JSON API – creating query

Sketch Engine uses HTTP REST API. All API methods (unless stated…

Full Administration

This feature is available only for local installations (see the…

czes corpus

CZES is a Czech corpus consisting of newspaper articles and magazine…

Scottish Gaelic Wiki corpus

Scottish Gaelic Wikipedia corpus. Downloaded in February 2015.…

Polish Web Corpus (PolishWaC)

Polish web as corpus has 103 million words and the encoding is…

Parallel Corpora Registry Info

General Attribute Set ATTRIBUTE word STRUCTURE s{ ATTRIBUTE…

Fryske Akademy Parallel Corpus

Frisian and Dutch not POS tagged aligned sentences Dutch…

NepaliWaC corpus

Nepali web corpus downloaded by LCL on Dec 10, 2014. ~1200…

SetswanaWaC corpus

(version 2) The corpus is prepared by Corpus factory method.…

SpanishWaC corpus

This corpus was gathered using a list of URLs provided by Serge…

SwedishWaC corpus

The corpus is prepared by Corpus factory method. Full details…

SDeWaC corpus

SDeWaC is a subset of DeWaC. The creation of sDeWaC is described…

WelshWaC corpus

The corpus is prepared by Corpus factory method by Anil in October…

ThaiWaC corpus

The corpus is prepared by Corpus factory method. Full details…

UKWaCsst corpus

UKWaC tagged with SuperSenseTagger (sst-light) described in…

Gujarati web corpus (guWaC)

GuWac web as corpus is a corpus of Gujarati language (Indo-Aryan…

Patakis corpus

Patakis is a 100 million word collection of POS-tagged texts…

FinnishWaC corpus

Finnish web as corpus.

danishWaC corpus

The corpus is prepared by Corpus factory method. It has 288 million…

Domain Specific Corpora

These corpora are prepared from specific domains, e.g. science,…

e-flux corpus

The e-flux corpus is a web corpus of English art news digests.…

Nineteenthcentury corpus

Actually, the 19th century corpus is only available to Osnabrück…

Clustering

Clustering can be performed in Sketch Engine on the similar…

Manual for GDEX

To quickly start using Good Dictionary EXamples, see the GDEX…

Dynamic Functions

Please read first about what dynamic attributes are and how they…

Corpus Factory Method

This page contains information about a corpus building method…

New Model Corpus

The New model Corpus is a ~100 million words domain corpus built…

LEXMCI

The 1.7 billion word LEXMCI corpus of English was created by…

Corpus configuration example

If your vertical text contains only words and no annotation,…

Preparing a Text Corpus for Sketch Engine: Overview

This page describes how to prepare a text corpus for indexation…

Sketch Engine Video Tutorials

All videos are also accessible on our YouTube channel. Please…

Common corpus structures

It is generally practical to divide a corpus into smaller parts…

Variation in hit counts

It often seems like you have got a different hit count for the…

Adam Kilgarriff: Structured bibliography

(note: written by Adam Kilgarriff on 27th April 2015; see also…
SkE research

Research Agenda

Lexical Computing's research interests lie at the intersection…

Adam’s blog


Happy New Year!