OPUS MontenegrinSubs parallel corpora

A parallel corpus of English–Montengrin subtitles

The OPUS MontenegrinSubs parallel corpus is a set of two text corpora (English and Montenegrin) made up of movie subtitles. The corpora have aligned paragraphs that allow searching and analysing translations between all the languages.

The parallel corpora were originally collected, prepared and aligned by Joerg Tiedermann in the OPUS project. Afterwards, Nikola Ljubešić prepared this corpus data for Sketch Engine. See more in his paper Opus-MontenegrinSubs 1.0: First electronic corpus of the Montenegrin language.

This parallel corpus can be searched and analysed monolingually or multilingually using all available tools in Sketch Engine: concordances, collocations, word lists and more. Both corpora English and Montenegrin contain part-of-speech tagging and lemmatization.

Bibliographic references

Reference

Jörg Tiedemann, 2012, Parallel Data, Tools and Interfaces in OPUS. [pdf] In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’2012).

Božović, P., Erjavec, T., Tiedemann, J., Ljubešić, N., & Gorjanc. V. (2018). Opus-MontenegrinSubs 1.0: First electronic corpus of the Montenegrin language. In Proceedings of the Conference on Language Technologies & Digital Humanities 2018 (pp. 24-28).

Search the OPUS MontenegrinSubs parallel corpora

Sketch Engine offers a range of tools to work with the English–Montenegrin parallel corpora.

open in Sketch Engine

Tip

Create your own multilingual or parallel corpora in Sketch Engine.

See our user guide.

More parallel corpora

DGT Translation Memory parallel corpora – European Union’s legislative documents

EUR-Lex 2/2016 parallel corpora – texts from the EUR-Lex database containing public EU documents

Eur-Lex judgments 12/2016 parallel corpora – judgments of the Court of Justice of the European Union

Europarl spoken parallel corpora – transcriptions of the European Parliament Proceedings

Open Parallel Corpus (OPUS) – translated texts from various sources, e.g. medical documents, subtitles, technical documentation, etc.

OpenSubtitles 2018 parallel corpora – movie subtitles from the OpenSubtitles database

United Nations Parallel Corpus (UNPC) – official records and other parliamentary documents of the United Nation

corpora in Sketch Engine

Use Sketch Engine in minutes

Generating collocations, frequency lists, examples in contexts, n-grams or extracting terms is easy with Sketch Engine. Use our Quick Start Guide to learn it in minutes.

Quick Start Guide

about Sketch Engine

A parallel corpus of English–Montengrin subtitles

Search the OPUS MontenegrinSubs parallel corpora

Tip

More parallel corpora

Use Sketch Engine in minutes

for learners of languages

A Course in Lexicography and Lexical Computing

term extraction

learn sketch engine