A parallel corpus of English–Montengrin subtitles
The OPUS MontenegrinSubs parallel corpus is a set of two text corpora (English and Montenegrin) made up of movie subtitles. The corpora have aligned paragraphs that allow searching and analysing translations between all the languages.
The parallel corpora were originally collected, prepared and aligned by Joerg Tiedermann in the OPUS project. Afterwards, Nikola Ljubešić prepared this corpus data for Sketch Engine. See more in his paper Opus-MontenegrinSubs 1.0: First electronic corpus of the Montenegrin language.
This parallel corpus can be searched and analysed monolingually or multilingually using all available tools in Sketch Engine: concordances, collocations, word lists and more. Both corpora English and Montenegrin contain part-of-speech tagging and lemmatization.
Jörg Tiedemann, 2012, Parallel Data, Tools and Interfaces in OPUS. [pdf] In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’2012).
Božović, P., Erjavec, T., Tiedemann, J., Ljubešić, N., & Gorjanc. V. (2018). Opus-MontenegrinSubs 1.0: First electronic corpus of the Montenegrin language. In Proceedings of the Conference on Language Technologies & Digital Humanities 2018 (pp. 24-28).
Search the OPUS MontenegrinSubs parallel corpora
Sketch Engine offers a range of tools to work with the English–Montenegrin parallel corpora.
Create your own multilingual or parallel corpora in Sketch Engine.
See our user guide.
Use Sketch Engine in minutes
Generating collocations, frequency lists, examples in contexts, n-grams or extracting terms is easy with Sketch Engine. Use our Quick Start Guide to learn it in minutes.