Disambiguation in corpus of Modern Greek
DOI:
https://doi.org/10.36505/ExLing-2020/11/0027/000442Keywords:
language corpus, ambiguity, Modern Greek, automatic disambiguationAbstract
Corpus of Modern Greek appeared in 2011. All texts are morphologically annotated. Due to certain peculiarities of Modern Greek morphology, the majority of forms has more than one grammatic interpretation. In this presentation we describe the types of homonyms which are found in the Corpus and discuss possible patterns for automatic disambiguation. At the end, we mention a number of problematic cases that cannot be resolved now or require manual approach.
References
Arkhangelskiy, Timofey & Maxim Kisilier. 2018. Corpora of Modern Greek: achievements and goals (Корпуса греческого языка: достижения, цели и задачи). Indo-European Linguistics and Classical Philology 22(1), 50–59.
Kisilier, Maxim. 2020. Ότι, που and πως in Standard Modern Greek (Kомплементайзеры ότι, που и πως в новогреческом языке). Indo-European Linguistics and Classical Philology 24(1). 554–577.
Kuzmenko, Elizaveta & Elmira Mustakimova. 2015. Automatic disambiguation in the corpora of Greek and Yiddish. In Computational linguistics and intellectual technologies (Компьютерная лингвистика и интеллектуальные технологии: По материалам ежегодной Международной конференции «Диалог» (2015) 1.) Annual International Conference “Dialogue”. Vol. 1, 388–397. Moscow: Russian State University of Humanities.
Downloads
Published
Issue
Section
License
Articles are published under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.