DISTRIBUTIONAL INVESTIGATION OF SOME FREQUENT TURKISH DERIVATIONAL AFFIXES FOR EXPLORING THEIR SEMANTICS

2021-7-14
Özdemir, Gizem Nur
In agglutinating languages such as Turkish, the process of derivation is mostly performed by adding suffixes at the end of words. Most of the derivational suffixes carry a distinctive semantic content and representing them has an important role in computational tasks, such as question answering. In this thesis, we aim to explore the structure of some frequent Turkish derivational suffixes in distributional vector space by clustering word embedding vectors of them and analyzing their underlying semantic properties. Suffix vectors are obtained by subtracting the vector of the base form of the derived word from the derived word’s word vector. We used a pre-trained word embedding model for obtaining word vectors and multiple unsupervised clustering algorithms with different parameters for clustering them. Our assumption is if a derivational suffix category manages to dominate one or more clusters, it is possible to obtain reliable representations of it in the distributional vector space. Our results show that many Turkish derivational suffix categories have this capability. We analyzed the underlying semantic structure of the generated clusters in terms of the thematic roles the suffixes are selecting, the UCCA labels and the UD relations the stem and the derived word can get.

Suggestions

Idioms as multi-word expressions in Turkish
Güven, Arzu Burcu; Bozşahin, Hüseyin Cem; Department of Cognitive Sciences (2020-10)
Idioms constitute several challenges for both Natural Language Processing (NLP) and linguistic analysis. A better understanding of idioms will yield valuable insights about natural language as well as the way it is processed. The relevance of idioms, along with the fact that Turkish is a rather unexplored language from this perspective, motivates us to work on Turkish idioms. Here, we aim to demonstrate a grammatical study on Turkish idioms that were selected in accordance with distributional models.
Morphological priming in Turkish nominal compound processing
Özer, Sibel; Hohenberger, Annette Edeltraud; Department of Cognitive Sciences (2010)
Compounding, constructing new words out of previously known words by means of simple concatenation mostly, can be counted as one of the major word production mechanisms in the majority of languages. Their importance in the history of human languages warrants a detailed study with respect to the language faculty and related cognitive aspects. In the last decade, compound production as well as comprehension have become highly debated and investigated areas of research. Morphological priming is one frequently ...
Power of frequencies : n-grams and semi-supervised morphological segmentation in Turkish
Kılıç, Özkan; Bozşahin, Hüseyin Cem; Department of Cognitive Sciences (2013)
Turkish is an agglutinating language with a non-rigid word order. When communicating, the word internal structure in Turkish is required to be segmented because Turkish morphosyntax is tortuous and it plays a central role in semantic analysis. Distinguishing a sub-word unit actually means performing a morph segmentation task, which is accomplished by children at an astonishing success rate. In this study, morph segmentation of Turkish words was demonstrated with a semi-supervised Hidden Markov Model, which ...
Usage disambiguation of Turkish discourse connectives
Başıbüyük, Kezban; Zeyrek Bozşahin, Deniz (2023-01-01)
This paper describes a rule-based approach and a machine learning approach to disambiguate the discourse usage of Turkish connectives, which not only has single and phrasal connectives as most languages do, but also suffixal connectives that largely correspond to subordinating conjunctions in English. Since these connectives have different linguistic characteristics, two sets of linguistic rules are devised to disambiguate their discourse usage. The linguistic rules are used in the rule-based approach and e...
Preserved morphological processing in heritage speakers: Evidence from a masked priming study on Turkish
Jacob, Gunnar; Safak, Duygu Fatma; Demir, Orhan; Kırkıcı, Bilal (null; 2017-11-08)
In a masked morphological priming experiment, we compared the processing of derived and inflected morphologically complex Turkish words in heritage speakers of Turkish living in Berlin and in native speakers of Turkish raised and living in Turkey. The results show significant derivational and inflectional priming effects of a similar magnitude in the heritage group and the control group. For both participant groups, semantic and orthographic control conditions indicate that these priming effects are genuine...
Citation Formats
G. N. Özdemir, “DISTRIBUTIONAL INVESTIGATION OF SOME FREQUENT TURKISH DERIVATIONAL AFFIXES FOR EXPLORING THEIR SEMANTICS,” M.S. - Master of Science, Middle East Technical University, 2021.