Idioms as multi-word expressions in Turkish

Download
2020-10
Güven, Arzu Burcu
Idioms constitute several challenges for both Natural Language Processing (NLP) and linguistic analysis. A better understanding of idioms will yield valuable insights about natural language as well as the way it is processed. The relevance of idioms, along with the fact that Turkish is a rather unexplored language from this perspective, motivates us to work on Turkish idioms. Here, we aim to demonstrate a grammatical study on Turkish idioms that were selected in accordance with distributional models.

Suggestions

Paracompositionality, MWEs and argument substitution
Bozşahin, Hüseyin Cem (2018-08-11)
Multi-word expressions, verb-particle constructions, idiomatically combining phrases, and phrasal idioms have something in common: not all of their elements contribute to the argument structure of the predicate implicated by the expression. Radically lexicalized theories of grammar that avoid string-, term-, logical form-, and tree-writing, and categorial grammars that avoid wrap operation, make predictions about the categories involved in verb-particles and phrasal idioms. They may require singleton types,...
An examination of quantifier scope ambiguity in Turkish
Kurt, Kürşad; Bozşahin, Hüseyin Cem; Department of Cognitive Sciences (2006)
This study investigates the problem of quantifier scope ambiguity in natural languages and the various ways with which it has been accounted for, some of which are problematic for monotonic theories of grammar like Combinatory Categorial Grammar (CCG) which strive for solutions that avoid non-monotonic functional application, and assume complete transparency between the syntax and the semantics interface of a language. Another purpose of this thesis is to explore these proposals on examples from Turkish and...
Usage disambiguation of Turkish discourse connectives
Başıbüyük, Kezban; Zeyrek Bozşahin, Deniz (2023-01-01)
This paper describes a rule-based approach and a machine learning approach to disambiguate the discourse usage of Turkish connectives, which not only has single and phrasal connectives as most languages do, but also suffixal connectives that largely correspond to subordinating conjunctions in English. Since these connectives have different linguistic characteristics, two sets of linguistic rules are devised to disambiguate their discourse usage. The linguistic rules are used in the rule-based approach and e...
Twitter Sentiment Analysis Experiments Using Word Embeddings on Datasets of Various Scales
Arslan, Yusuf; Kucuk, Dilek; Birtürk, Ayşe Nur (2018-06-15)
Sentiment analysis is a popular research topic in social media analysis and natural language processing. In this paper, we present the details and evaluation results of our Twitter sentiment analysis experiments which are based on word embeddings vectors such as word2vec and doc2vec, using an ANN classifier. In these experiments, we utilized two publicly available sentiment analysis datasets and four smaller datasets derived from these datasets, in addition to a publicly available trained vector model over ...
Identification of Discourse Relations in Turkish Discourse Bank
Kutlu, Ferhat; Zeyrek Bozşahin, Deniz; Department of Cognitive Sciences (2023-1-25)
Discourse is the level of language where linguistic units are organized in a structured and coherent way. One of the major problems in the field of discourse in particular, and NLU in general is how to build better models to sense the way constitutive units of discourse stick together to form a coherent whole. The discourse would be coherent if it had meaningful connections between its parts. Discourse relations, i.e., semantic or pragmatic relations between discourse units (clauses or sentences), are one o...
Citation Formats
A. B. Güven, “Idioms as multi-word expressions in Turkish,” M.S. - Master of Science, Middle East Technical University, 2020.