A knowledge-poor pronoun resolution system for Turkish

Download
2005
Küçük, Dilek
This thesis presents a knowledge-poor pronoun resolution system for Turkish which resolves third person personal pronouns and possessive pronouns. The system is knowledge-poor in the sense that it makes use of limited linguistic and semantic knowledge to resolve the pronouns. As pronoun resolution proposals for languages like English, French and Spanish, the core of the system is the constraints and preferences which are determined empirically. The system has four modules: sentence splitting, pronoun extraction, forming the list of candidate antecedents and determination of the antecedent. It takes a Turkish text as input and rewrites this text with the considered pronouns replaced with their proposed antecedents. In order to compare the success rate of the system, two different baseline algorithms are implemented. The original system is tested against these baseline algorithms on two sample Turkish texts from different sources. Some suggestions to improve the success rate of the system and to extend the domain of the system are also presented.

Suggestions

Coupling speech recognition and rule-based machine translation
Köprü, Selçuk; Yazıcı, Adnan; Department of Computer Engineering (2008)
The objective of this thesis was to study the coupling of automatic speech recognition (ASR) systems with rule-based machine translation (MT) systems. In this thesis, a unique approach to integrating ASR with MT for speech translation (ST) tasks was proposed. The proposed approach is unique, essentially because it includes the rst rule-based MT system that can process speech data in a word graph format. Compared to other rule-based MT systems, our system processes both a word graph and a stream of words. Th...
A clustering method for the problem of protein subcellular localization
Bezek, Perit; Atalay, Mehmet Volkan; Department of Computer Engineering (2006)
In this study, the focus is on predicting the subcellular localization of a protein, since subcellular localization is helpful in understanding a protein’s functions. Function of a protein may be estimated from its sequence. Motifs or conserved subsequences are strong indicators of function. In a given sample set of protein sequences known to perform the same function, a certain subsequence or group of subsequences should be common; that is, occurrence (frequency) of common subsequences should be high. Our ...
A COMPARATIVE ANALYSIS OF ENGLISH LANGUAGE TEACHING (ELT) AND ENGLISH LANGUAGE AND LITERATURE (ELL) STUDENTS’ LEVEL OF RECOGNITION OF EXTENDED COMPLEX SENTENCES
Çağatay, Sibel (2019-12-01)
Complex sentences are formulated by combining an independent or a main clause with a dependent or a subordinate clause. This sentence type can be extended at the beginning, in the middle or at the end of the sentence, which makes these sentences even harder to recognize on the part of the learners. For this reason, the present study particularly aims at finding out to what extent first year English Language Education (ELT) students and English Language and Literature (ELL) students, as prospective language ...
Resolution of Dropped Pronouns in a Phrase Structure Grammar
Şehitoğlu, Onur Tolga (1997-08-11)
In this work intra-sentential resolution of Turkish dropped pronouns in a phrase structure grammar is studied. Turkish is a pro-drop and free constituent order language. The resolution scheme for dropped pronouns depends on the constituent order. We introduce resolution rules for different surface orders and propose an implementation for an HPSG based parser. Implementation is based on incremental processing of non-local referential index sets during parsing.
A hierarchical representation of form documents for identification and retrieval
Duygulu, Pinar; Atalay, Mehmet Volkan (Springer Science and Business Media LLC, 2003-12-01)
In this paper, we present a logical representation for form documents to be used for identification and retrieval. A hierarchical structure is proposed to represent the structure of a form by using lines and the XY-tree approach. The approach is top-down and no domain knowledge such as the preprinted data or filled-in data is used. Geometrical modifications and slight variations are handled by this representation. Logically identical forms are associated to the same or similar hierarchical structure. Identi...
Citation Formats
D. Küçük, “A knowledge-poor pronoun resolution system for Turkish,” M.S. - Master of Science, Middle East Technical University, 2005.