A knowledge-poor pronoun resolution system for Turkish

Download
2005
Küçük, Dilek
This thesis presents a knowledge-poor pronoun resolution system for Turkish which resolves third person personal pronouns and possessive pronouns. The system is knowledge-poor in the sense that it makes use of limited linguistic and semantic knowledge to resolve the pronouns. As pronoun resolution proposals for languages like English, French and Spanish, the core of the system is the constraints and preferences which are determined empirically. The system has four modules: sentence splitting, pronoun extraction, forming the list of candidate antecedents and determination of the antecedent. It takes a Turkish text as input and rewrites this text with the considered pronouns replaced with their proposed antecedents. In order to compare the success rate of the system, two different baseline algorithms are implemented. The original system is tested against these baseline algorithms on two sample Turkish texts from different sources. Some suggestions to improve the success rate of the system and to extend the domain of the system are also presented.

Suggestions

Coupling speech recognition and rule-based machine translation
Köprü, Selçuk; Yazıcı, Adnan; Department of Computer Engineering (2008)
The objective of this thesis was to study the coupling of automatic speech recognition (ASR) systems with rule-based machine translation (MT) systems. In this thesis, a unique approach to integrating ASR with MT for speech translation (ST) tasks was proposed. The proposed approach is unique, essentially because it includes the rst rule-based MT system that can process speech data in a word graph format. Compared to other rule-based MT systems, our system processes both a word graph and a stream of words. Th...
A clustering method for the problem of protein subcellular localization
Bezek, Perit; Atalay, Mehmet Volkan; Department of Computer Engineering (2006)
In this study, the focus is on predicting the subcellular localization of a protein, since subcellular localization is helpful in understanding a protein’s functions. Function of a protein may be estimated from its sequence. Motifs or conserved subsequences are strong indicators of function. In a given sample set of protein sequences known to perform the same function, a certain subsequence or group of subsequences should be common; that is, occurrence (frequency) of common subsequences should be high. Our ...
Resolution of Dropped Pronouns in a Phrase Structure Grammar
Şehitoğlu, Onur Tolga (1997-08-11)
In this work intra-sentential resolution of Turkish dropped pronouns in a phrase structure grammar is studied. Turkish is a pro-drop and free constituent order language. The resolution scheme for dropped pronouns depends on the constituent order. We introduce resolution rules for different surface orders and propose an implementation for an HPSG based parser. Implementation is based on incremental processing of non-local referential index sets during parsing.
A hierarchical representation of form documents for identification and retrieval
Duygulu, Pinar; Atalay, Mehmet Volkan (Springer Science and Business Media LLC, 2003-12-01)
In this paper, we present a logical representation for form documents to be used for identification and retrieval. A hierarchical structure is proposed to represent the structure of a form by using lines and the XY-tree approach. The approach is top-down and no domain knowledge such as the preprinted data or filled-in data is used. Geometrical modifications and slight variations are handled by this representation. Logically identical forms are associated to the same or similar hierarchical structure. Identi...
A proposed model for Turkish land forces command software intensive systems acquisition process
Bilir, Nevzat; Bilgen, Semih; Department of Information Systems (2002)
Turkish Land Forces Command uses information technology opportunities to satisfy its needs arising from the obligation for being a strong army.One of the main difficulties in using information technologies is the acquisition of these systems. This thesis reviews the fundamentals of software intensive systems acquisition process and then proposes a new model for Turkish Land Forces Command.Software acquisition process models are discussed and compared with each other.The current Turkish Land Forces Com...
Citation Formats
D. Küçük, “A knowledge-poor pronoun resolution system for Turkish,” M.S. - Master of Science, Middle East Technical University, 2005.