Language modeling for Turkish continuous speech recognition

Download

index.pdf

Date

2003

Author

Şahin, Serkan

Metadata

Show full item record

Item Usage Stats

300
views

0
downloads

This study aims to build a new language model for Turkish continuous speech recognition. Turkish is very productive language in terms of word forms because of its agglutinative nature. For such languages like Turkish, the vocabulary size is far from being acceptable from only one simple stem, thousands of new words can be generated using inflectional and derivational suffixes. In this work, word are parsed into their stem and endings. First of all, we consider endings as words and we obtained bigram probabilities using stem and endings. Then, bigram probabilities are obtained using only the stems. Single pass recognition was performed by using bigram probabilities. As a second job, two pass recognition was performed. Firstly, previous bigram probabilities were used to create word lattices. Secondly, trigram probabilities were obtained from a larger text. Finally, one-best results were obtained by using word lattices and trigram probabilities. All work is done in Hidden Markov Model Toolkit (HTK) environment, except parsing and network transforming.

Subject Keywords

Turkish language, Comparative and general grammar

URI

http://etd.lib.metu.edu.tr/upload/2/1223254/index.pdf
https://hdl.handle.net/11511/13780

Collections

Graduate School of Natural and Applied Sciences, Thesis

Suggestions

OpenMETU
Core

Large vocabulary continuous speech recognition for Turkish Using HTK Çömez, Murat Ali; Çiloğlu, Tolga; Department of Electrical and Electronics Engineering (2003) This study aims to build a new language model that can be used in a Turkish large vocabulary continuous speech recognition system. Turkish is a very productive language in terms of word forms because of its agglutinative nature. For such languages like Turkish, the vocabulary size is far from being acceptable. From only one simple stem, thousands of new word forms can be generated using inflectional or derivational suffixes. In this thesis, words are parsed into their stems and endings. One ending includes ...
Structural priming in Turkish genitive-possessive constructions Bahadır, Gözde; Hohenberger, Annette Edeltraud; Zeyrek Bozşahin, Deniz; Department of Cognitive Sciences (2012) This study addresses the question of the mental representation and processing of language by investigating “structural priming” in Turkish Genitive-Possessive (GEN-POSS) constructions. Structural priming is the facilitating effect of having already experienced a structural form on its subsequent processing. We investigate this phenomenon on a construction pair in Turkish, which shares the same external GEN-POSS morpho-syntactic template despite having distinct grammatical categories. The structures under sc...
Morphological processing of inflected and derived words in L1 Turkish and L2 English Şafak, Duygu Fatma; Kırkıcı, Bilal; Department of English Language Teaching (2015) The present study aims at examining how inflected and derived words are processed during the early stages of visual word recognition in a native language (L1) and in a second language (L2). A second aim of the study is to find out whether or not the semantic and surface-form properties of morphologically complex words affect early word recognition processes. Two masked priming experiments were conducted to investigate morphological processing in L1 Turkish and in L2 English. In the first experiment, 40 L1 s...
Syntactic priming of relative clause attachment in monolingual Turkish speakers and Turkish learners of English Başer, Zeynep; Hohenberger, Annette Edeltraud; Zeyrek Bozşahin, Deniz; Department of Cognitive Sciences (2018) The purpose of this study is to investigate the syntactic priming of relative clause attachment in monolingual Turkish speakers and Turkish learners of English with different levels of proficiency in English. Turkish and English belong to typologically different groups of languages. Within the scope of this study, we investigate syntactic priming of relative clause attachments, which enables us to examine and compare the strategies employed for ambiguity resolution both in Turkish and English. The data was ...
An Experimental study on acquisition of prepositions in English as a third language Çabuk, Sakine; Sağın Şimşek, Sultan Çiğdem; Gracanın Yüksek, Martına; Department of English Language Teaching (2016) This study explores the role of cross-linguistic influence in third language acquisition process by examining English adpositions. Comprehension, processing and production of English prepositions (in, on, at, behind, over, to) were examined through off-line and on-line data collection tasks to find out which of the two known languages (L1 or L2) is the major source of cross-linguistic influence on the acquisition of English (L3) adpositions given the fact that adpositions are morphologically and syntactical...

Citation Formats

S. Şahin, “Language modeling for Turkish continuous speech recognition,” M.S. - Master of Science, Middle East Technical University, 2003.