Turkish text compression using Huffman coding

Download
1996
Çiftçisoy, Yasemin

Suggestions

Turkish speech corpora and recognition tools developed by porting SONIC: Towards multilingual speech recognition
Salor, Ozgul; Pellom, Bryan L.; Çiloğlu, Tolga; Demirekler, Mubeccel (2007-10-01)
This paper presents work on developing speech corpora and recognition tools for Turkish by porting SONIC, a speech recognition tool developed initially for English at the Center for Spoken Language Research of the University of Colorado at Boulder. The work presented in this paper had two objectives: The first one is to collect a standard phonetically-balanced Turkish microphone speech corpus for general research use. A 193-speaker triphone-balanced audio corpus and a pronunciation lexicon for Turkish have ...
Turkish clickbait detection in social media via machine learning algorithms
Genç, Şura; Sürer, Elif; Çakır, Murat Perit; Department of Cognitive Sciences (2021-8-26)
Clickbait strategy, mostly used in headlines and teaser messages, aims to attract people’s attention, and make them click on the link by using intriguing expressions with various text-related features. Clickbait, which has become very common especially in social media in recent years, is a major problem for the flow of information. Since the information promised in the clickbait headline is generally not included in the main text, clickbait headlines disappoint readers and is problematic for ethics of journ...
Turkish Discourse Bank: Porting a discourse annotation style to a morphologically rich language
Zeyrek Bozşahin, Deniz; Sevdik Çallı, Ayışığı B.; Çakıcı, Ruket (2013-05-01)
This paper briefly describes the Turkish Discourse Bank, the first publicly available annotated discourse resource for Turkish. It focuses on the challenges posed by annotating Turkish, a free word order language with rich inflectional and derivational morphology. It shows the usefulness of the PDTB style annotation but points out the need to expand this annotation style with the needs of the target language.
Turkish large vocabulary continuous speech recognition by using limited audio corpus
Susman, Derya; Yazıcı, Adnan; Köprü, Selçuk; Department of Computer Engineering (2012)
Speech recognition in Turkish Language is a challenging problem in several perspectives. Most of the challenges are related to the morphological structure of the language. Since Turkish is an agglutinative language, it is possible to generate many words from a single stem by using suffixes. This characteristic of the language increases the out-of-vocabulary (OOV) words, which degrade the performance of a speech recognizer dramatically. Also, Turkish language allows words to be ordered in a free manner, whic...
Turkish consumers' perceptions of environmental claims
Tarhan, Ayşe Buyçe; Çağlı, Uğur; Department of Business Administration (1996)
Citation Formats
Y. Çiftçisoy, “Turkish text compression using Huffman coding,” Middle East Technical University, 1996.