A Turkish Database For Psycholonguistic Studies

2016-11-01
Acar, Elif Ahşen
Zeyrek Bozşahin, Deniz
Kurfalı, Murathan
Bozşahin, Hüseyin Cem
This study primarily aims to build a Turkish psycholinguistic database including three variables: word frequency, age of acquisition (AoA), and imageability, where AoA and imageability information are limited to nouns. We used a corpus-based approach to obtain information about the AoA variable. We built two corpora: a child literature corpus (CLC) including 535 books written for 3-12 years old children, and a corpus of transcribed children’s speech (CSC) at ages 1;4-4;8. A comparison between the word frequencies of CLC and CSC gave positive correlation results, suggesting the usability of the CLC to extract AoA information. We assumed that frequent words of the CLC would correspond to early acquired words whereas frequent words of a corpus of adult language would correspond to late acquired words. To validate AoA results from our corpus-based approach, a rated AoA questionnaire was conducted on adults. Imageability values were collected via a different questionnaire conducted on adults. We conclude that it is possible to deduce AoA information for high frequency words with the corpus-based approach. The results about low frequency words were inconclusive, which is attributed to the fact that corpus-based AoA information is affected by the strong negative correlation between corpus frequency and rated AoA.
Language Resources and Evaluation (23 Mayıs 2016)

Suggestions

A Turkish database for psycholinguistic studies based on frequency age of acquisition and imageability
Acar, Elif Ahsen; Zeyrek Bozşahin, Deniz; Kurfalı, Murathan; Bozşahin, Hüseyin Cem (2016-05-13)
This study primarily aims to build a Turkish psycholinguistic database including three variables: word frequency, age of acquisition (AoA), and imageability, where AoA and imageability information are limited to nouns. We used a corpus-based approach to obtain information about the AoA variable. We built two corpora: a child literature corpus (CLC) including 535 books written for 3-12 years old children, and a corpus of transcribed children’s speech (CSC) at ages 1;4-4;8. A comparison between the word frequ...
Adapting and testing psycholinguistic toolboxes for Turkish visual word recognition studies
Erten, Begüm; Bozşahin, Hüseyin Cem; Zeyrek Bozşahin, Deniz; Department of Cognitive Sciences (2013)
This study presents two different software programs to be used in Turkish visual word recognition studies: KelimetriK and Wuggy with a Turkish plug-in extension. KelimetriK is a query-based software program developed as part of this thesis. KelimetriK provides word and bi-gram/tri-gram frequencies, orthographic neighborhood (ON), orthographic relatedness (transposed letter similarity and subset/superset similarity) and OLD20 (orthographic Levensthein Distance 20) scores. Wuggy is a pseudoword (i.e. wordlike...
A note on the contact between Kurmanji Kurdish and Turkish at lexical and morphological level
Çabuk Ballı, Sakine (SAGE Publications, 2019-08-01)
Turkish-Kurdish social setting where the Turkish and Kurdish languages are in contact for a long time induces borrowing and change at different levels.This study explores the contact between Kurmanji Kurdish and Turkish that take place at both morphological and lexical level. The data consist of three hours of recordings of family talks on the phone. Corpus analysis of data obtained from audio and video recordings of a family talk on the phone was done. Preliminary findings revealed that verbs are borrowed ...
Measuring age of information on real-life connections
Beytur, Hasan Burhan; Baghaee, Sajjad; Uysal, Elif (2019-04-01)
Age of Information (AoI) is a relatively new metric to measure freshness of networked application such as real-time monitoring of status updates or control. The AoI metric is discussed in the literature mainly in a theoretical way. In this work, we want to point out the issues related to the measuring AoI-related values, such as synchronization and calculation of the values. We discussed the effect of synchronization error in the measurement and a solution for calculating an estimate of average AoI without ...
A comparative evaluation of XML repositories
Ünal, Özgül; Doğaç, Asuman; Department of Information Systems (2002)
Recently XML has established itself as the standard for representing data in scientific and business applications. Starting out as a standard data exchange format over the web, it has become instrumental in all kinds of applications. Almost all standardization efforts on the web today are based on XML. As a consequence, the amount of XML data being stored and processed is large and will be increasing at a very rapid rate. This has caused XML data management to become a focus of research efforts in the datab...
Citation Formats
E. A. Acar, D. Zeyrek Bozşahin, M. Kurfalı, and H. C. Bozşahin, “A Turkish Database For Psycholonguistic Studies,” Portoroz, Slovenya, 2016, vol. 10, p. 3600, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/84842.