A gaze-centered multimodal approach to face-to-face interaction

Download

index.pdf

Date

2020

Author

Arslan Aydın, Ülkü

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

276
views

155
downloads

Face-to-face conversation implies that interaction should be characterized as an inherently multimodal phenomenon involving both verbal and nonverbal signals. Gaze is a nonverbal cue that plays a key role in achieving social goals during the course of conversation. The purpose of this study is twofold: (i) to examine gaze behavior (i.e., aversion and gaze on face) and relations between gaze and speech in face to face interaction, (ii) to construct computational models to predict gaze behavior using high-level speech features. We employed a job interview setting, where pairs (a professional interviewer and an interviewee) conducted mock job interviews. Twenty-eight pairs of native speakers took part in the experiment. Two eye-tracking glasses recorded the scene video, the audio and the eye gaze position of the participants. To achieve the first purpose, we developed an open-source framework, named MAGiC (A Multimodal Framework for Analyzing Gaze in Communication), for the analyses of multimodal data including video recording data for face tracking, gaze data from the eye trackers, and the audio data for speech segmentation. We annotated speech with two methods: (i) ISO 24617-2 Standard for Dialogue Act Annotation and, (ii) using tags employed by the previous studies that examined gaze behavior in a social context. We then trained simplified versions of two CNN architectures (VGGNet and ResNet) by using both speech annotation methods.

Subject Keywords

Interpersonal communication., Mobile Eye Tracking, face-to-face interaction, gaze analysis, ISO 24617-2 standard, CNN for time series

URI

http://etd.lib.metu.edu.tr/upload/12625103/index.pdf
https://hdl.handle.net/11511/45474

Collections

Graduate School of Informatics, Thesis

Suggestions

OpenMETU
Core

A gaze-centered multimodal approach to human-human social interaction Acartürk, Cengiz; Kalkan, Sinan (2017-06-23) This study aims at investigating gaze aversion behavior in human-human dyads during the course of a conversation. Our goal is to identify the parametric infrastructure, which will underlie the development of gaze behavior in Human Robot Interaction. We employed a job interview setting, where pairs (an interviewer and an interviewee) conducted mock job interviews. Three pairs of native speakers took part in the experiment. Two eye-tracking glasses recorded the scene video, the audio and the eye gaze position...
An Eye Tracking Analysis of Conversational Violations in Dyadic and Collaborative Interaction Cagiltay, Bengisu; Acartürk, Cengiz (2022-01-01) Linguistic principles are crucial in maintaining reliable and transparent communication for dyadic interactions. However, violating these principles might result in unwieldy and problematic communications. We use gaze as a medium to explore how visual attention and task performance changes when conversational violations occur. We conducted an eye-tracking study (N = 17) measuring changes in visual patterns in response to social communication errors, specifically Grice's Maxims violations. Our study investig...
The Processing of ambiguous morphemes in Turkish Ataman, Esra.; Kırkıcı, Bilal; Department of English Literature (2019) Studies investigating the processing of linguistic ambiguity have to date mostly focused on lexical ambiguity. Morphemic ambiguity, on the other hand, has been less frequently studied in spite of its cross-linguistic prevalence. An intermediate level of representation (i.e. the lemma level) between form and meaning has been claimed to successfully account for the processing of ambiguous morphemes in English and Chinese. Moreover, meaning frequency has been found to affect the processing of these morphemes. ...
The Role of symmetry and facial expressions of emotions in evaluation of attractiveness and perceived symmetry : an eye tracking study Hepsomalı, Pırıl; Gökçay, Didem; Department of Cognitive Sciences (2013) In social interaction, faces convey plenty of information such as gender, age, attractiveness and expressions of emotions. Amongst these cues, attractiveness and facial expressions of emotions are considered more substantial, since processing and evaluation of such information rapidly has adaptive relevance in order to avoid or approach. One of the indicators of attractiveness, symmetry, is preferred by many species and it is known that symmetrical faces are rated as more attractive by humans. Moreover, fac...
The Effect of intranasal oxytocin on pupil dilation during trustworthiness evaluation and facial expression recognition tasks Saraçaydın, Fatma Gülhan; Gökçay, Didem; Department of Cognitive Sciences (2015) Our ability to recognize facial expressions and emotions can be modulated by both external and internal factors. One of these internal factors is the neuropeptide “oxytocin”. Many studies have highlighted the involvement of oxytocin in recognition of facial expressions and approach-related trusting behaviors. In the current study, we investigated the effects of oxytocin on recognition accuracy and trustworthiness judgements using facial expressions. We used a subset of expressions and images from the Karoli...

Citation Formats

Ü. Arslan Aydın, “A gaze-centered multimodal approach to face-to-face interaction,” Thesis (Ph.D.) -- Graduate School of Informatics. Cognitive Sciences., Middle East Technical University, 2020.