AVISION Audio and visual attention models applied to 2D and 3D audio-visual content

2011-08-29
Just, N.
Laabs, M.
Unver, E.
Günel Kılıç, Banu
Worrall, S.
Kondoz, A.M.
Within this paper a highly flexible approach for audio and visual attention modeling is presented. The developed system aims to be widely adaptable for different application scenarios within multimedia processing and coding. Possible use cases are presented and their influence on the system concept is shown. Furthermore the development of an attention model within the EU-funded research project DIOMEDES is described. This project focuses on developing a system for hybrid delivery of 3D stereoscopic and multi-view content to the homes through multiple transmission paths. The attention model, which is based on the framework presented, is used to enhance the content encoding process. This publication gives an overview over system design aspects as well as algorithms used for attention modeling.
2011 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Suggestions

Occlusion-aware 3-D multiple object tracking for visual surveillance
Topçu, Osman; Alatan, Abdullah Aydın; Ercan, Ali Özer; Department of Electrical and Electronics Engineering (2013)
This thesis work presents an occlusion-aware particle filter framework for online tracking of multiple people with observations from multiple cameras with overlapping fields of view for surveillance applications. Surveillance problem involves inferring motives of people from their actions, deduced from their trajectories. Visual tracking is required to obtain these trajectories and it is a challenging problem due to motion model variations, size and illumination changes and especially occlusions between mov...
Photometric stereo considering highlights and shadows
Büyükatalay, Soner; Halıcı, Uğur; Birgül, Özlem; Department of Electrical and Electronics Engineering (2011)
Three dimensional (3D) shape reconstruction that aims to reconstruct 3D surface of objects using acquired images, is one of the main problems in computer vision. There are many applications of 3D shape reconstruction, from satellite imaging to material sciences, considering a continent on earth or microscopic surface properties of a material. One of these applications is the automated firearm identification that is an old, yet an unsolved problem in forensic science. Firearm evidence matching algorithms rel...
3D Face Reconstruction Using Stereo Images and Structured Light
OZTURK, Ahmet Oguz; Halıcı, Uğur; ULUSOY PARNAS, İLKAY; AKAGUNDUZ, Erdem (2008-04-22)
In this paper, the 3D face scanner that we developed using stereo cameras and structured light together is presented. Structured light having a pattern of vertical lines is used to create feature points and to match them easily. 3D point cloud obtained by stereo analysis is post processed to obtain the 3D model in obj format.
Virtual reality in requirement analysis for CIM system development suitable for SMEs
Erenay, O; Hashemipour, M; Kayaligil, S (2002-10-15)
This paper presents a methodology, based on Virtual Reality (VR), for representing a manufacturing system in order to help with the requirement analysis (RA) in CIM system development, suitable for SMEs. The methodology can reduce the costs and the time involved at this stage by producing precise and accurate plans, specification requirements, and a design for CIM information systems. These are essentials for small and medium scale manufacturing enterprises. Virtual Reality is computer-based and has better ...
Looking through the model’s eye: A systematic review of eye movement modeling example studies
Tunga, Yeliz; Cagiltay, Kursat (2023-01-01)
Eye movement modeling examples (EMME) are novel types of video modeling examples that contain additional eye-movement recordings of the model to provide attentional guidance. Increasing demand in using instructional videos and interest in using eye-tracking in education makes EMME an appealing research subject. Hence, this study aims to systematically review empirical studies employed EMME to synthesize extant literature and reveal literature gaps for further studies. Thirty one peer-reviewed studies contai...
Citation Formats
N. Just, M. Laabs, E. Unver, B. Günel Kılıç, S. Worrall, and A. M. Kondoz, “AVISION Audio and visual attention models applied to 2D and 3D audio-visual content,” presented at the 2011 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB), Nuremberg, Germany, 2011, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/31352.