Semantic classification and retrieval system for environmental sounds

2012
Okuyucu, Çiğdem
The growth of multimedia content in recent years has motivated research on audio classification and content retrieval. In this thesis, a general environmental audio classification and retrieval approach is proposed in which higher-level semantic classes (outdoor, nature, meeting and violence) are obtained from lower-level acoustic classes (emergency alarm, car horn, gun-shot, explosion, automobile, motorcycle, helicopter, wind, water, rain, applause, crowd and laughter). To classify an audio sample into acoustic classes, MPEG-7 audio features, Mel Frequency Cepstral Coefficients (MFCC) and Zero Crossing Rate (ZCR) features are used with Hidden Markov Model (HMM) and Support Vector Machine (SVM) classifiers. Additionally, a new method based on a Genetic Algorithm (GA) is proposed for the classification of semantic classes. Query by Example (QBE) and keyword-based query capabilities are implemented for content retrieval.
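As an illustration of the simplest feature named in the abstract, the sketch below computes a per-frame Zero Crossing Rate in plain Python. It uses one common textbook definition (the fraction of adjacent sample pairs whose signs differ); the thesis's exact definition, frame length and hop size are not given in this record, so the function names `zero_crossing_rate` and `frame_signal`, the sample rate and the synthetic test tones are all assumptions made for this example, not the author's implementation.

```python
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (assumed definition)."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def frame_signal(signal, frame_len, hop):
    """Split a signal into fixed-length frames; a trailing partial frame is dropped."""
    return [
        signal[start:start + frame_len]
        for start in range(0, len(signal) - frame_len + 1, hop)
    ]


# Hypothetical synthetic inputs: one second of a low and a high sine tone.
SAMPLE_RATE = 8000  # assumed value, not taken from the thesis
low_tone = [math.sin(2 * math.pi * 100 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]
high_tone = [math.sin(2 * math.pi * 1900 * n / SAMPLE_RATE) for n in range(SAMPLE_RATE)]

# One ZCR value per frame; such values could sit alongside MFCC or MPEG-7
# features in the vector passed to a classifier.
low_zcr = [zero_crossing_rate(f) for f in frame_signal(low_tone, 400, 200)]
high_zcr = [zero_crossing_rate(f) for f in frame_signal(high_tone, 400, 200)]
```

A higher-pitched or noisier sound changes sign more often, so its frames yield larger ZCR values than those of a low-pitched sound; this is why ZCR is a cheap complement to spectral features.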

Suggestions

Semantik video modeling and retrieval with visual, auditory, textual sources
Durak, Nurcan; Yazıcı, Adnan; Department of Computer Engineering (2004)
Studies on content-based video indexing and retrieval aim at accessing video content from different aspects more efficiently and effectively. Most of these studies have concentrated on the visual component when modeling and retrieving video content. Besides the visual component, much valuable information is also carried in other media components that accompany the pictorial component, such as superimposed text, closed captions, audio, and speech. In this study, semantic content of video is m...
Eye tracking in multimodal comprehension of graphs
Acartürk, Cengiz (2012-07-31)
Eye tracking methodology has been a major empirical research approach for the study of online comprehension processes in reading and scene viewing. The use of eye tracking methodology for the study of diagrammatic representations, however, has been relatively limited so far. The investigation of specific types of diagrammatic representations, such as statistical graphs, is even scarcer. In this study, we propose eye tracking as an empirical research approach for a systematic analysis of multimodal comprehensi...
Visual quality assessment for stereoscopic video sequences
Sarıkan, Selim Sefa; Akar, Gözde; Department of Electrical and Electronics Engineering (2011)
The aim of this study is to understand the effect of different depth levels on the overall 3D quality and to develop an objective video quality metric for stereoscopic video sequences. The proposed method is designed to be used in video coding stages to improve overall 3D video quality. This study includes both objective and subjective evaluation. Test sequences with different coding schemes are used. Computer simulation results show that overall quality has a strong correlation with the quality of the background,...
Mapping Human–Computer Interaction Research Themes and Trends from Its Existence to Today: A Topic Modeling-Based Review of past 60 Years
GÜRCAN, FATİH; Cagiltay, Nergiz Ercil; Çağıltay, Kürşat (Informa UK Limited, 2020-01-01)
Because it covers a wide spectrum, the research literature of human-computer interaction (HCI) studies has rich, multidisciplinary content, yet few studies demonstrate the big picture of the field. Such an analysis provides researchers with a better understanding of the field, revealing current issues, challenges, and potential research gaps. This study aims to explore the research trends in the developmental stages of the HCI studies over the past 60 years. Automated text mining with prob...
Accessing Science Through Media: Uses and Gratifications Among Fourth and Fifth Graders for Science Learning
BURAKGAZI, Sevinc Gelmez; Yıldırım, Ali (SAGE Publications, 2014-04-01)
This qualitative phenomenological study aims to investigate fourth and fifth graders' uses of mass media (TV, newspapers, Internet, magazines) and to assess their various features as sources for science learning. The data were collected from 47 purposefully selected students through focus groups and were analyzed through qualitative analysis using uses and gratifications theory as a conceptual framework. The results indicated that students were active in choosing and utilizing media to meet their cognitive,...
Citation Formats
Ç. Okuyucu, “Semantic classification and retrieval system for environmental sounds,” M.S. - Master of Science, Middle East Technical University, 2012.