A Flexible and Scalable Audio Information Retrieval System for Mixed-Type Audio Signals

2011-10-01
Dogan, Ebru
SERT, MUSTAFA
Yazıcı, Adnan
The content-based classification and retrieval of real-world audio clips is one of the challenging tasks in multimedia information retrieval. Although the problem has been well studied in the last two decades, most of the current retrieval systems cannot provide flexible querying of audio clips due to the mixed-type form (e.g., speech over music and speech over environmental sound) of audio information in real world. We present here a complete, scalable, and extensible content-based classification and retrieval system for mixed-type audio clips. The system gives users an opportunity for flexible querying of audio data semantically by providing four alternative ways, namely, querying by mixed-type audio classes, querying by domain-based fuzzy classes, querying by temporal information and temporal relationships, and querying by example (QBE). In order to reduce the retrieval time, a hash-based indexing technique is introduced. Two kinds of experiments were conducted on the audio tracks of the TRECVID news broadcasts to evaluate the performance of the proposed system. The results obtained from our experiments demonstrate that the Audio Spectrum Flatness feature in MPEG-7 standard performs better in music audio samples compared to other kinds of audio samples and the system is robust under different conditions. (C) 2011 Wiley Periodicals, Inc.
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS

Suggestions

A Virtual reality-based training environment designed for hands-on experience of software development
Güleç, Ulaş; İşler, Veysi; Yılmaz, Murat; Department of Computer Engineering (2018)
This thesis study proposes an environment that provides an interactive virtual reality experience for individuals about the tasks of software development process starting from requirement analysis through software testing. The environment transports participants to the virtual world of a software development organization where they experience development problems. In this environment, the participant takes on the role of a novice software developer being recruited into a virtual software development organiz...
A simulation tool for mc6811
Sarıkan (Tuncer), Nazlı; Güran, Hasan; Department of Electrical and Electronics Engineering (2004)
The aim of this thesis study is to develop a simulator for an 8-bit microcontroller and the written document of this thesis study analyses the process of devoloping a software for simulating an 8 bit microcontroller, MC68HC11. In this simulator study a file processing including the parsing of the assembler code and the compilation of the parsed instructions is studied. Also all the instruction execution process containing the cycle and instruction execution and the interrupt routine execution is observed th...
An intelligent fuzzy object-oriented database framework for video database applications
Özgür, Nezihe Burcu; Yazıcı, Adnan; Department of Computer Engineering (2007)
Video database applications call for flexible and powerful modeling and querying facilities, which require an integration or interaction between database and knowledge base technologies. It is also necessary for many real life video database applications to incorporate uncertainty, which naturally occurs due to the complex and subjective semantic content of video data. In this thesis study, firstly, a fuzzy conceptual data model is introduced to represent the semantic content of video data. UML (Unified Mod...
A temporal neural network model for constructing connectionist expert system knowledge bases
Alpaslan, Ferda Nur (Elsevier BV, 1996-04-01)
This paper introduces a temporal feedforward neural network model that can be applied to a number of neural network application areas, including connectionist expert systems. The neural network model has a multi-layer structure, i.e. the number of layers is not limited. Also, the model has the flexibility of defining output nodes in any layer. This is especially important for connectionist expert system applications.
A fuzzy petri net model for intelligent databases
Bostan, Burçin; Yazıcı, Adnan; Department of Computer Engineering (2005)
Knowledge intensive applications require an intelligent environment, which can perform deductions in response to user queries or events that occur inside or outside of the applications. For that, we propose a Fuzzy Petri Net (FPN) model to represent the knowledge and the behavior in an intelligent object-oriented database environment, which integrates fuzzy, active and deductive rules with database objects. By gaining intelligent behaviour, the system maintains objects to perceive dynamic occurences and use...
Citation Formats
E. Dogan, M. SERT, and A. Yazıcı, “A Flexible and Scalable Audio Information Retrieval System for Mixed-Type Audio Signals,” INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, pp. 952–970, 2011, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/43057.