A robust and time-efficient fingerprinting model for musical audio

Date

2006-07-01

Author

SERT, MUSTAFA
Baykal, Buyurman
Yazıcı, Adnan

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

116
views

0
downloads

An audio fingerprinting system deals with four challenging tasks: The robustness, the reliability, the compactness, and the scalability. By preserving the others, we explore the compactness and robustness aspects of audio fingerprinting systems and propose a description and storage model based on structural analysis of audio clips. The proposed method constructs the fingerprints from the most representative section of an audio clip. Contrary to similar studies, there is no need to construct and store all the fingerprints of each frame within the database; only one fingerprint per clip is sufficient. We make use of the Audio Spectrum Flatness (ASF) and the Audio Signature (AS) features of the MPEG-7 standard, which are new to the audio feature family and have not been considered as much as other feature types. The fingerprints are stored in the form of XML, thus providing the interoperability on a world-wide scale. XML-based representation of fingerprints is very suitable particularly for portable devices such as a PDA or a mobile phone due to the transportation issues. The proposed approach is evaluated on a test bed consisting of 540 musical clips based on the MPEG-7 features. The well known MFCC feature set is also considered in the experiments for the evaluation of features(1).

Subject Keywords

Audio excerpt analysis, Audio fingerprint, Audio spectrum flatness, Audio signature

URI

https://hdl.handle.net/11511/55836

Conference Name

10th IEEE International Symposium on Consumer Electronics (ISCE 2006)

Collections

Department of Electrical and Electronics Engineering, Conference / Seminar

Suggestions

OpenMETU
Core

A cross-layer protocol for wireless sensor networks Akyildiz, Ian F.; Vuran, Mehmet C.; Akan, Ozgur B. (2006-03-24) Severe energy constraints of battery-powered sensor nodes necessitate energy-efficient communication protocols in order to fulfill application objectives of wireless sensor networks (WSN). However, the vast majority of the existing solutions are based on classical layered protocols approach. It is much more resource-efficient to have a unified scheme which melts common protocol layer functionalities into a cross-layer module for resource-constrained sensor nodes. To the best of our knowledge, to date, there...
A Digital signal processor based development platform for thermal imagers Özkan, Mustafa; Beşikci, Cengiz; Tuncer, Temel Engin; Department of Electrical and Electronics Engineering (2003) The performance of thermal imaging systems depends on the sophistication level of the signal processing algorithms and the careful design of the required electronics. Digital Signal Processor (DSP) boards are very beneficial in designing such systems, and applying high level signal processing algorithms. The aim of this study is to design and implement a development platform for thermal imagers using a double floating point processor DSP board.A front-end electronics board has been employed as the interface...
AN AUTOMATIC MODE MATCHING SYSTEM FOR A HIGH Q-FACTOR MEMS GYROSCOPE USING A DECOUPLED PERTURBATION SIGNAL Yesil, F.; Alper, S. E.; Akın, Tayfun (2015-06-25) This paper reports a closed-loop controller system developed for in-run automatic matching of the drive and sense mode resonance frequencies of a MEMS gyroscope with a high quality factor (Q). This is achieved by injecting a perturbation signal to the quadrature cancellation loop, while keeping it decoupled from the angular rate control loop. The new controller is implemented in a CMOS ASIC together with the other sensor control loops, and it is verified to maintain matched-mode state under changing environ...
A Flexible and Scalable Audio Information Retrieval System for Mixed-Type Audio Signals Dogan, Ebru; SERT, MUSTAFA; Yazıcı, Adnan (Wiley, 2011-10-01) The content-based classification and retrieval of real-world audio clips is one of the challenging tasks in multimedia information retrieval. Although the problem has been well studied in the last two decades, most of the current retrieval systems cannot provide flexible querying of audio clips due to the mixed-type form (e.g., speech over music and speech over environmental sound) of audio information in real world. We present here a complete, scalable, and extensible content-based classification and retri...
A Reduced complexity ungerboeck type receiver for multi-code signaling in dispersive channels Güvensen, Gökhan Muzaffer; Tanık, Yalçın; Yılmaz, Ali Özgür; Department of Electrical and Electronics Engineering (2014) The main aim in this thesis is to propose multiple signaling waveforms (multi-code) based yet spectrally efficient modulation schemes and competent receiver architectures realizing soft-input-soft-output (SISO) detection. We search for generic suboptimal receiver architectures for Multi-Code Signaling (MCS), which can be represented as selection of one out of M waveforms per signaling interval. The proposed receiver architectures exhibit almost optimal performance at significantly reduced complexity in high...

Citation Formats

M. SERT, B. Baykal, and A. Yazıcı, “A robust and time-efficient fingerprinting model for musical audio,” presented at the 10th IEEE International Symposium on Consumer Electronics (ISCE 2006), St Petersburg, RUSSIA, 2006, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/55836.