Musical instrument recognition with wavelet envelopes

Date

2002-09-16

Author

Hacıhabiboğlu, Hüseyin

Metadata

Show full item record

Item Usage Stats

165
views

0
downloads

Automatic recognition of instrument type from raw audio data containing monophonic music is a fundamental problem for audio content analysis. There are many methods for the solution of this problem, which use common spectro-temporal properties like cepstral coefficients or spectral envelopes. A new method for instrument recognition utilising short-time amplitude envelopes of wavelet coefficients as feature vectors is presented. The classification engine is a distinctively small multilayer perceptron (MLP) network. A correct classification rate which is comparable to previously reported correct classification rates is attained for a set of three instruments containing flute, clarinet and trumpet.

URI

https://hdl.handle.net/11511/70965
http://www.sea-acustica.es/fileadmin/publicaciones/Sevilla02_mus01007.pdf

Conference Name

Forum Acusticum (16 - 20 Eylül 2002)

Collections

Graduate School of Informatics, Conference / Seminar

Suggestions

OpenMETU
Core

Instrument based wavelet packet decomposition for audio feature extraction Hacıhabiboğlu, Hüseyin (null; 2001-09-10) Feature extraction from audio data is a major concern in computer assisted music applications and content based audio retrieval. For general non-stationary signals, wavelet packet decomposition is used with entropy functions for best basis search. Musical instruments have well defined frequency ranges. Thus when audio data containing a solo instrument is concerned, wavelet packet decomposition may be adapted to that instrument's individual characteristics. The method discussed in this paper uses a number of...
Spherical harmonics based acoustic scene analysis for object-based audio Çöteli, Mert Burkay; Hacıhabiboğlu, Hüseyin; Department of Information Systems (2021-2-19) Object-based audio relies on elemental audio signals from individual sound sources and their associated metadata to be reconstructed at the listener side. While defining audio objects in a production setting is straightforward, it is not trivial to extract audio objects from more realistic recording scenarios such as concerts. Thus, existing object-based audio standards also define scene-based formats alongside objectbased representations that provide immersive audio, but without the flexibility provided by...
Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings Günel Kılıç, Banu; Hacıhabiboğlu, Hüseyin (2008-01-01) Microphone array signal processing techniques are extensively used for sound source localisation, acoustical characterisation and sound source separation, which are related to audio analysis. However, the use of microphone arrays for auralisation, which is generally related to synthesis, has been limited so far. This paper proposes a method for binaural auralisation of multiple sound sources based on blind source separation (BSS) and binaural audio synthesis. A BSS algorithm is introduced that exploits the ...
Perceptual Soundfield Reconstruction in Three Dimensions via Sound Field Extrapolation Erdem, Ege; De Sena, Enzo; Hacıhabiboğlu, Hüseyin; Cvetkovic, Zoran (2019-05-01) Perceptual sound field reconstruction (PSR) is a spatial audio recording and reproduction method based on the application of stereophonic panning laws in microphone array design. PSR allows rendering a perceptually veridical and stable auditory perspective in the horizontal plane of the listener, and involves recording using nearcoincident microphone arrays. This paper extends the PSR concept to three dimensions using sound field extrapolation carried out in the spherical-harmonic domain. Sound field render...
3D perceptual soundfield reconstruction via sound field extrapolation Erdem, Eg; Hacıhabiboğlui Hüseyin.; Department of Multimedia Informatics (2020) Perceptual sound field reconstruction (PSR) is a spatial audio recording and reproduction method based on the application of stereophonic panning laws in microphone array design. PSR allows rendering a perceptually veridical and stable auditory perspective in the horizontal plane of the listener, and involves recording using nearcoincident microphone arrays. This thesis extends the two dimensional PSR concept to three dimensions and allows reconstructing an arbitrary sound field based on measurements with a...

Citation Formats

H. Hacıhabiboğlu, “Musical instrument recognition with wavelet envelopes,” presented at the Forum Acusticum (16 - 20 Eylül 2002), Sevilla, İspanya, 2002, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/70965.