Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement

Date

2018-11-01

Author

Coteli, Mert Burkay
Olgun, Orhun
Hacıhabiboğlu, Hüseyin

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Estimation of the direction-of-arrival (DOA) of sound sources is an important step in sound field analysis. Rigid spherical microphone arrays allow the calculation of a compact spherical harmonic representation of the sound field. The standard method for analyzing sound fields recorded using such arrays uses steered response power (SRP) maps, in which the source DOA is estimated as the steering direction that maximizes the output power of a maximally directive beam. This approach is computationally costly since it requires steering the beam in all possible directions. This paper presents an extension to SRP called steered response power density (SRPD) and an associated signal-adaptive search method called hierarchical grid refinement for reducing the number of steering directions needed for DOA estimation. The proposed method can localize near-coherent as well as incoherent sources while jointly providing the number of prominent sources in the scene. It is shown to be robust to reverberation and additive white noise. An evaluation of the proposed method using simulations and real recordings under highly reverberant conditions and a comparison with state-of-the-art methods are presented.
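
The cost argument in the abstract can be illustrated with a small sketch. It is not the SRPD or hierarchical grid refinement algorithm from the paper: it assumes a single source, replaces the beamformer output with a hypothetical smooth function (`toy_power`), and uses a generic coarse-to-fine search on an azimuth/elevation grid. All function names, the source direction, and the step sizes are assumptions made for illustration only.

```python
import math

def unit_vector(az, el):
    """Convert azimuth/elevation in radians to a Cartesian unit vector."""
    return (math.cos(el) * math.cos(az), math.cos(el) * math.sin(az), math.sin(el))

def angle_between(a, b):
    """Angle in radians between two (azimuth, elevation) directions."""
    u, v = unit_vector(*a), unit_vector(*b)
    dot = sum(x * y for x, y in zip(u, v))
    return math.acos(max(-1.0, min(1.0, dot)))

def toy_power(az, el, source=(1.0, 0.3), sharpness=8.0):
    """Hypothetical stand-in for beam output power: a single smooth peak at `source`."""
    return math.exp(sharpness * (math.cos(angle_between((az, el), source)) - 1.0))

def local_search(power, center, half_widths, step):
    """Steer to every grid point in a window; return the best direction and the evaluation count."""
    az0, el0 = center
    n_az = round(half_widths[0] / step)
    n_el = round(half_widths[1] / step)
    best_power, best_dir, n_evals = -math.inf, center, 0
    for i in range(-n_az, n_az + 1):
        for j in range(-n_el, n_el + 1):
            az = az0 + i * step
            # Keep elevation inside its valid range [-pi/2, pi/2].
            el = max(-math.pi / 2, min(math.pi / 2, el0 + j * step))
            p = power(az, el)
            n_evals += 1
            if p > best_power:
                best_power, best_dir = p, (az, el)
    return best_dir, n_evals

def coarse_to_fine(power, coarse_step, target_step):
    """Scan the whole sphere coarsely, then halve the step around the current best direction."""
    best, total = local_search(power, (0.0, 0.0), (math.pi, math.pi / 2), coarse_step)
    step = coarse_step
    while step > target_step:
        window, step = step, step / 2
        best, n = local_search(power, best, (window, window), step)
        total += n
    return best, total

if __name__ == "__main__":
    fine = math.radians(1)
    full_dir, n_full = local_search(toy_power, (0.0, 0.0), (math.pi, math.pi / 2), fine)
    ref_dir, n_ref = coarse_to_fine(toy_power, math.radians(15), fine)
    print("exhaustive evaluations:", n_full)
    print("coarse-to-fine evaluations:", n_ref)
    print("coarse-to-fine error (deg):", math.degrees(angle_between(ref_dir, (1.0, 0.3))))
```

The exhaustive scan evaluates the power at every direction of the fine grid, whereas the refined search only spends evaluations near the current maximum. This toy version relies on the power function having one smooth peak; handling multiple, possibly coherent sources and estimating their number is what the paper's method addresses and is outside the scope of this sketch.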

Subject Keywords

Source localization, Rigid spherical microphone arrays, Steered response power maps, Direction-of-arrival estimation

URI

https://hdl.handle.net/11511/30025

Journal

IEEE/ACM Transactions on Audio, Speech, and Language Processing

DOI

https://doi.org/10.1109/taslp.2018.2858932

Collections

Graduate School of Informatics, Article

Suggestions

Multiple Sound Source Localization with Rigid Spherical Microphone Arrays via Residual Energy Test Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2019-05-01) The estimation of the directions-of-arrival (DOAs) of multiple sound sources is a fundamental stage in acoustic scene analysis. Many application areas such as robot audition and object-based audio (OBA) broadcast require that DOA estimation is computationally efficient to allow real-time operation. We propose a new DOA estimation approach based on a sparse representation of recorded sound fields as a linear combination of spatially bandlimited impulses in this paper. The proposed algorithm operates on a tim...
Entropy-based direction-of-arrival estimation methods for rigid spherical microphone arrays Olgun, Orhun; Hacıhabiboğlu, Hüseyin; Department of Modeling and Simulation (2019) Direction-of-arrival (DOA) estimation of sound sources is a popular topic of research and has an important role in several different applications including spatial audio. Recent advances in microphone arrays made more accurate sound field analysis possible. Spherical microphone arrays afford a trivial calculation of spherical harmonic decomposition of sound fields and can be employed in different DOA estimation methods in spherical harmonics domain. This thesis proposes a novel DOA estimation method called ...
Spherical harmonics based acoustic scene analysis for object-based audio Çöteli, Mert Burkay; Hacıhabiboğlu, Hüseyin; Department of Information Systems (2021-2-19) Object-based audio relies on elemental audio signals from individual sound sources and their associated metadata to be reconstructed at the listener side. While defining audio objects in a production setting is straightforward, it is not trivial to extract audio objects from more realistic recording scenarios such as concerts. Thus, existing object-based audio standards also define scene-based formats alongside object-based representations that provide immersive audio, but without the flexibility provided by...
LOCALIZATION OF MULTIPLE SOURCES IN THE SPHERICAL HARMONIC DOMAIN WITH HIERARCHICAL GRID REFINEMENT AND EB-MUSIC Olgun, Orhun; Hacıhabiboğlu, Hüseyin (2018-09-20) Direction-of-arrival (DOA) estimation is an important step in acoustic scene analysis. Multiple signal classification in the eigenbeam domain (EB-MUSIC) is an accurate direction-of-arrival (DOA) estimation method for rigid spherical microphone arrays. Two important issues with this method are 1) the requirement of prior information about the number of coherent source components, and 2) its computational cost. In this paper, a computationally efficient two-stage method, which can alleviate these problems, is...
Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings Günel Kılıç, Banu; Hacıhabiboğlu, Hüseyin (2008-01-01) Microphone array signal processing techniques are extensively used for sound source localisation, acoustical characterisation and sound source separation, which are related to audio analysis. However, the use of microphone arrays for auralisation, which is generally related to synthesis, has been limited so far. This paper proposes a method for binaural auralisation of multiple sound sources based on blind source separation (BSS) and binaural audio synthesis. A BSS algorithm is introduced that exploits the ...

Citation Formats

M. B. Coteli, O. Olgun, and H. Hacıhabiboğlu, “Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, pp. 2215–2229, 2018. [Online]. Available: https://hdl.handle.net/11511/30025.