Multiple Sound Source Localization with Rigid Spherical Microphone Arrays via Residual Energy Test

Coteli, Mert Burkay
Hacıhabiboğlu, Hüseyin
The estimation of the directions-of-arrival (DOAs) of multiple sound sources is a fundamental stage in acoustic scene analysis. Many application areas such as robot audition and object-based audio (OBA) broadcast require DOA estimation to be computationally efficient enough for real-time operation. In this paper, we propose a new DOA estimation approach based on a sparse representation of recorded sound fields as a linear combination of spatially bandlimited impulses. The proposed algorithm operates on a time-frequency representation of the spherical harmonic components of the sound field. We describe a residual energy test that can identify time-frequency bins with a single active source. DOA estimation is carried out at each selected time-frequency bin by seeking the single-source dictionary atom that best matches the steered response function calculated at that bin. We demonstrate the accuracy of the proposed method via a set of emulations using acoustic impulse responses measured in a highly reverberant room.
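
The single-source check described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the Fibonacci sampling grid, the function names, the threshold value, and the normalised-correlation matching rule are all assumptions made for illustration. The sketch relies on the spherical harmonic addition theorem, by which an order-N bandlimited impulse depends only on the angle Θ between the look direction and the source direction, as the sum over n = 0..N of (2n+1)/(4π) P_n(cos Θ).

```python
import numpy as np
from numpy.polynomial.legendre import legval


def fibonacci_grid(num_points):
    """Quasi-uniform unit vectors on the sphere (assumed grid, for illustration)."""
    i = np.arange(num_points) + 0.5
    polar = np.arccos(1.0 - 2.0 * i / num_points)
    azimuth = np.pi * (1.0 + 5.0 ** 0.5) * i
    return np.stack(
        [np.sin(polar) * np.cos(azimuth),
         np.sin(polar) * np.sin(azimuth),
         np.cos(polar)],
        axis=1,
    )


def bandlimited_impulses(grid, order):
    """Dictionary whose column k is an order-limited impulse centred on grid[k].

    Each atom is sampled on the same grid using the Legendre series
    sum_n (2n + 1) / (4 pi) * P_n(cos(angle)).
    """
    coeffs = (2.0 * np.arange(order + 1) + 1.0) / (4.0 * np.pi)
    cosines = np.clip(grid @ grid.T, -1.0, 1.0)  # guard against rounding
    return legval(cosines, coeffs)


def residual_energy_test(response, dictionary, threshold=0.1):
    """Match one atom to a steered response and test the leftover energy.

    Returns (index of best atom, residual-to-total energy ratio, accepted flag).
    The threshold is a hypothetical value, not one taken from the paper.
    """
    norms = np.linalg.norm(dictionary, axis=0)
    correlations = dictionary.conj().T @ response
    best = int(np.argmax(np.abs(correlations) / norms))
    # Least-squares gain of the best atom, then subtract its contribution.
    gain = correlations[best] / norms[best] ** 2
    residual = response - gain * dictionary[:, best]
    ratio = float(np.linalg.norm(residual) ** 2 / np.linalg.norm(response) ** 2)
    return best, ratio, ratio < threshold
```

In this sketch, a response generated by a single atom leaves essentially no residual and the bin is accepted, with the index of the best atom giving the DOA estimate on the grid. A sum of two well-separated atoms leaves a large residual, so the bin is rejected.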


Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement
Coteli, Mert Burkay; Olgun, Orhun; Hacıhabiboğlu, Hüseyin (2018-11-01)
Estimation of the direction-of-arrival (DOA) of sound sources is an important step in sound field analysis. Rigid spherical microphone arrays allow the calculation of a compact spherical harmonic representation of the sound field. The standard method for analyzing sound fields recorded using such arrays is to compute steered response power (SRP) maps, wherein the source DOA is estimated as the steering direction that maximizes the output power of a maximally directive beam. This approach is computationally costly s...
Olgun, Orhun; Hacıhabiboğlu, Hüseyin (2018-09-20)
Direction-of-arrival (DOA) estimation is an important step in acoustic scene analysis. Multiple signal classification in the eigenbeam domain (EB-MUSIC) is an accurate DOA estimation method for rigid spherical microphone arrays. Two important issues with this method are 1) the requirement of prior information about the number of coherent source components, and 2) its computational cost. In this paper, a computationally efficient two-stage method, which can alleviate these problems, is...
Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2018-09-20)
Acoustic source separation refers to the extraction of individual source signals from microphone array recordings of multiple sources made in multipath environments such as rooms. The most straightforward approach to acoustic source separation involves spatial filtering via beamforming. While beamforming works well for a few sources and under low reverberation, its performance diminishes for a high number of sources and/or high reverberation. An informed acoustic source separation method based on the applic...
Data-driven Threshold Selection for Direct Path Dominance Test
Olgun, Orhun; Hacıhabiboğlu, Hüseyin (2019-09-09)
Direction-of-arrival estimation methods, when used with recordings made in enclosures, are negatively affected by the reflections and reverberation in that enclosure. The direct path dominance (DPD) test was proposed as a pre-processing stage which can provide better DOA estimates by selecting only the time-frequency bins with a single dominant sound source component prior to DOA estimation, thereby reducing the total computational cost. The DPD test involves selecting bins for which the ratio of the two largest sin...
Spherical harmonics based acoustic scene analysis for object-based audio
Çöteli, Mert Burkay; Hacıhabiboğlu, Hüseyin; Department of Information Systems (2021-02-19)
Object-based audio relies on elemental audio signals from individual sound sources and their associated metadata to be reconstructed at the listener side. While defining audio objects in a production setting is straightforward, it is not trivial to extract audio objects from more realistic recording scenarios such as concerts. Thus, existing object-based audio standards also define scene-based formats alongside object-based representations that provide immersive audio, but without the flexibility provided by...
Citation Formats
M. B. Coteli and H. Hacıhabiboğlu, “Multiple Sound Source Localization with Rigid Spherical Microphone Arrays via Residual Energy Test,” 2019.