Data-driven Threshold Selection for Direct Path Dominance Test

Direction-of-arrival estimation methods, when used with recordings made in enclosures are negatively affected by the reflections and reverberation in that enclosure. Direct path dominance (DPD) test was proposed as a pre-processing stage which can provide better DOA estimates by selecting only the time-frequency bins with a single dominant sound source component prior to DOA estimation, thereby reducing the total computational cost. DPD test involves selecting bins for which the ratio of the two largest singular values of the local spatial correlation matrix is above a threshold. The selection of this threshold is typically carried out in an ad hoc manner, which hinders the generalisation of this approach. This selection method also potentially increases the total computational cost or reduces the accuracy of DOA estimation. We propose a DPD test threshold selection method based on a data-driven statistical model. The model is based on the approximation of the singular value ratio distribution of the spatial correlation matrices as a generalised Pareto distribution and allows selecting time-frequency bins based on their probability of occurrence. We demonstrate the application of this threshold selection method via emulations using acoustic impulse responses measured in a highly reverberant room with a rigid spherical microphone array.
23rd International Congress on Acoustics


Subarray selection in octave arrays
Kaderoğlu, Ali Rıza; Çiloğlu, Tolga; Department of Electrical and Electronics Engineering (2022-2-04)
Sensor layout of the octave array is designed to process a frequency range of several octaves. Array aperture of the highest octave band is limited to uniform line array (ULA) segment at the center. Forming a sparse array layout for the highest frequency band by using some of the remaining elements apart from ULA could bring some advantages such as higher array gain and better detection performance. Using a sparse array has a drawback of violating the spatial Nyquist limit, which may cause high sidelobes to...
Multiple Sound Source Localization with Rigid Spherical Microphone Arrays via Residual Energy Test
Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2019-05-01)
The estimation of the directions-of-arrival (DOAs) of multiple sound sources is a fundamental stage in acoustic scene analysis. Many application areas such as robot audition and object-based audio (OBA) broadcast require that DOA estimation is computationally efficient to allow real-time operation. We propose a new DOA estimation approach based on a sparse representation of recorded sound fields as a linear combination of spatially bandlimited impulses in this paper. The proposed algorithm operates on a tim...
Spherical harmonics based acoustic scene analysis for object-based audio
Çöteli, Mert Burkay; Hacıhabiboğlu, Hüseyin; Department of Information Systems (2021-2-19)
Object-based audio relies on elemental audio signals from individual sound sources and their associated metadata to be reconstructed at the listener side. While defining audio objects in a production setting is straightforward, it is not trivial to extract audio objects from more realistic recording scenarios such as concerts. Thus, existing object-based audio standards also define scene-based formats alongside objectbased representations that provide immersive audio, but without the flexibility provided by...
Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2018-09-20)
Acoustic source separation refers to the extraction of individual source signals from microphone array recordings of multiple sources made in multipath environments such as rooms. The most straightforward approach to acoustic source separation involves spatial filtering via beamforming. While beamforming works well for a few sources and under low reverberation, its performance diminishes for a high number of sources and/or high reverberation. An informed acoustic source separation method based on the applic...
Panoramic recording and reproduction of multichannel audio using a circular microphone array
Hacıhabiboğlu, Hüseyin (2009-10-18)
Multichannel audio reproduction generally suffers from one or both of the following problems: i) the recorded audio has to be artificially manipulated to provide the necessary spatial cues, which reduces the consistency of the reproduced sound field with the actual one, and ii) reproduction is not panoramic, which degrades realism when the listener is not seated in a desired ideal position facing the center channel. A recording method using a circularly symmetric array of differential microphones, and a rep...
Citation Formats
O. Olgun and H. Hacıhabiboğlu, “Data-driven Threshold Selection for Direct Path Dominance Test,” presented at the 23rd International Congress on Acoustics, Aachen, Germany, 2019, Accessed: 00, 2020. [Online]. Available: