Entropy-based direction-of-arrival estimation methods for rigid spherical microphone arrays

2019
Olgun, Orhun
Direction-of-arrival (DOA) estimation of sound sources is a popular research topic and plays an important role in several applications, including spatial audio. Recent advances in microphone arrays have made more accurate sound field analysis possible. Spherical microphone arrays afford a trivial calculation of the spherical harmonic decomposition of sound fields and can be employed in different DOA estimation methods in the spherical harmonics domain. This thesis proposes a novel DOA estimation method called Hierarchical Grid Refinement (HiGRID) for rigid spherical microphone arrays (RSMAs). The method is based on calculating the sector-averaged directional response power of a steered beam over a sparse set of directions on the unit sphere. The directions for which the response power is calculated are selected using spatial entropy as a criterion. A new clustering method based on connected components labelling is also proposed for counting sources and estimating their DOAs. In addition to HiGRID, this work investigates several state-of-the-art DOA estimation techniques, with the aim of improving the DOA estimation performance or computational efficiency of Eigenbeam Multiple Signal Classification (EB-MUSIC) and the Direct Path Dominance (DPD) test. HiGRID is first used as a source counting method prior to EB-MUSIC to decrease the computational cost of DOA estimation. HiGRID is then used as a DOA estimation method following the DPD test, which increases the DOA estimation accuracy while reducing the total computational cost. A new data-driven statistical method for DPD test threshold selection is also proposed. This allows an informed selection of the DPD test threshold based on effective rank statistics of spatial correlation matrices obtained from RSMAs. Comparisons of HiGRID with previous DOA estimation methods using real and simulated recordings are presented. Evaluations of the proposed algorithms for EB-MUSIC and the DPD test are also presented in terms of DOA estimation errors using simulated recordings. HiGRID and its combinations with EB-MUSIC and the DPD test performed favourably in comparison with other state-of-the-art DOA estimation methods, indicating the utility of the proposed methods in DOA estimation.
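The abstract names two entropy-related quantities without defining them: spatial entropy of a response power map and the effective rank of a spatial correlation matrix. The Python sketch below illustrates one common reading of each and is not taken from the thesis. It assumes spatial entropy is the Shannon entropy of the power map normalised to sum to one, and that effective rank is the exponential of the Shannon entropy of the normalised singular values. The function names `spatial_entropy` and `effective_rank` are hypothetical.

```python
import numpy as np


def spatial_entropy(power):
    """Shannon entropy in bits of a non-negative power map.

    Assumption: the map is normalised to sum to 1 and treated as a
    probability distribution over steering directions.
    """
    p = np.asarray(power, dtype=float).ravel()
    p = p / p.sum()
    p = p[p > 0]  # treat 0 * log(0) as 0
    return float(-(p * np.log2(p)).sum())


def effective_rank(matrix):
    """Effective rank as exp(entropy of the normalised singular values).

    Assumption: this is one common definition; the thesis may use another.
    """
    s = np.linalg.svd(np.asarray(matrix), compute_uv=False)
    q = s / s.sum()
    q = q[q > 0]
    return float(np.exp(-(q * np.log(q)).sum()))
```

Under these assumptions, a power map that is uniform over four directions has an entropy of 2 bits, while a map concentrated in a single direction has an entropy of 0. A low entropy therefore suggests a region dominated by one source. Similarly, a correlation matrix with one dominant singular value has an effective rank close to 1.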

Suggestions

Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement
Coteli, Mert Burkay; Olgun, Orhun; Hacıhabiboğlu, Hüseyin (2018-11-01)
Estimation of the direction-of-arrival (DOA) of sound sources is an important step in sound field analysis. Rigid spherical microphone arrays allow the calculation of a compact spherical harmonic representation of the sound field. The standard method for analyzing sound fields recorded using such arrays is steered response power (SRP) maps wherein the source DOA can be estimated as the steering direction that maximizes the output power of a maximally directive beam. This approach is computationally costly s...
Spherical harmonics based acoustic scene analysis for object-based audio
Çöteli, Mert Burkay; Hacıhabiboğlu, Hüseyin; Department of Information Systems (2021-02-19)
Object-based audio relies on elemental audio signals from individual sound sources and their associated metadata to be reconstructed at the listener side. While defining audio objects in a production setting is straightforward, it is not trivial to extract audio objects from more realistic recording scenarios such as concerts. Thus, existing object-based audio standards also define scene-based formats alongside object-based representations that provide immersive audio, but without the flexibility provided by...
ACOUSTIC SOURCE SEPARATION USING RIGID SPHERICAL MICROPHONE ARRAYS VIA SPATIALLY WEIGHTED ORTHOGONAL MATCHING PURSUIT
Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2018-09-20)
Acoustic source separation refers to the extraction of individual source signals from microphone array recordings of multiple sources made in multipath environments such as rooms. The most straightforward approach to acoustic source separation involves spatial filtering via beamforming. While beamforming works well for a few sources and under low reverberation, its performance diminishes for a high number of sources and/or high reverberation. An informed acoustic source separation method based on the applic...
Data-driven Threshold Selection for Direct Path Dominance Test
Olgun, Orhun; Hacıhabiboğlu, Hüseyin (2019-09-09)
Direction-of-arrival estimation methods, when used with recordings made in enclosures, are negatively affected by the reflections and reverberation in that enclosure. The direct path dominance (DPD) test was proposed as a pre-processing stage which can provide better DOA estimates by selecting only the time-frequency bins with a single dominant sound source component prior to DOA estimation, thereby reducing the total computational cost. The DPD test involves selecting bins for which the ratio of the two largest sin...
SOUND SOURCE LOCALISATION PERFORMANCE OF OPEN SPHERICAL ACOUSTIC INTENSITY PROBES UNDER REVERBERANT CONDITIONS
Hacıhabiboğlu, Hüseyin (2014-04-25)
Open-spherical acoustic intensity probes are microphone arrays based on the Kirchhoff-Helmholtz integral and are used in the measurement of active acoustic intensity. The acoustic intensity measurements obtained by these arrays can be used to localise sound sources. Previously, the performance of these arrays in acoustic free field conditions was obtained using numerical simulations and it was shown that they provide better performance than other types of probes. This paper discusses the implementation of ...
Citation Formats
O. Olgun, “Entropy-based direction-of-arrival estimation methods for rigid spherical microphone arrays,” Thesis (M.S.) -- Graduate School of Informatics. Modeling and Simulation., Middle East Technical University, 2019.