Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement
Date
2018-11-01
Author
COTELI, Mert Burkay
OLGUN, Orhun
Hacıhabiboğlu, Hüseyin
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Estimation of the direction-of-arrival (DOA) of sound sources is an important step in sound field analysis. Rigid spherical microphone arrays allow the calculation of a compact spherical harmonic representation of the sound field. The standard method for analyzing sound fields recorded using such arrays is the steered response power (SRP) map, in which the source DOA is estimated as the steering direction that maximizes the output power of a maximally directive beam. This approach is computationally costly because it requires steering the beam in all possible directions. This paper presents an extension to SRP called steered response power density (SRPD) and an associated signal-adaptive search method, called hierarchical grid refinement, that reduces the number of steering directions needed for DOA estimation. The proposed method can localize near-coherent as well as incoherent sources while jointly providing the number of prominent sources in the scene. It is shown to be robust to reverberation and additive white noise. The paper evaluates the proposed method using simulations and real recordings under highly reverberant conditions and compares it with state-of-the-art methods.
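To make the baseline concrete, the following is a minimal sketch of a conventional exhaustive SRP search of the kind the abstract describes as costly. It is not the authors' SRPD or hierarchical grid refinement method. It assumes first-order real spherical harmonic coefficients, a single simulated plane-wave source, and a uniform 5-degree azimuth/elevation grid; all function names (`real_sh_order1`, `srp_map`, `estimate_doa`) and parameter values are hypothetical and chosen only for illustration.

```python
import numpy as np


def real_sh_order1(azimuth, elevation):
    """Real spherical harmonics up to order 1 (assumed ordering: Y00, Y1-1, Y10, Y11)."""
    x = np.cos(elevation) * np.cos(azimuth)
    y = np.cos(elevation) * np.sin(azimuth)
    z = np.sin(elevation)
    c0 = 1.0 / np.sqrt(4.0 * np.pi)
    c1 = np.sqrt(3.0 / (4.0 * np.pi))
    return np.stack([c0 * np.ones_like(x), c1 * y, c1 * z, c1 * x], axis=-1)


def srp_map(sh_coeffs, azimuths, elevations):
    """Mean output power of a beam steered to each candidate direction.

    sh_coeffs has shape (frames, 4). The beam weights are the spherical
    harmonics of the steering direction, which gives the most directive
    beam available at this order (up to a constant gain).
    """
    steering = real_sh_order1(azimuths, elevations)   # (directions, 4)
    beam_out = sh_coeffs @ steering.T                 # (frames, directions)
    return np.mean(np.abs(beam_out) ** 2, axis=0)


def estimate_doa(sh_coeffs, n_az=72, n_el=37):
    """Exhaustive search: evaluate every grid direction and return the argmax."""
    az = np.linspace(-np.pi, np.pi, n_az, endpoint=False)
    el = np.linspace(-np.pi / 2, np.pi / 2, n_el)
    az_grid, el_grid = np.meshgrid(az, el, indexing="ij")
    az_flat, el_flat = az_grid.ravel(), el_grid.ravel()
    power = srp_map(sh_coeffs, az_flat, el_flat)
    best = int(np.argmax(power))
    return az_flat[best], el_flat[best]


# Synthetic test signal: one plane wave from (40 deg, 20 deg) plus weak noise.
rng = np.random.default_rng(0)
true_az, true_el = np.deg2rad(40.0), np.deg2rad(20.0)
signal = rng.standard_normal(200)
coeffs = signal[:, None] * real_sh_order1(true_az, true_el)[None, :]
coeffs += 0.01 * rng.standard_normal(coeffs.shape)

est_az, est_el = estimate_doa(coeffs)
print(np.rad2deg(est_az), np.rad2deg(est_el))
```

With these assumed settings the search evaluates 72 × 37 = 2664 steering directions for a single estimate. That per-estimate count is the cost a signal-adaptive search aims to reduce.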
Subject Keywords
Source localization, Rigid spherical microphone arrays, Steered response power maps, Direction-of-arrival estimation
URI
https://hdl.handle.net/11511/30025
Journal
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
DOI
https://doi.org/10.1109/taslp.2018.2858932
Collections
Graduate School of Informatics, Article
Suggestions
Multiple Sound Source Localization with Rigid Spherical Microphone Arrays via Residual Energy Test
Coteli, Mert Burkay; Hacıhabiboğlu, Hüseyin (2019-05-01)
The estimation of the directions-of-arrival (DOAs) of multiple sound sources is a fundamental stage in acoustic scene analysis. Many application areas such as robot audition and object-based audio (OBA) broadcast require that DOA estimation is computationally efficient to allow real-time operation. We propose a new DOA estimation approach based on a sparse representation of recorded sound fields as a linear combination of spatially bandlimited impulses in this paper. The proposed algorithm operates on a tim...
Entropy-based direction-of-arrival estimation methods for rigid spherical microphone arrays
Olgun, Orhun; Hacıhabiboğlu, Hüseyin; Department of Modeling and Simulation (2019)
Direction-of-arrival (DOA) estimation of sound sources is a popular topic of research and has an important role in several different applications including spatial audio. Recent advances in microphone arrays made more accurate sound field analysis possible. Spherical microphone arrays afford a trivial calculation of spherical harmonic decomposition of sound fields and can be employed in different DOA estimation methods in spherical harmonics domain. This thesis proposes a novel DOA estimation method called ...
Spherical harmonics based acoustic scene analysis for object-based audio
Çöteli, Mert Burkay; Hacıhabiboğlu, Hüseyin; Department of Information Systems (2021-2-19)
Object-based audio relies on elemental audio signals from individual sound sources and their associated metadata to be reconstructed at the listener side. While defining audio objects in a production setting is straightforward, it is not trivial to extract audio objects from more realistic recording scenarios such as concerts. Thus, existing object-based audio standards also define scene-based formats alongside object-based representations that provide immersive audio, but without the flexibility provided by...
LOCALIZATION OF MULTIPLE SOURCES IN THE SPHERICAL HARMONIC DOMAIN WITH HIERARCHICAL GRID REFINEMENT AND EB-MUSIC
Olgun, Orhun; Hacıhabiboğlu, Hüseyin (2018-09-20)
Direction-of-arrival (DOA) estimation is an important step in acoustic scene analysis. Multiple signal classification in the eigenbeam domain (EB-MUSIC) is an accurate direction-of-arrival (DOA) estimation method for rigid spherical microphone arrays. Two important issues with this method are 1) the requirement of prior information about the number of coherent source components, and 2) its computational cost. In this paper, a computationally efficient two-stage method, which can alleviate these problems, is...
Blind source separation and directional audio synthesis for binaural auralization of multiple sound sources using microphone array recordings
Günel Kılıç, Banu; Hacıhabiboğlu, Hüseyin (2008-01-01)
Microphone array signal processing techniques are extensively used for sound source localisation, acoustical characterisation and sound source separation, which are related to audio analysis. However, the use of microphone arrays for auralisation, which is generally related to synthesis, has been limited so far. This paper proposes a method for binaural auralisation of multiple sound sources based on blind source separation (BSS) and binaural audio synthesis. A BSS algorithm is introduced that exploits the ...
Citation (IEEE)
M. B. COTELI, O. OLGUN, and H. Hacıhabiboğlu, “Multiple Sound Source Localization With Steered Response Power Density and Hierarchical Grid Refinement,” IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, pp. 2215–2229, 2018, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/30025.