Rotation Calibration of Rigid Spherical Microphone Arrays for Multi-perspective 6DoF Audio Recordings

2021-09-08
The preferred approach for multi-perspective six-degrees-of-freedom (6DoF) audio involves using multiple rigid spherical microphone arrays (RSMA) that can capture higher-order Ambisonics. RSMAs are spherically symmetric and allow the calculation of the local decomposition of sound fields over spherical harmonic functions. When multiple such arrays are used, multiple scattering occurs that can be equalized via methods that rely on multipole expansions if the positions of the arrays are known and the coordinate frames of arrays are aligned. In a practical scenario, however, when such a set of arrays are placed in location, their positions will not be exact and their coordinate frames will not be fully aligned. This paper is concerned with the correction of rotational mismatches in azimuth angles of individual RSMAs. The effects of such misalignment on the calculation of the local multipole expansion of the sound field are numerically explored. A rotation calibration approach that depends on non-linear optimisation is proposed. Numerical evaluations of different scenarios are presented.
2021 Immersive and 3D Audio: from Architecture to Automotive (I3DA)

Suggestions

Frequency tunable terahertz metamaterials using broadside coupled split-ring resonators
Ekmekci, E.; Strikwerda, A. C.; Fan, K.; Keiser, G.; Zhang, X.; Sayan, Gönül; Averitt, R. D. (American Physical Society (APS), 2011-05-19)
We present frequency tunable metamaterial designs at terahertz (THz) frequencies using broadside coupled split-ring resonator (BC-SRR) arrays. Frequency tuning, arising from changes in near-field coupling, is obtained by in-plane displacement of the two SRR layers. For electrical excitation, the resonance frequency continuously redshifts as a function of displacement. The maximum frequency shift occurs for vertical displacement of half a unit cell, resulting in a shift of 663 GHz (51% of f(0)). We discuss t...
Single Image Noise Level Estimation Using Dark Channel Prior
Yeşilyurt, Aziz Berkay; Erol, Aybüke; Kamışlı, Fatih; Alatan, Abdullah Aydın (2019-09-22)
Noise level is required as an input parameter in various image processing applications. In this work, we use the dark channel prior (DCP) to estimate the noise level of an image degraded by additive white Gaussian noise. We develop an approximate model of the probability density function of the dark channel of the noisy image. Using this model, the noise level is determined with the maximum likelihood estimation method from the dark channel intensity values of the noisy image. The results show that our meth...
Errors-and-Erasures Decoding for Block Codes With Feedback
Nakiboğlu, Barış (2012-01-01)
Inner and outer bounds are derived on the optimal performance of fixed-length block codes on discrete memoryless channels with feedback and errors-and-erasures decoding. First, an inner bound is derived using a two-phase encoding scheme with communication and control phases together with the optimal decoding rule for the given encoding scheme, among decoding rules that can be represented in terms of pairwise comparisons between the messages. Then, an outer bound is derived using a generalization of the stra...
3D perceptual soundfield reconstruction via sound field extrapolation
Erdem, Eg; Hacıhabiboğlui Hüseyin.; Department of Multimedia Informatics (2020)
Perceptual sound field reconstruction (PSR) is a spatial audio recording and reproduction method based on the application of stereophonic panning laws in microphone array design. PSR allows rendering a perceptually veridical and stable auditory perspective in the horizontal plane of the listener, and involves recording using nearcoincident microphone arrays. This thesis extends the two dimensional PSR concept to three dimensions and allows reconstructing an arbitrary sound field based on measurements with a...
Dimension reduced robust beamforming for towed arrays
Topçu, Emre; Candan, Çağatay; Department of Electrical and Electronics Engineering (2015)
Adaptive beamforming methods are used to obtain higher signal to interference plus noise ratio at the array output. However, these methods are very sensitive to steering vector and covariance matrix estimation errors. To overcome this issue, robust methods are usually employed. On the other hand, implementation of these robust methods can be computationally expensive for arrays with large number of sensors. Reduced dimension techniques aim to lower the computational load of adaptive beamforming algorithms w...
Citation Formats
O. Olgun, E. Erdem, and H. Hacıhabiboğlu, “Rotation Calibration of Rigid Spherical Microphone Arrays for Multi-perspective 6DoF Audio Recordings,” presented at the 2021 Immersive and 3D Audio: from Architecture to Automotive (I3DA), Bologna, İtalya, 2021, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/94775.