An iterative adaptive multi-modal stereo-vision method using mutual information

2015-01-01
Yaman, Mustafa
Kalkan, Sinan
We propose a method for computing disparity maps from a multi-modal stereo-vision system composed of an infrared-visible camera pair. The method uses mutual information (MI) as the basic similarity measure, combined with a segment-based adaptive windowing mechanism and a novel MI computation surface that incorporates joint prior probabilities. The computed cost confidences are aggregated using a novel adaptive cost aggregation method, and the resulting minimum-cost disparities are plane-fitted within their respective segments. The segments are iteratively refined by merging and splitting, which reduces the dependency on the initial segmentation. Finally, the estimated disparities are iteratively refined by repeating all the steps. On an artificially modified version of the Middlebury dataset and a Kinect dataset that we created in this study, we show that (i) our proposal improves the quality of the existing MI formulation, and (ii) our method can provide depth comparable in quality to Kinect depth data.
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION
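
As a rough illustration of the similarity measure named in the abstract, the sketch below estimates mutual information between two equally sized image patches from their joint intensity histogram. It is a generic textbook formulation, not the authors' method: it omits the joint prior probabilities, the segment-based adaptive windows, and the cost aggregation the paper describes. The function name `mutual_information` and the `bins` parameter are assumptions made for this example.

```python
import numpy as np


def mutual_information(patch_a, patch_b, bins=16):
    """Estimate MI (in nats) between two equally sized intensity patches.

    Plain joint-histogram estimate; a hypothetical helper for illustration,
    not the MI formulation proposed in the paper.
    """
    a = np.asarray(patch_a, dtype=float).ravel()
    b = np.asarray(patch_b, dtype=float).ravel()
    if a.shape != b.shape:
        raise ValueError("patches must contain the same number of pixels")

    # Joint histogram of co-occurring intensities, normalised to a probability table.
    joint, _, _ = np.histogram2d(a, b, bins=bins)
    p_ab = joint / joint.sum()

    # Marginals obtained by summing the joint table over each axis.
    p_a = p_ab.sum(axis=1, keepdims=True)
    p_b = p_ab.sum(axis=0, keepdims=True)

    # Sum p(a,b) * log(p(a,b) / (p(a) p(b))) over non-empty cells only.
    nz = p_ab > 0
    return float(np.sum(p_ab[nz] * np.log(p_ab[nz] / (p_a @ p_b)[nz])))
```

MI depends on the statistical co-occurrence of intensities rather than on their equality, so a patch and its intensity-inverted copy score about as high as two identical patches. This property is what makes MI a natural candidate for matching across modalities such as infrared and visible images.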

Suggestions

Surface Reconstruction from Multiple Images Filtering Non Lambert Regions
BÜYÜATALAY, Soner; BİRGÜL, ÖZLEM; Halıcı, Uğur (2009-09-10)
In this study, a new algorithm for 3D surface reconstruction from multiple images using a modified photometric stereo method is proposed and tested. The new algorithm, Filtered Lambert Photometric Stereo (FLPS), determines the non-Lambert pixels in the available images using a linearity test and constructs filtering masks for each image that correspond to specular and self or cast shadow regions. Then, photometric stereo is applied after eliminating the points in these masks. Tests carried out on synthe...
Multimodal Stereo Vision Using Mutual Information with Adaptive Windowing
Yaman, Mustafa; Kalkan, Sinan (2013-05-23)
This paper proposes a method for computing disparity maps from a multimodal stereovision system composed of an infrared and a visible camera pair. The method uses mutual information (MI) as the basic similarity measure where a segmentation-based adaptive windowing mechanism is proposed for greatly enhancing the results. On several datasets, we show that (i) our proposal improves the quality of existing MI formulation, and (ii) our method can provide depth comparable to the quality of Kinect depth data.
Multi-image region growing for integrating disparity maps
Leloglu, UĞUR MURAT; Halıcı, Uğur (1999-01-01)
In this paper, a multi-image region growing algorithm to obtain planar 3-D surfaces in the object space from multiple dense disparity maps is presented. A surface patch is represented by a plane equation and a set of pixels in multiple images. The union of the back projections of all pixels in the set onto the infinite plane forms the surface patch. Thanks to this hybrid representation of planar surfaces, region growing (both region aggregation and region merging) is performed on all images simultaneously. Pl...
Comparative evaluation of ISAR processing algorithms
Tufan, Alper; Dural Ünver, Mevlüde Gülbin; Koç, Seyit Sencer; Department of Electrical and Electronics Engineering (2012)
In this thesis, Inverse Synthetic Aperture Radar (ISAR) image reconstruction techniques, namely Range Doppler, Back Projection, Polar Formatting, Multiple Signal Classification (MUSIC) and Time Frequency techniques, are analysed and compared using simulations. Time Frequency techniques investigated in this thesis are Short Time Fourier Transform, Wigner-Ville Distribution, Smoothed Wigner-Ville Distribution and Choi-Williams Distribution. First, some fundamental concepts of ISAR, such as resolution, range profil...
An Energy Efficient Additive Neural Network
Afrasiyabi, Arman; Nasır, Barış; Yildiz, Ozan; Yarman Vural, Fatoş Tunay; ÇETİN, AHMET ENİS (2017-05-18)
In this paper, we propose a new energy-efficient neural network with the universal approximation property over the space of Lebesgue integrable functions. This network, called the additive neural network, is well suited to mobile computing. The neural structure is based on a novel vector product definition, called ef-operator, that permits a multiplier-free implementation. In ef-operation, the "product" of two real numbers is defined as the sum of their absolute values, with the sign determined by the sign of th...
Citation Formats
M. Yaman and S. Kalkan, “An iterative adaptive multi-modal stereo-vision method using mutual information,” JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, pp. 115–131, 2015, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/35067.