An iterative adaptive multi-modal stereo-vision method using mutual information

Date

2015-01-01

Author

Yaman, Mustafa
Kalkan, Sinan

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

415
views

0
downloads

We propose a method for computing disparity maps from a multi-modal stereo-vision system composed of an infrared-visible camera pair. The method uses mutual information (MI) as the basic similarity measure where a segment-based adaptive windowing mechanism is proposed along with a novel MI computation surface with joint prior probabilities incorporated. The computed cost confidences are aggregated using a novel adaptive cost aggregation method, and the resultant minimum cost disparities in segments are plane-fitted in their respective segments which are iteratively refined by merging and splitting segments reducing dependency to initial segmentation. Finally, the estimated disparities are iteratively refined by repeating all the steps. On an artificially-modified version of the Middlebury dataset and a Kinect dataset that we created in this study, we show that (i) our proposal improves the quality of existing MI formulation, and (ii) our method can provide depth comparable to the quality of Kinect depth data.

Subject Keywords

Multi-modal stereo-vision, Mutual information, Adaptive windowing, Adaptive cost aggregation, Iterative stereo, Plane fitting, RGB-D, Middleburry dataset

URI

https://hdl.handle.net/11511/35067

Journal

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION

DOI

https://doi.org/10.1016/j.jvcir.2014.11.010

Collections

Department of Computer Engineering, Article

Suggestions

OpenMETU
Core

Multimodal Stereo Vision Using Mutual Information with Adaptive Windowing Yaman, Mustafa; Kalkan, Sinan (2013-05-23) This paper proposes a method for computing disparity maps from a multimodal stereovision system composed of an infrared and a visible camera pair. The method uses mutual information (MI) as the basic similarity measure where a segmentation-based adaptive windowing mechanism is proposed for greatly enhancing the results. On several datasets, we show that (i) our proposal improves the quality of existing MI formulation, and (ii) our method can provide depth comparable to the quality of Kinect depth data.
Multi-image region growing for integrating disparity maps Leloglu, UĞUR MURAT; Halıcı, Uğur (1999-01-01) In this paper, a multi-image region growing algorithm to obtain planar 3-D surfaces in the object space from multiple dense disparity maps, is presented. A surface patch is represented by a plane equation and a set of pixels in multiple images. The union of back projections of all pixels in the set onto the infinite plane, forms the surface patch. Thanks to that hybrid representation of planar surfaces, region growing (both region aggregation and region merging) is performed on all images simultaneously. Pl...
OPTIMIZATION OF ENCODING AND ERROR PROTECTION PARAMETERS FOR 3D VIDEO BROADCAST OVER DVB-H Aksay, Anil; Bugdayci, Done; Akar, Gözde (2011-05-18) In this study, we propose a heuristic methodology for modeling the end-to-end distortion characteristics of an error resilient broadcast system for 3D video overDigital Video Broadcasting -Handheld (DVB-H). We also use this model to optimally select the parameters of the video encoder and the error correction scheme, namely, Multi Protocol Encapsulation Forward Error Correction (MPE-FEC), minimizing the overall distortion. The proposed method models the RQ curve of video encoder and performance of channel c...
An Energy Efficient Additive Neural Network Afrasiyabi, Arman; Nasır, Barış; Yildiz, Ozan; Yarman Vural, Fatoş Tunay; ÇETİN, AHMET ENİS (2017-05-18) In this paper, we propose a new energy efficient neural network with the universal approximation property over space of Lebesgue integrable functions. This network, called additive neural network, is very suitable for mobile computing. The neural structure is based on a novel vector product definition, called ef-operator, that permits a multiplier-free implementation. In ef-operation, the "product" of two real numbers is defined as the sum of their absolute values, with the sign determined by the sign of th...
Comparative evaluation of ISAR processing algorithms Tufan, Alper; Dural Ünver, Mevlüde Gülbin; Koç, Seyit Sencer; Department of Electrical and Electronics Engineering (2012) In this thesis, Inverse Synthtetic Aperture Radar image reconstruction techniques, named as Range Doppler, Back Projection, Polar Formatting, Multiple Signal Classification (MUSIC) and Time Frequency techniques are analysed and compared using simulations. Time Frequency techniques investigated in this thesis are Short Time Fourier Transform, Wigner-Ville Distribution, Smoothed Wigner-Ville Distribution and Choi-Williams Distribution. First, some fundamental concepts of ISAR, such as resolution, range profil...

Citation Formats

M. Yaman and S. Kalkan, “An iterative adaptive multi-modal stereo-vision method using mutual information,” JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, pp. 115–131, 2015, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/35067.