Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
An iterative adaptive multi-modal stereo-vision method using mutual information
Date
2015-01-01
Author
Yaman, Mustafa
Kalkan, Sinan
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
386
views
0
downloads
Cite This
We propose a method for computing disparity maps from a multi-modal stereo-vision system composed of an infrared-visible camera pair. The method uses mutual information (MI) as the basic similarity measure where a segment-based adaptive windowing mechanism is proposed along with a novel MI computation surface with joint prior probabilities incorporated. The computed cost confidences are aggregated using a novel adaptive cost aggregation method, and the resultant minimum cost disparities in segments are plane-fitted in their respective segments which are iteratively refined by merging and splitting segments reducing dependency to initial segmentation. Finally, the estimated disparities are iteratively refined by repeating all the steps. On an artificially-modified version of the Middlebury dataset and a Kinect dataset that we created in this study, we show that (i) our proposal improves the quality of existing MI formulation, and (ii) our method can provide depth comparable to the quality of Kinect depth data.
Subject Keywords
Multi-modal stereo-vision
,
Mutual information
,
Adaptive windowing
,
Adaptive cost aggregation
,
Iterative stereo
,
Plane fitting
,
RGB-D
,
Middleburry dataset
URI
https://hdl.handle.net/11511/35067
Journal
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION
DOI
https://doi.org/10.1016/j.jvcir.2014.11.010
Collections
Department of Computer Engineering, Article
Suggestions
OpenMETU
Core
Multimodal Stereo Vision Using Mutual Information with Adaptive Windowing
Yaman, Mustafa; Kalkan, Sinan (2013-05-23)
This paper proposes a method for computing disparity maps from a multimodal stereovision system composed of an infrared and a visible camera pair. The method uses mutual information (MI) as the basic similarity measure where a segmentation-based adaptive windowing mechanism is proposed for greatly enhancing the results. On several datasets, we show that (i) our proposal improves the quality of existing MI formulation, and (ii) our method can provide depth comparable to the quality of Kinect depth data.
Multi-image region growing for integrating disparity maps
Leloglu, UĞUR MURAT; Halıcı, Uğur (1999-01-01)
In this paper, a multi-image region growing algorithm to obtain planar 3-D surfaces in the object space from multiple dense disparity maps, is presented. A surface patch is represented by a plane equation and a set of pixels in multiple images. The union of back projections of all pixels in the set onto the infinite plane, forms the surface patch. Thanks to that hybrid representation of planar surfaces, region growing (both region aggregation and region merging) is performed on all images simultaneously. Pl...
OPTIMIZATION OF ENCODING AND ERROR PROTECTION PARAMETERS FOR 3D VIDEO BROADCAST OVER DVB-H
Aksay, Anil; Bugdayci, Done; Akar, Gözde (2011-05-18)
In this study, we propose a heuristic methodology for modeling the end-to-end distortion characteristics of an error resilient broadcast system for 3D video overDigital Video Broadcasting -Handheld (DVB-H). We also use this model to optimally select the parameters of the video encoder and the error correction scheme, namely, Multi Protocol Encapsulation Forward Error Correction (MPE-FEC), minimizing the overall distortion. The proposed method models the RQ curve of video encoder and performance of channel c...
An Energy Efficient Additive Neural Network
Afrasiyabi, Arman; Nasır, Barış; Yildiz, Ozan; Yarman Vural, Fatoş Tunay; ÇETİN, AHMET ENİS (2017-05-18)
In this paper, we propose a new energy efficient neural network with the universal approximation property over space of Lebesgue integrable functions. This network, called additive neural network, is very suitable for mobile computing. The neural structure is based on a novel vector product definition, called ef-operator, that permits a multiplier-free implementation. In ef-operation, the "product" of two real numbers is defined as the sum of their absolute values, with the sign determined by the sign of th...
Comparative evaluation of ISAR processing algorithms
Tufan, Alper; Dural Ünver, Mevlüde Gülbin; Koç, Seyit Sencer; Department of Electrical and Electronics Engineering (2012)
In this thesis, Inverse Synthtetic Aperture Radar image reconstruction techniques, named as Range Doppler, Back Projection, Polar Formatting, Multiple Signal Classification (MUSIC) and Time Frequency techniques are analysed and compared using simulations. Time Frequency techniques investigated in this thesis are Short Time Fourier Transform, Wigner-Ville Distribution, Smoothed Wigner-Ville Distribution and Choi-Williams Distribution. First, some fundamental concepts of ISAR, such as resolution, range profil...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
M. Yaman and S. Kalkan, “An iterative adaptive multi-modal stereo-vision method using mutual information,”
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION
, pp. 115–131, 2015, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/35067.