Robust extraction of sparse 3d points from image sequences

2008
Vural, Elif
In this thesis, the extraction of sparse 3D points from calibrated image sequences is studied. The presented method for sparse 3D reconstruction is examined in two parts: the first addresses the problem of two-view reconstruction, and the second extends the two-view reconstruction algorithm to multiple views. The examined two-view reconstruction method consists of basic building blocks such as feature detection and matching, epipolar geometry estimation, and the reconstruction of cameras and scene structure. Feature detection and matching are achieved by the Scale Invariant Feature Transform (SIFT) method. For the estimation of epipolar geometry, the 7-point and 8-point algorithms are examined for fundamental matrix (F-matrix) computation, while RANSAC and PROSAC are utilized for robust and accurate model estimation. In the final stage of two-view reconstruction, the camera projection matrices are computed from the F-matrix, and the locations of 3D scene points are estimated by triangulation, thereby determining the scene structure and cameras up to a projective transformation. The extension of the two-view reconstruction to multiple views is achieved by estimating the camera projection matrix of each additional view from the already reconstructed matches, and then adding new points to the scene structure by triangulating the unreconstructed matches. Finally, the reconstruction is upgraded from projective to metric by a rectifying homography computed from the camera calibration information. In order to obtain a refined reconstruction, two different methods are suggested for the removal of erroneous points from the scene structure. In addition to examining the solution to the reconstruction problem, experiments have been conducted that compare the performance of competing algorithms used in the various stages of reconstruction.
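As an illustration of why the choice between the 7-point and 8-point algorithms matters inside RANSAC, the sketch below evaluates the standard bound on the number of random samples needed to draw at least one outlier-free minimal sample with a given confidence, N = log(1 - p) / log(1 - w^s), where w is the inlier ratio and s is the sample size. This is not taken from the thesis: the function name `ransac_trials`, the 50% inlier ratio, and the 0.99 confidence level are assumptions chosen only for the example.

```python
import math


def ransac_trials(inlier_ratio, sample_size, confidence=0.99):
    """Number of RANSAC samples needed so that, with probability
    `confidence`, at least one sample contains only inliers.

    Hypothetical helper for illustration; all arguments are assumptions.
    """
    if not 0.0 < inlier_ratio <= 1.0:
        raise ValueError("inlier_ratio must be in (0, 1]")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be in (0, 1)")

    # Probability that one random sample of `sample_size` matches is all inliers.
    p_clean = inlier_ratio ** sample_size
    if p_clean >= 1.0:
        return 1  # no outliers: a single sample suffices
    if p_clean == 0.0:
        raise ValueError("inlier_ratio too small for this sample size")

    # Solve (1 - p_clean)^N <= 1 - confidence for N.
    # log1p keeps precision when p_clean is very small.
    return math.ceil(math.log(1.0 - confidence) / math.log1p(-p_clean))


if __name__ == "__main__":
    # Compare the 7-point and 8-point sample sizes at an assumed 50% inlier ratio.
    for s in (7, 8):
        print(f"sample size {s}: {ransac_trials(0.5, s)} trials")
```

Under these assumed settings the 7-point sample needs roughly half as many trials as the 8-point sample (588 versus 1177). The trade-off is that the 7-point algorithm can return up to three candidate F-matrices per sample, each of which has to be scored.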
In connection with sparse reconstruction, a rate-distortion efficient piecewise planar scene representation algorithm that generates mesh models of scenes from reconstructed point clouds is examined, and its performance is evaluated through experiments.

Suggestions

3D object recognition from range images using transform invariant object representation
Akagündüz, Erdem; Ulusoy, İlkay (Institution of Engineering and Technology (IET), 2010-10-28)
3D object recognition is performed using a scale and orientation invariant feature extraction method and a scale and orientation invariant topological representation. 3D surfaces are represented by sparse, repeatable, informative and semantically meaningful 3D surface structures, which are called multiscale features. These features are extracted with their scale (metric size and resolution) using the classified scale-space of 3D surface curvatures. Triplets of these features are used to represent the surfac...
Image segmentation based on variational techniques
Altınoklu, Metin Burak; Ünver, Baki Zafer; Department of Electrical and Electronics Engineering (2009)
In this thesis, image segmentation methods based on the Mumford-Shah variational approach have been studied. By obtaining an optimum point of the Mumford-Shah functional, which consists of a piecewise smooth approximate image and a set of edge curves, an image can be decomposed into regions. This piecewise smooth approximate image is smooth inside regions, but it is allowed to be discontinuous across region boundaries. Unfortunately, because of the irregularity of the Mumford-Shah functional, it cannot be directly used for ...
Image-based extraction of material reflectance properties of a 3D rigid object
Erdem, ME; Erdem, IA; Yilmaz, UG; Atalay, Mehmet Volkan (2004-01-01)
In this study, an appearance reconstruction method based on extraction of material reflectance properties of a three-dimensional (3D) object from its two-dimensional (2D) images is explained. One of the main advantages of this system is that the reconstructed object can be rendered in real-time with photorealistic quality in varying illumination conditions. The reflectance of the object is decomposed into diffuse and specular components. While the diffuse component is stored in a global texture, the specula...
Computer simulation and implementation of a visual 3-d eye gaze tracker for autostreoscopic displays
İnce, Kutalmış Gökalp; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2009)
In this thesis, a visual 3-D eye gaze tracker is designed, implemented, and tested via computer simulations and on an experimental setup. The proposed tracker is designed to examine human perception on autostereoscopic displays when the viewer is 3 m away from such displays. Two different methods are proposed for calibrating personal parameters and gaze estimation, namely the line of gaze (LoG) and line of sight (LoS) solutions. 2-D and 3-D estimation performances of the proposed system are observed both using comp...
Multiview 3d reconstruction of a scene containing independently moving objects
Tola, Engin; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2005)
In this thesis, the structure from motion problem for calibrated scenes containing independently moving objects (IMO) has been studied. For this purpose, the overall reconstruction process is partitioned into various stages. The first stage deals with the fundamental problem of estimating structure and motion by using only two views. This process starts with finding some salient features using a sub-pixel version of the Harris corner detector. The features are matched with the help of a similarity and neighbo...
Citation Formats
E. Vural, “Robust extraction of sparse 3d points from image sequences,” M.S. - Master of Science, Middle East Technical University, 2008.