Planar 3d scene representations for depth compression

Özkalaycı, Burak Oğuz
The recent spread of stereoscopic 3D television technologies is expected to be followed by autostereoscopic and holographic technologies. The ability of these technologies to display multiple stereoscopic pairs without glasses will advance the 3D experience. The prospective 3D format for creating the multiple views for such displays is the Multiview Video plus Depth (MVD) format, which is based on Depth Image Based Rendering (DIBR) techniques. The depth modality of the MVD format is an active research area whose main objective is to develop efficient, DIBR-friendly compression methods. As part of this research, the thesis proposes novel 3D planar-based depth representations. The planar approximation of the stereo depth images is formulated as an energy-based co-segmentation problem using a Markov Random Field model. The energy terms of this problem are designed to mimic the rate-distortion tradeoff of a depth compression application. A heuristic algorithm is developed for the practical use of the proposed planar approximations in stereo depth compression. The co-segmented regions are also represented as layered planar structures, forming a novel single-referenced MVD format. The proposed planar-based depth compression solutions are compared against state-of-the-art image/video and MVD compression standards. Compression performance is analyzed for depth reconstruction and for novel view rendering by DIBR techniques. All experiments are performed with the ground truth texture of the MVD data, since the scope of the thesis is limited to the depth modality. The visual and objective evaluations show that the proposed planar representations are promising for efficient depth compression with artifact-free novel view rendering. As a notable contribution, the proposed layered planar MVD representation also brings depth perception quality considerations into MVD compression schemes by largely decoupling texture and geometry.
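The idea of approximating a segmented depth image by planes and scoring the result with a rate-distortion style energy can be illustrated with a minimal sketch. This is not the thesis's formulation: the plane model d = a*x + b*y + c, the function names `fit_plane` and `labeling_energy`, the weight `lam`, and the rate proxy (three parameters per plane plus a Potts-style count of label boundaries) are all assumptions chosen for illustration.

```python
from typing import Dict, List, Tuple

Pixel = Tuple[int, int, float]   # (x, y, depth); hypothetical layout
Coord = Tuple[int, int]


def _det3(a: List[List[float]]) -> float:
    """Determinant of a 3x3 matrix."""
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))


def fit_plane(pixels: List[Pixel]) -> Tuple[float, float, float]:
    """Least-squares fit of d = a*x + b*y + c via the normal equations."""
    sxx = sxy = sx = syy = sy = n = sxd = syd = sd = 0.0
    for x, y, d in pixels:
        sxx += x * x
        sxy += x * y
        sx += x
        syy += y * y
        sy += y
        n += 1.0
        sxd += x * d
        syd += y * d
        sd += d
    m = [[sxx, sxy, sx], [sxy, syy, sy], [sx, sy, n]]
    rhs = [sxd, syd, sd]
    det = _det3(m)
    if abs(det) < 1e-12:
        raise ValueError("degenerate region: too few or collinear pixels")
    coeffs = []
    for col in range(3):          # Cramer's rule, one column at a time
        mc = [row[:] for row in m]
        for r in range(3):
            mc[r][col] = rhs[r]
        coeffs.append(_det3(mc) / det)
    return coeffs[0], coeffs[1], coeffs[2]


def labeling_energy(depth: Dict[Coord, float],
                    labels: Dict[Coord, int],
                    lam: float) -> float:
    """Distortion + lam * rate proxy for one candidate segmentation.

    Distortion: squared error between the depth and its per-region plane.
    Rate proxy (assumed): 3 parameters per plane plus the number of
    4-neighbour pixel pairs with different labels (Potts-style term).
    """
    regions: Dict[int, List[Pixel]] = {}
    for (x, y), d in depth.items():
        regions.setdefault(labels[(x, y)], []).append((x, y, d))

    distortion = 0.0
    for pixels in regions.values():
        a, b, c = fit_plane(pixels)
        distortion += sum((a * x + b * y + c - d) ** 2 for x, y, d in pixels)

    boundary = 0
    for (x, y), lab in labels.items():
        for nb in ((x + 1, y), (x, y + 1)):
            if nb in labels and labels[nb] != lab:
                boundary += 1

    return distortion + lam * (3 * len(regions) + boundary)
```

Comparing `labeling_energy` for two candidate label maps of the same depth image shows the tradeoff the abstract describes: adding regions lowers the distortion term but raises the rate proxy, and `lam` sets the balance between the two.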


3D object recognition from range images
İzciler, Fatih; Halıcı, Uğur; Department of Electrical and Electronics Engineering (2012)
Recognizing generic objects from single- or multi-view range images is a currently popular problem in the 3D object recognition area, driven by the developing technology of scanning devices such as laser range scanners. This problem is vital to current and future vision systems that perform shape-based matching and classification of objects in an arbitrary scene. Despite improvements in scanners, range scans still have imperfections such as holes or unconnected parts in the images. This study aims at proposing an...
Range data recognition: segmentation, matching, and similarity retrieval
Yalçın Bayramoğlu, Neslihan; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2011)
Improvements in 3D scanning technologies have led to the need for managing range image databases. Hence, the requirement of describing and indexing this type of data arises. Up to now, a considerable amount of work has been done on capturing, transmission and visualization; however, there is still a gap in 3D semantic analysis between the requirements of the applications and the obtained results. In this thesis we studied 3D semantic analysis of range data. Under this broad title we address segmentation of range...
Photometric stereo considering highlights and shadows
Büyükatalay, Soner; Halıcı, Uğur; Birgül, Özlem; Department of Electrical and Electronics Engineering (2011)
Three-dimensional (3D) shape reconstruction, which aims to reconstruct the 3D surface of objects from acquired images, is one of the main problems in computer vision. There are many applications of 3D shape reconstruction, from satellite imaging to material sciences, covering scales from a continent on earth to the microscopic surface properties of a material. One of these applications is automated firearm identification, which is an old yet unsolved problem in forensic science. Firearm evidence matching algorithms rel...
A Portable stereo-video streaming system
Zerman, Emin; Akar, Gözde; Department of Electrical and Electronics Engineering (2013)
In the last decade, 3D technologies have made great advances in reaching end users. Many cutting-edge technologies have had the opportunity to reach customers. With the increase in popularity of mobile electronics and the new mobile 3D multimedia displaying methods, mobile 3D became a very important multimedia factor in the last decade. This thesis presents an implementation of real-time stereo-video capture, compression, and wireless streaming from embedded platforms to mobile devices, as ...
Spatial 3D local descriptors for object recognition in RGB-D images
Loğoğlu, K. Berker; Temizel, Alptekin; Kalkan, Sinan; Department of Information Systems (2016)
The introduction of affordable but relatively high-resolution RGB-D sensors with synchronized color and depth, along with efforts on open-source point-cloud processing tools, has boosted research in both computer vision and robotics. One of the key areas that has drawn particular attention is object recognition, since it is one of the crucial steps for various applications. In this thesis, two spatially enhanced local 3D descriptors are proposed for object recognition tasks: Histograms of Spatial Concentric Surf...
Citation Formats
B. O. Özkalaycı, “Planar 3d scene representations for depth compression,” Ph.D. - Doctoral Program, Middle East Technical University, 2014.