3-d time-varying scene capture technologies - A survey

2007-11-01
STOYKOVA, Elena
Alatan, Abdullah Aydın
BENZİE, Philip
GRAMMALİDİS, Nikos
MALASSİOTİS, Sotiris
OSTERMANN, Joern
PİEKH, Sergej
Sainov, Ventseslav
THEOBALT, Christian
THEVAR, Thangavel
ZABULİS, Xenophon
Advances in image sensors and evolution of digital computation is a strong stimulus for development and implementation of sophisticated methods for capturing, processing and analysis of 3-D data from dynamic scenes. Research on perspective time-varying 3-D scene capture technologies is important for the upcoming 3DTV displays. Methods such as shape-from-texture, shape-from-shading, shape-from-focus, and shape-from-motion extraction can restore 3-D shape information from a single camera data. The existing techniques for 3-D extraction from single-camera video sequences are especially useful for conversion of the already available vast mono-view content to the 3DTV systems. Scene-oriented single-camera methods such as human face reconstruction and facial motion,analysis,,body modeling and body motion tracking, and motion recognition solve efficiently a variety of tasks. 3-D multicamera dynamic acquisition and reconstruction, their hardware specifics including calibration and synchronization and software demands form another area of intensive research. Different classes of multiview stereo algorithms such as those based on cost function computing and optimization, fusing of multiple views, and feature-point reconstruction are possible candidates for dynamic 3-D reconstruction. High-resolution digital holography and pattern projection techniques such as coded light or fringe projection for real-time extraction of 3-D object positions and color information could manifest themselves as an alternative to traditional camera-based methods. Apart from all of these approaches, there also are some active imaging devices capable of 3-D extraction such, as the 3-D time-of-flight camera, which provides 3-D image data of its environment by means of a modulated infrared light source.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

Suggestions

3-D motion estimation of rigid objects for video coding applications using an improved iterative version of the E-matrix method
Alatan, Abdullah Aydın (1998-02-01)
As an alternative to current two-dimensional (2-D) motion models, a robust three-dimensional (3-D) motion estimation method is proposed to be utilized in object-based video coding applications, Since the popular E-matrix method is well known for its susceptibility to input errors, a performance indicator, which tests the validity of the estimated 3-D motion parameters both explicitly and implicitly, is defined. This indicator is utilized within the RANSAC method to obtain a robust set of 2-D motion correspo...
Spatial 3D local descriptors for object recognition in RGB-D images
Loğoğlu, K. Berker; Temizel, Alptekin; Kalkan, Sinan; Department of Information Systems (2016)
Introduction of the affordable but relatively high resolution color and depth synchronized RGB-D sensors, along with the efforts on open-source point-cloud processing tools boosted research in both computer vision and robotics. One of the key areas which have drawn particular attention is object recognition since it is one of the crucial steps for various applications. In this thesis, two spatially enhanced local 3D descriptors are proposed for object recognition tasks: Histograms of Spatial Concentric Surf...
Steganography through perspective invariance
Yaşaroğlu, Yağız; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2012)
A novel approach for watermarking of 3D models is introduced, for which data is embedded into 3D models, whereas extracted from their projected 2D visual or 2D-plus-depth representations. Such a watermarking system is valuable, since most of the 3D content is being consumed as 2D visual data. Apart from the efficiency of embedding data into 3D models before generation of arbitrary 2D projections, in some use cases, such as free viewpoint video or computer games, 2D content has to be rendered at the client, ...
3D Object Modeling by Structured Light and Stereo Vision
Ozenc, Ugur; Tastan, Oguzhan; GÜLLÜ, MEHMET KEMAL (2015-05-19)
In this paper, we demonstrate a 3D object modeling system utilizing a setup which consists of two CMOS cameras and a DLP projector by making use of structured light and stereo vision. The calibration of the system is carried out using calibration pattern. The images are taken with stereo camera pair by projecting structured light onto the object and the correspondence problem is solved by both epipolar constraint of stereo vision and gray code constraint of structured light. The first experimental results s...
End-to-end learned image compression with conditional latent space modelling for entropy coding
Yeşilyurt, Aziz Berkay; Kamışlı, Fatih; Department of Electrical and Electronics Engineering (2019)
This thesis presents a lossy image compression system based on an end-to-end trainable neural network. Traditional compression algorithms use linear transformation, quantization and entropy coding steps that are designed based on simple models of the data and are aimed to be low complexity. In neural network based image compression methods, the processing steps, such as transformation and entropy coding, are performed using neural networks. The use of neural networks enables transforms or probability models...
Citation Formats
E. STOYKOVA et al., “3-d time-varying scene capture technologies - A survey,” IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, pp. 1568–1586, 2007, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/33124.