Scene representation technologies for 3DTV - A survey

Download
2007-11-01
Alatan, Abdullah Aydın
Gueduekbay, Ugur
Zabulis, Xenophon
Mueller, Karsten
ERDEM, Cigdem Eroglu
WEİGEL, Christian
SMOLİC, Aljoscha
3-D scene representation is utilized during scene extraction, modeling, transmission and display stages of a 3DTV framework. To this end, different representation technologies are proposed to fulfill the requirements of 3DTV paradigm. Dense point-based methods are appropriate for free-view 3DTV applications, since they can generate novel views easily. As surface representations, polygonal meshes are quite popular due to their generality and current hardware support. Unfortunately, there is no inherent smoothness in their description and the resulting renderings may contain unrealistic artifacts. NURBS surfaces have embedded smoothness and efficient tools for editing and animation, but they are more suitable for synthetic content. Smooth subdivision surfaces, which offer a good compromise between polygonal meshes and NURBS surfaces, require, sophisticated geometry modeling tools and are usually difficult to obtain. One recent trend in surface representation is point-based modeling which can meet most of the requirements of 3DTV, however the relevant state-of-the-art is not yet mature enough. On the other hand, volumetric representations encapsulate neighborhood information that is useful for the reconstruction of surfaces with their parallel implementations for multiview stereo algorithms. Apart from, the representation of 3-D structure by different primitives, texturing of scenes is also essential for a realistic scene rendering. Image-based rendering techniques directly render novel views of a scene from the acquired images, since they do not require any, explicit geometry or texture representation. 3-D human face and body modeling facilitate the realistic animation and rendering of human figures that is quite crucial for 3DTV that might demand real-time animation of human bodies. Physically based modeling and animation techniques produce impressive results, thus have potential for use in a 3DTV framework for modeling and animating dynamic scenes. As a concluding remark, it can be argued that 3-D scene and texture representation techniques are mature enough to serve and fulfill the requirements of 3-D extraction, transmission and display sides in a 3DTV scenario.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

Suggestions

Moving Object Detction in 2D and 3D Scenes
Sırtkaya, Salim; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2004)
This thesis describes the theoretical bases, development and testing of an integrated moving object detection framework in 2D and 3D scenes. The detection problem is analyzed in stationary and non-stationary camera sequences and different algorithms are developed for each case. Two methods are proposed in stationary camera sequences: background extraction followed by differencing and thresholding, and motion detection using optical flow field calculated by أKanade-Lucas Feature Trackerؤ. For non-stationary ...
3D object recognition from range images using transform invariant object representation
AKAGÜNDÜZ, erdem; Ulusoy, İlkay (Institution of Engineering and Technology (IET), 2010-10-28)
3D object recognition is performed using a scale and orientation invariant feature extraction method and a scale and orientation invariant topological representation. 3D surfaces are represented by sparse, repeatable, informative and semantically meaningful 3D surface structures, which are called multiscale features. These features are extracted with their scale (metric size and resolution) using the classified scale-space of 3D surface curvatures. Triplets of these features are used to represent the surfac...
Using multi-modal 3D contours and their relations for vision and robotics
BAŞESKİ, Emre; Pugeault, Nicolas; Kalkan, Sinan; BODENHAGEN, Leon; Piater, Justus H.; KRÜGER, Norbert (Elsevier BV, 2010-11-01)
In this work, we make use of 3D contours and relations between them (namely, coplanarity, cocolority, distance and angle) for four different applications in the area of computer vision and vision-based robotics. Our multi-modal contour representation covers both geometric and appearance information. We show the potential of reasoning with global entities in the context of visual scene analysis for driver assistance, depth prediction, robotic grasping and grasp learning. We argue that, such 3D global reasoni...
Motion compensated three dimensional wavelet transform based video compression and coding
Biçer, Aydın; Ünver, Baki Zafer; Department of Electrical and Electronics Engineering (2005)
In this thesis, a low bit rate video coding system based on three-dimensional (3-D) wavelet coding is studied. In addition to the initial motivation to make use of the motion compensated wavelet based coding schemes, the other techniques that do not utilize the motion compensation in their coding procedures have also been considered on equal footing. The 3-D wavelet transform (WT) algorithm is based on the أgroup of framesؤ (GOF) concept. The group of eight frames are decomposed both temporally and spatiall...
Pre-processing inputs for optimally-configured time-delay neural networks
Taşkaya Temizel, Tuğba; Ahmad, K (Institution of Engineering and Technology (IET), 2005-02-01)
A procedure for pre-processing non-stationary time series is proposed for modelling with a time-delay neural network (TDNN). The procedure stabilises the mean of the series and uses a fast Fourier transform to determine the TDNN input size. Results of applying this procedure on five well-known data sets are compared with existing hybrid neural network techniques, demonstrating improved prediction performance.
Citation Formats
A. A. Alatan et al., “Scene representation technologies for 3DTV - A survey,” IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, pp. 1587–1605, 2007, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/38784.