Coding algorithms for 3DTV - A survey

Smolic, Aljoscha
Mueller, Karsten
Stefanoski, Nikolce
Ostermann, Joern
Gotchev, Atanas
Akar, Gözde
Triantafyllidis, Georgios
Koz, Alper
Research efforts on 3DTV technology have been strengthened worldwide recently, covering the whole media processing chain from capture to display. Different 3DTV systems rely on different 3-D scene representations that integrate various types of data. Efficient coding of these data is crucial-for the success of 3DTV. Compression of pixel-type data including stereo video, multiview video, and associated depth or disparity maps extends available principles of classical video coding. Powerful algorithms and open international standards for multiview video coding and coding of video plus depth data are available and under development, which will provide the basis for introduction of various 3DTV systems and services in the near future. Compression of 3-D mesh models has also reached a high level of maturity. For static geometry, a variety of powerful algorithms are available to efficiently compress vertices and connectivity. Compression of dynamic 3-D geometry is currently a more active field of research. Temporal prediction is an important mechanism to remove redundancy from animated 3-D mesh sequences. Error resilience is important for transmission of data over error prone channels, and multiple description coding (MDC) is a suitable way to protect data. MDC of still images and 2-D video has already been widely studied, whereas multiview video and 3-D meshes have been addressed only recently. Intellectual property protection of 3-D data by watermarking is a pioneering research area as well. The 3-D watermarking methods in the literature are classified into three groups, considering the dimensions of the main components of scene representations and the resulting components after applying the algorithm. In general, 3DTV coding technology is maturating. Systems and services may enter the market in the near future. However, the research area is relatively young compared to coding of other types of media. Therefore, there is still a lot of room for improvement and new development of algorithms.


Column level two-step multi-slope analog to digital converter for CMOS image sensors
Tunca, Can; Koçer, Fatih; Department of Electrical and Electronics Engineering (2017)
In the past few years, CMOS image sensors has performed an enormous growth in technology and their market is broadened with the integration cameras on the cell phones. The advancement trend continues as the pixel sizes getting smaller and the array formats getting larger. With pixels decreasing in size and growing in numbers, faster row read-out speed requirements have emerged to keep frame rates constant. Column parallel ADC architectures meet these demands as they utilize large numbers of parallel convers...
End-to-end stereoscopic video streaming system
Pehlivan, Selen; Aksay, Anil; Bilen, Cagdas; Akar, Gözde; Civanlar, M. Reha (2006-07-12)
Today, stereoscopic and multi-view video are among the popular research areas in the multimedia world. In this study, we have designed and built a platform consisting of stereo-view capturing, real-time transmission and display. At the display stage, end users view video in 3D by using polarized glasses. Multi-view video is compressed in an efficient way by using multi-view video coding techniques and streamed using standard real-time transport protocols. The entire system is built by modifying available op...
FPGA implementation of real time digital video super resolution for infrared cameras
Aktukmak, Mehmet; Halıcı, Uğur; Department of Electrical and Electronics Engineering (2013)
At present, the quality of image taken from infrared cameras is low compared to the other cameras because of manufacturing technology. So, resolution enhancement processes are becoming more important for these cameras. Super resolution is a good approach to solve this resolution problem. In general, the systems that infrared cameras used require video processing to perform in real time. So, a suitable approach should be selected and implemented to work in real time. The computational load and processing tim...
3D Object Modeling by Structured Light and Stereo Vision
Ozenc, Ugur; Tastan, Oguzhan; GÜLLÜ, MEHMET KEMAL (2015-05-19)
In this paper, we demonstrate a 3D object modeling system utilizing a setup which consists of two CMOS cameras and a DLP projector by making use of structured light and stereo vision. The calibration of the system is carried out using calibration pattern. The images are taken with stereo camera pair by projecting structured light onto the object and the correspondence problem is solved by both epipolar constraint of stereo vision and gray code constraint of structured light. The first experimental results s...
Superpixel based efficient image representation for segmentation and classification
Taşlı, Hüseyin Emrah; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2013)
The wide availability of visual capture and display devices with increasing resolution and a ordable prices, made the visual data an indispensable part of our life. The enormous amount of visual data produced every day is captured, stored and sometimes processed for further analysis. In this era of technological improvement, where an exponential increase in the number and capability of the devices is experienced, researchers have focused on e cient and accurate ways to reach, store, analyse and display the ...
Citation Formats
A. Smolic et al., “Coding algorithms for 3DTV - A survey,” IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, pp. 1606–1621, 2007, Accessed: 00, 2020. [Online]. Available: