End-to-end stereoscopic video streaming with content-adaptive rate and format control

2007-02-01
Aksay, Anil
Pehlivan, Selen
Akar, Gözde
Bilen, Cagdas
OZCELEBİ, Tanir
Civanlar, M. Reha
Tekalp, A. Murat
We address efficient compression and real-time streaming of stereoscopic video over the current Internet. We first propose content-adaptive stereo video coding (CA-SC), where additional coding gain, over that can be achieved by exploiting only inter-view correlations, is targeted by clown-sampling one of the views spatially or temporally depending on the content, based on the well-known theory that the human visual system can perceive high frequencies in three-dimensional (3D) from the higher quality view. We also developed stereoscopic 3D video streaming server and clients by modifying available open source platforms, where each client can view the video in mono or stereo mode depending on its display capabilities. The performance of the end-to-end stereoscopic streaming system is demonstrated using subjective quality tests.
SIGNAL PROCESSING-IMAGE COMMUNICATION

Suggestions

Hybrid Fault Tolerant Peer to Peer Video Streaming Architecture
Oeztoprak, Kasim; Akar, Gözde (Institute of Electronics, Information and Communications Engineers (IEICE), 2008-11-01)
In this paper, we propose a fault tolerant hybrid p2p-CDN video streaming arhitecture to overcome the problems caused by peer behavior in peer-to-peer (P2P) video streaming systems. Although there are several studies modeling and analytically investigating peer behaviors in P2P video streaming systems, they do not COMe LIP with a solution to guarantee the required Quality of the Services (QoS). Therefore, in this study a hybrid geographical location-time and interest based clustering algorithm is proposed t...
Multiple description coding of animated meshes
Bici, M. Oguz; Akar, Gözde (Elsevier BV, 2010-11-01)
In this paper, we propose three novel multiple description coding (MDC) methods for reliable transmission of compressed animated meshes represented by series of 3D static meshes with same connectivity. The proposed methods trade off reconstruction quality for error resilience to provide the best expected reconstruction of 3D mesh sequence at the decoder side. The methods are based on layer duplication and partitioning of the set of vertices of a scalable coded animated mesh by either spatial or temporal sub...
Implementation of a distributed video codec
Işık, Cem Vedat; Akar, Gözde; Department of Electrical and Electronics Engineering (2008)
Current interframe video compression standards such as the MPEG4 and H.264, require a high-complexity encoder for predictive coding to exploit the similarities among successive video frames. This requirement is acceptable for cases where the video sequence to be transmitted is encoded once and decoded many times. However, some emerging applications such as video-based sensor networks, power-aware surveillance and mobile video communication systems require computational complexity to be shifted from encoder ...
Information permeability for stereo matching
Cigla, Cevahir; Alatan, Abdullah Aydın (Elsevier BV, 2013-10-01)
A novel local stereo matching algorithm is introduced to address the fundamental challenge of stereo algorithms, accuracy and computational complexity dilemma. The time consuming intensity dependent aggregation procedure of local methods is improved in terms of both speed and precision. Providing connected 2D support regions, the proposed approach exploits a new paradigm, namely separable successive weighted summation (SWS) among horizontal and vertical directions enabling constant operational complexity. T...
New method for the fusion of complementary information from infrared and visual images for object detection
Ulusoy, İlkay (Institution of Engineering and Technology (IET), 2011-02-01)
Visual and infrared cameras have complementary properties and using them together may increase the performance of object detection applications. Although the fusion of visual and infrared information results in a better recall rate than using only one of those domains, there is always a decrease in the precision rate whereas the infrared domain on its own always has higher precision. Thus, the fusion of these domains is meaningful only for a better recall rate, which means that more foreground pixels are de...
Citation Formats
A. Aksay et al., “End-to-end stereoscopic video streaming with content-adaptive rate and format control,” SIGNAL PROCESSING-IMAGE COMMUNICATION, pp. 157–168, 2007, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/46759.