Estimation of depth fields suitable for video compression based on 3-D structure and motion of objects

Download
1998-6
Intensity prediction along motion trajectories removes temporal redundancy considerably in video compression algorithms. In three-dimensional (3-D) object-based video coding, both 3-D motion and depth values are required for temporal prediction. The required 3-D motion parameters for each object are found by the correspondence-based E-matrix method. The estimation of the correspondences-two-dimensional (2-D) motion field-between the frames and segmentation of the scene into objects are achieved simultaneously by minimizing a Gibbs energy. The depth field is estimated by jointly minimizing a defined distortion and bit-rate criterion using the 3-D motion parameters. The resulting depth field is efficient in the rate-distortion sense, Bit-rate values corresponding to the lossless encoding of the resultant depth fields are obtained using predictive coding; prediction errors are encoded by a Lempel-Ziv algorithm. The results are satisfactory for real-life video scenes.
IEEE Transactions on Image Processing

Suggestions

Recursive Prediction for Joint Spatial and Temporal Prediction in Video Coding
Kamışlı, Fatih (2014-06-01)
Video compression systems use prediction to reduce redundancies present in video sequences along the temporal and spatial dimensions. Standard video coding systems use either temporal or spatial prediction on a per block basis. If temporal prediction is used, spatial information is ignored. If spatial prediction is used, temporal information is ignored. This may be a computationally efficient approach, but it does not effectively combine temporal and spatial information. In this letter, we provide a framewo...
Estimation of rotation between two frames of a scene
Atalay, Mehmet Volkan (1998-04-14)
An algorithm is proposed in order to estimate the rotation between two frames of a scene. Only linear segments and their geometric attributes are used and the algorithm is based on the correlation of the slope angle histograms of linear segments extracted from two frames. The basic idea is that from one frame to the other, if the camera rotates theta(r) degrees, linear segments will also be rotated by theta(r) degrees. In order to alleviate computational complexity a coarse to fine approach is proposed. The...
3-D motion estimation of rigid objects for video coding applications using an improved iterative version of the E-matrix method
Alatan, Abdullah Aydın (1998-02-01)
As an alternative to current two-dimensional (2-D) motion models, a robust three-dimensional (3-D) motion estimation method is proposed to be utilized in object-based video coding applications, Since the popular E-matrix method is well known for its susceptibility to input errors, a performance indicator, which tests the validity of the estimated 3-D motion parameters both explicitly and implicitly, is defined. This indicator is utilized within the RANSAC method to obtain a robust set of 2-D motion correspo...
Compensation of Dead-time Effects in Three-level Neutral Point Clamped Inverters based on Space Vector PWM
Mese, Huseyin; Ersak, Aydın (2011-09-10)
hi this study the effects of dead-time in three-level neutral-point-clamped (NYC) inverter is analyzed and a compensation algorithm based on space vector pulse width modulation (PWJI) is proposed. In three phase inverter applications, different modulation techniques are utilized depending on the application. Among these, space vector pulse width modulation (SVPWM) technique is the modulation technique with which best bus bar utilization is achieved. This modulation technique is also applicable to three-leve...
OPTIMIZATION OF ENCODING AND ERROR PROTECTION PARAMETERS FOR 3D VIDEO BROADCAST OVER DVB-H
Aksay, Anil; Bugdayci, Done; Akar, Gözde (2011-05-18)
In this study, we propose a heuristic methodology for modeling the end-to-end distortion characteristics of an error resilient broadcast system for 3D video overDigital Video Broadcasting -Handheld (DVB-H). We also use this model to optimally select the parameters of the video encoder and the error correction scheme, namely, Multi Protocol Encapsulation Forward Error Correction (MPE-FEC), minimizing the overall distortion. The proposed method models the RQ curve of video encoder and performance of channel c...
Citation Formats
A. A. Alatan, “Estimation of depth fields suitable for video compression based on 3-D structure and motion of objects,” IEEE Transactions on Image Processing, pp. 904–908, 1998, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/28567.