Intra prediction with 3-tap filters for lossless and lossy video coding

Ranjbar Alvar, Saeed
Video coders are primarily designed for lossy compression. The basic steps in modern lossy video compression are block-based spatial or temporal prediction, transformation of the prediction error block, quantization of the transform coefficients and entropy coding of the quantized coefficients together with other side information. In some cases, this lossy coding architecture may not be efficient for compression. For example, when lossless video compression is desirable, the transform and quantization steps are skipped. Or in lossy compression of synthetic video content (such as animations), the transform may be skipped for some of the blocks and the prediction error is quantized and entropy coded in those blocks. In these cases, the block-based spatial prediction (called intra prediction) cannot sufficiently decorrelate the pixels by itself and large prediction errors become more frequent. For the cases where the transform is skipped, the block-based prediction can be replaced with a more accurate pixel-by-pixel prediction since the original/reconstructed neighboring pixels inside the block will be readily available due to the lack of transform. This thesis explores pixel-by-pixel prediction methods based on 3-tap filtering which use three neighboring pixels for prediction according to a two-dimensional correlation model. Two of the proposed methods are designed for lossless intra coding, one with offline determined prediction weights and the other with online determined adaptive weights. The third proposed method uses the 3-tap filtering method for the transform skipped blocks in lossy intra coding. The proposed methods are implemented within the HEVC reference software and the experimental results indicate that the pixel-by-pixel spatial prediction method based on 3-tap filtering can improve the compression efficiency for both lossless and lossy coding. 


Lossless Image and Intra-Frame Compression With Integer-to-Integer DST
Kamışlı, Fatih (2019-02-01)
Video coding standards are primarily designed for efficient lossy compression, but it is also desirable to support efficient lossless compression within video coding standards using small modifications to the lossy coding architecture. A simple approach is to skip transform and quantization, and simply entropy code the prediction residual. However, this approach is inefficient at compression. A more efficient and popular approach is to skip transform and quantization but also process the residual block in s...
Visibility grid method for efficient crowd rendering wirh shadows
Koçdemir, Şahin Serdar; İşler, Veysi; Department of Modeling and Simulation (2012)
Virtual crowd rendering have been used in film industry with offine rendering methods for a long time. But its existence in interactive real-time applications such as video games is not so common due to the limited rendering power of current graphics hardware. This thesis describes a novel method to improve shadow mapping performance of a crowded scene by taking into account the screen space visibility of the casted shadow of a crowd instance when rendering the shadow maps. A grid-based visibility mask crea...
Joint utilization of fixed and variable-length codes for improving synchronization immunity for image transmission
Alatan, Abdullah Aydın (1998-01-01)
Robust transmission of images is achieved by using fixed and variable-length coding together without much loss in compression efficiency. The probability distribution function of a DCT coefficient can be divided into two regions using a threshold; so that one portion contains roughly equiprobable transform coefficients. While fixed-length coding, which is a powerful solution to the synchronization problem, is used in this inner equiprobable region without sacrificing compression, the outer (saturating) regi...
End-to-end learned image compression with normalizing flows for latent space enhancement
Yavuz, Fatih; Kamışlı, Fatih; Department of Electrical and Electronics Engineering (2022-9)
Learning based methods for image compression recently received considerable attention and demonstrated promising performance, surpassing many commonly used codecs. Architectures of learning based methodologies are typically comprised of a nonlinear analysis transform, which maps the input image to a latent representation, a synthesis transform that maps the quantized latent representation back to the image domain and a model for the probability distribution of the latent representation. Successful modelling...
Video segmentation using partially decoded MPEG bitstream
Kayaalp, Işıl Burcun; Akar, Gözde; Department of Electrical and Electronics Engineering (2003)
In this thesis, a mixed type video segmentation algorithm is implemented to find the scene cuts in MPEG compressed video data. The main aim is to have a computationally efficient algorithm for real time applications. Due to this reason partial decoding of the bitstream is used in segmentation. As a result of partial decoding, features such as bitrate, motion vector type, and DC images are implemented to find both continuous and discontinuous scene cuts on a MPEG-2 coded general TV broadcast data. The result...
Citation Formats
S. Ranjbar Alvar, “Intra prediction with 3-tap filters for lossless and lossy video coding,” M.S. - Master of Science, Middle East Technical University, 2016.