Gibbs Model Based 3D Motion and Structure Estimation for Object-Based Video Coding Applications

1997-11-01
Alatan, Abdullah Aydın
Onural, Levent
Motion analysis is essential for any video coding scheme. A moving object in a 3D environment can be analyzed better by a 3D motion model than by 2D models, and better modeling might lead to improved coding efficiency. A Gibbs-formulated joint segmentation and estimation of 2D motion not only improves the performance of each stage, but also generates the robust point correspondences that rigid 3D motion estimation algorithms require. The estimated rigid 3D motion parameters of a segmented object are then used to find the 3D structure of that object by minimizing another Gibbs energy. Such an approach is more immune to errors than linear algorithms. A more general (non-rigid) motion model can also be proposed using the Gibbs formulation, which permits local elastic interactions between object points instead of strict rigidity. Experimental results are promising for both the rigid and non-rigid 3D motion models and put these models forward as strong candidates for use in object-based coding algorithms.
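To make the idea of "minimizing a Gibbs energy" concrete, the following is a minimal sketch and not the estimator used in this work. It assumes a toy energy with a squared-error data term and a Potts-style neighbor penalty over a small grid of scalar labels (standing in for motion values), and it assumes iterated conditional modes (ICM) as the minimizer. The names `gibbs_energy`, `icm`, and the weight `lam` are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: a toy Gibbs energy over a label field, minimized
# with iterated conditional modes (ICM). The energy terms and the optimizer
# are assumptions for illustration, not the formulation from the paper.

def neighbors(r, c, rows, cols):
    """Yield the 4-connected neighbors of site (r, c) that lie inside the grid."""
    for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        rr, cc = r + dr, c + dc
        if 0 <= rr < rows and 0 <= cc < cols:
            yield rr, cc

def gibbs_energy(field, observed, lam):
    """Data term (squared error) plus lam for every unequal neighbor pair."""
    rows, cols = len(field), len(field[0])
    energy = 0.0
    for r in range(rows):
        for c in range(cols):
            energy += (field[r][c] - observed[r][c]) ** 2
            # Count each neighbor pair once: look right and down only.
            if c + 1 < cols:
                energy += lam * (field[r][c] != field[r][c + 1])
            if r + 1 < rows:
                energy += lam * (field[r][c] != field[r + 1][c])
    return energy

def icm(observed, labels, lam, sweeps=5):
    """Greedy site-by-site minimization; the total energy never increases."""
    rows, cols = len(observed), len(observed[0])
    # Initialize each site with the label closest to its observation.
    field = [[min(labels, key=lambda l: (l - o) ** 2) for o in row]
             for row in observed]
    for _ in range(sweeps):
        changed = False
        for r in range(rows):
            for c in range(cols):
                def local(l):
                    # Energy contribution of assigning label l at site (r, c).
                    clash = sum(l != field[rr][cc]
                                for rr, cc in neighbors(r, c, rows, cols))
                    return (l - observed[r][c]) ** 2 + lam * clash
                best = min(labels, key=local)
                # Switch only on a strict improvement to avoid tie flipping.
                if local(best) < local(field[r][c]):
                    field[r][c] = best
                    changed = True
        if not changed:
            break
    return field

# A uniform field with one outlier in the center.
observed = [[1.0, 1.0, 1.0],
            [1.0, 3.0, 1.0],
            [1.0, 1.0, 1.0]]
result = icm(observed, labels=[0, 1, 2, 3], lam=1.5)
```

In this toy case the neighbor penalty outweighs the data term at the outlier, so the minimizer pulls the center site toward its neighbors. This is the kind of regularizing behavior a Gibbs prior provides, here in a much simpler setting than joint segmentation and motion estimation.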

Suggestions

Streaming Multiscale Deep Equilibrium Models
Ertenli, Can Ufuk; Akbaş, Emre; Cinbiş, Ramazan Gökberk (2022-1-01)
We present StreamDEQ, a method that infers frame-wise representations on videos with minimal per-frame computation. In contrast to conventional methods where compute time grows at least linearly with the network depth, we aim to update the representations in a continuous manner. For this purpose, we leverage the recently emerging implicit layer models, which infer the representation of an image by solving a fixed-point problem. Our main insight is to leverage the slowly changing nature of videos and use the...
GIBBS RANDOM FIELD MODEL BASED 3-D MOTION ESTIMATION BY WEAKENED RIGIDITY
Alatan, Abdullah Aydın (1994-01-01)
3-D motion estimation from a video sequence remains a challenging problem. Modelling the local interactions between the 3-D motion parameters is possible by using Gibbs random fields. An energy function, which gives the joint probability distribution of the motion vectors, is constructed. The most probable motion vector set is found by maximizing the probability represented by this distribution. Since the 3-D motion estimation problem is ill-posed, the regularization is achieved by an initial rigidity assum...
Dynamic system modeling and state estimation for speech signal
Özbek, İbrahim Yücel; Demirekler, Mübeccel; Department of Electrical and Electronics Engineering (2010)
This thesis presents an all-inclusive framework on how the current formant tracking and audio (and/or visual)-to-articulatory inversion algorithms can be improved. The possible improvements are summarized as follows: The first part of the thesis investigates the problem of formant frequency estimation when the number of formants to be estimated is fixed or variable, respectively. The fixed-number formant tracking method is based on the assumption that the number of formant frequencies is fixed along the ...
Comparison of whole scene image caption models
Görgülü, Tuğrul; Ulusoy, İlkay; Department of Electrical and Electronics Engineering (2021-2-10)
Image captioning, which automatically describes the content of an image using words and grammar, is one of the most challenging tasks in deep learning. In recent years, studies have been published constantly to improve the quality of this task. However, a detailed comparison of all possible approaches has not been done yet, so the comparative performance of the solutions proposed in the literature is unknown. Thus, this thesis aims to redress this problem by making a comparative analysis among six diff...
Improvement of Transform-Skip Mode in Lossy Intra Coding with 3-Tap Filters
Alvar, Saeed Ranjbar; Kamışlı, Fatih (2016-08-05)
Using transforms in video coding is an effective method for reducing spatial redundancy. However, in some cases applying transforms does not reduce the spatial redundancy. In these cases, transforms are skipped and the prediction error is directly quantized and then entropy coded. To further reduce the spatial redundancy in the transform-skipped blocks, a pixel-by-pixel lossy intra prediction method based on a two-dimensional correlation model is proposed in this paper. In the proposed method, three re...
Citation Formats
A. A. Alatan and L. Onural, Gibbs Model Based 3D Motion and Structure Estimation for Object-Based Video Coding Applications. 1997.