Bi-directional 2-D mesh representation for video object rendering, editing and superresolution in the presence of occlusion

2003-05-01
Eren, Pekin Erhan
Tekalp, AM
In this paper, we propose a new bi-directional 2-D mesh representation of video objects, which utilizes forward and backward reference frames (keyframes). This framework extends the previous uni-directional mesh representation to enable efficient rendering, editing, and superresolution of video objects in the presence of occlusion by allowing bidirectional texture mapping as in MPEG B-frames. The video object of interest is tracked between two successive keyframes (which can be automatically or interactively selected) both in forward and backward directions. Keyframes provide the texture of the video object, whereas its motion is modeled by forward and backward 2-D meshes. In addition, we employ "validity maps", associated with each 2-D mesh, which allow selective texture mapping from the keyframes. Experimental results for efficient video object editing and object-based video resolution enhancement in the presence of self-occlusion are presented to demonstrate the effectiveness of the proposed representation. (C) 2002 Elsevier Science B.V. All rights reserved.
SIGNAL PROCESSING-IMAGE COMMUNICATION

Suggestions

Keyframe based bi directional 2 D mesh representation for video object tracking and manipulation
Eren, Pekin Erhan (1999-10-28)
We propose a new bi-directional 2-D mesh representation of video objects, which utilizes multiple keyframes with forward and backward tracking. Experimental results on use of this representation for video object tracking in the presence of self occlusion are presented.
Streaming Multiscale Deep Equilibrium Models
Ertenli, Can Ufuk; Akbaş, Emre; Cinbiş, Ramazan Gökberk (2022-1-01)
We present StreamDEQ, a method that infers frame-wise representations on videos with minimal per-frame computation. In contrast to conventional methods where compute time grows at least linearly with the network depth, we aim to update the representations in a continuous manner. For this purpose, we leverage the recently emerging implicit layer models, which infer the representation of an image by solving a fixed-point problem. Our main insight is to leverage the slowly changing nature of videos and use the...
VERTEX PARTITIONING BASED MULTIPLE DESCRIPTION CODING OF 3D DYNAMIC MESHES
Bici, M. Oguz; Stefanoski, Nikolce; Akar, Gözde (2009-05-06)
In this paper, we propose a Multiple Description Coding (MDC) method for reliable transmission of compressed time consistent 3D dynamic meshes. It trades off reconstruction quality for error resilience to provide the best expected reconstruction of 3D mesh sequence at the decoder side. The method is based on partitioning the mesh vertices into two sets and encoding each set independently by a 3D dynamic mesh coder. The encoded independent bitstreams or so-called descriptions are transmitted independently. T...
Multiple Description Coding of 3D Dynamic Meshes Based on Temporal Subsampling
Bici, M. Oguz; Akar, Gözde (2010-01-21)
In this paper, we propose a Multiple Description Coding (MDC) method for reliable transmission of compressed time consistent 3D dynamic meshes. It trades off reconstruction quality for error resilience to provide the best expected reconstruction of 3D mesh sequence at the decoder side. The method is based on partitioning the mesh frames into two sets by temporal subsampling and encoding each set independently by a 3D dynamic mesh coder. The encoded independent bitstreams or so-called descriptions are transm...
Recursive Prediction for Joint Spatial and Temporal Prediction in Video Coding
Kamışlı, Fatih (2014-06-01)
Video compression systems use prediction to reduce redundancies present in video sequences along the temporal and spatial dimensions. Standard video coding systems use either temporal or spatial prediction on a per block basis. If temporal prediction is used, spatial information is ignored. If spatial prediction is used, temporal information is ignored. This may be a computationally efficient approach, but it does not effectively combine temporal and spatial information. In this letter, we provide a framewo...
Citation Formats
P. E. Eren and A. Tekalp, “Bi-directional 2-D mesh representation for video object rendering, editing and superresolution in the presence of occlusion,” SIGNAL PROCESSING-IMAGE COMMUNICATION, vol. 18, no. 5, pp. 321–336, 2003, Accessed: 00, 2022. [Online]. Available: https://hdl.handle.net/11511/100181.