Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters

Download
2014-08-01
Sener, Ozan
Ugur, Kemal
Alatan, Abdullah Aydın
Segmentation of an object from a video is a challenging task in multimedia applications. Depending on the application, automatic or interactive methods are desired; however, regardless of the application type, efficient computation of video object segmentation is crucial for time-critical applications; specifically, mobile and interactive applications require near real-time efficiencies. In this paper, we address the problem of video segmentation from the perspective of efficiency. We initially redefine the problem of video object segmentation as the propagation of MRF energies along the temporal domain. For this purpose, a novel and efficient method is proposed to propagate MRF energies throughout the frames via bilateral filters without using any global texture, color or shape model. Recently presented bi-exponential filter is utilized for efficiency, whereas a novel technique is also developed to dynamically solve graph-cuts for varying, non-lattice graphs in general linear filtering scenario. These improvements are experimented for both automatic and interactive video segmentation scenarios. Moreover, in addition to the efficiency, segmentation quality is also tested both quantitatively and qualitatively. Indeed, for some challenging examples, significant time efficiency is observed without loss of segmentation quality.
IEEE TRANSACTIONS ON MULTIMEDIA

Suggestions

Extended Target Tracking Using Polynomials With Applications to Road-Map Estimation
Lundquist, Christian; Orguner, Umut; Gustafsson, Fredrik (Institute of Electrical and Electronics Engineers (IEEE), 2011-01-01)
This paper presents an extended target tracking framework which uses polynomials in order to model extended objects in the scene of interest from imagery sensor data. State-space models are proposed for the extended objects which enables the use of Kalman filters in tracking. Different methodologies of designing measurement equations are investigated. A general target tracking algorithm that utilizes a specific data association method for the extended targets is presented. The overall algorithm must always ...
Using multi-modal 3D contours and their relations for vision and robotics
BAŞESKİ, Emre; Pugeault, Nicolas; Kalkan, Sinan; BODENHAGEN, Leon; Piater, Justus H.; KRÜGER, Norbert (Elsevier BV, 2010-11-01)
In this work, we make use of 3D contours and relations between them (namely, coplanarity, cocolority, distance and angle) for four different applications in the area of computer vision and vision-based robotics. Our multi-modal contour representation covers both geometric and appearance information. We show the potential of reasoning with global entities in the context of visual scene analysis for driver assistance, depth prediction, robotic grasping and grasp learning. We argue that, such 3D global reasoni...
Efficient detection and tracking of salient regions for visual processing on mobile platforms
Serhat, Gülhan; Saranlı, Afşar; Department of Electrical and Electronics Engineering (2009)
Visual Attention is an interesting concept that constantly widens its application areas in the field of image processing and computer vision. The main idea of visual attention is to find the locations on the image that are visually attractive. In this thesis, the visually attractive regions are extracted and tracked in video sequences coming from the vision systems of mobile platforms. First, the salient regions are extracted in each frame and a feature vector is constructed for each one. Then Scale Invaria...
Fast intra/inter mode decision for a real-time H.264 streaming system
Alay, Özgü; Akar, Gözde; Department of Electrical and Electronics Engineering (2006)
Video compression is a key technology used in several multimedia applications. Improvements in the compression techniques together with the increasing speed and optimized architecture of the new family processors enable us to use this technology more in real time systems. H.264 (also known as MPEG-4 Part 10 or AVC - Advanced Video Coding), is the latest video coding standard which is noted for achieving very high data compression. While H.264 is superior to its predecessors, it has a very high computational...
Implementation of a distributed video codec
Işık, Cem Vedat; Akar, Gözde; Department of Electrical and Electronics Engineering (2008)
Current interframe video compression standards such as the MPEG4 and H.264, require a high-complexity encoder for predictive coding to exploit the similarities among successive video frames. This requirement is acceptable for cases where the video sequence to be transmitted is encoded once and decoded many times. However, some emerging applications such as video-based sensor networks, power-aware surveillance and mobile video communication systems require computational complexity to be shifted from encoder ...
Citation Formats
O. Sener, K. Ugur, and A. A. Alatan, “Efficient MRF Energy Propagation for Video Segmentation via Bilateral Filters,” IEEE TRANSACTIONS ON MULTIMEDIA, pp. 1292–1302, 2014, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/42877.