Feasible Local Content Representation For Image İn Video Search On Large Video Collection

2016-05-16
Özkan, Savaş
Esen, Ersin
Akar, Gözde
In this paper, we tackle content-based image-in-video search on large video archive. Particularly in this problem, the computation cost of a proposed method is another important parameter that should be considered alongside of the success rate achieved due to the high volume of data. With our proposed method in this paper, content representation of multimedia data is achieved quite fast compared to other local approaches. Additionally, approximately %16 improvement is introduced at the success rate with respect to the method that uses global approach for content representation on Stanford I2V dataset.
2016 24th Signal Processing and Communication Application Conference (SIU)

Suggestions

Multimodal video database modeling, querying and browsing
Durak, N; Yazıcı, Adnan (2005-01-01)
In this paper, a multimodal video indexing and retrieval system, MMVIRS, is presented. MMVIRS models the auditory, visual, and textual sources of video collections from a semantic perspective. Besides multimodality, our model is constituted on semantic hierarchies that enable us to access the video from different semantic levels. MMVIRS has been implemented with data annotation, querying and browsing parts. In the annotation part, metadata information and video semantics are extracted in hierarchical ways. ...
Optical flow based video frame segmentation and segment classification
Akpınar, Samet; Alpaslan, Ferda Nur; Department of Computer Engineering (2018)
Video information retrieval is a field of multimedia research enabling us to extract desired semantic information from video data. In content-based video information retrieval, visual content obtained from video scenes is utilized. For developing methods to cope with content-based video information retrieval in terms of temporal concepts such as action, event, etc., representation of temporal information becomes critical. In this thesis, action detection is tackled based on a temporal video representation m...
QUALITY EVALUATION OF STEREOSCOPIC VIDEOS USING DEPTH MAP SEGMENTATION
Sarikan, Selim S.; Olgun, Ramazan F.; Akar, Gözde (2011-09-09)
This paper presents a new quality evaluation model for stereoscopic videos using depth map segmentation. This study includes both objective and subjective evaluation. The goal of this study is to understand the effect of different depth levels on the overall 3D quality. Test sequences with different coding schemes are used. The results show that overall quality has a strong correlation with the quality of the background, where disparity is smaller relative to the foreground. The results also showed that con...
Flexible Content Extraction and Querying for Videos
Demir, Utku; KOYUNCU, Murat; Yazıcı, Adnan; Yilmaz, Turgay; SERT, MUSTAFA (2011-10-28)
In this study, a multimedia database system which includes a semantic content extractor, a high-dimensional index structure and an intelligent fuzzy object-oriented database component is proposed. The proposed system is realized by following a component-oriented approach. It supports different flexible query capabilities for the requirements of video users, which is the main focus of this paper. The query performance of the system (including automatic semantic content extraction) is tested and analyzed in t...
Online annotation of faces in personal videos by sequential learning
Yilmazturk, M. C.; Ulusoy, İlkay; Çiçekli, Fehime Nihan (2013-04-01)
This paper addresses semi-automatic annotation of faces in personal videos. Different from previous offline annotation systems, this paper studies online annotation of faces. During an annotation session, few annotations are requested from the user only for some part of the video online. These annotations are used to train a system that will perform annotation automatically for the rest of the video. The automatic annotation results are presented to the user during the same session and the user is allowed t...
Citation Formats
S. Özkan, E. Esen, and G. Akar, “Feasible Local Content Representation For Image İn Video Search On Large Video Collection,” presented at the 2016 24th Signal Processing and Communication Application Conference (SIU), Zonguldak, Turkey, 2016, Accessed: 00, 2021. [Online]. Available: https://hdl.handle.net/11511/79788.