Automatic Semantic Content Extraction in Videos Using a Fuzzy Ontology and Rule-Based Model

2013-01-01
Yildirim, Yakup
Yazıcı, Adnan
Yilmaz, Turgay
Recent increase in the use of video-based applications has revealed the need for extracting the content in videos. Raw data and low-level features alone are not sufficient to fulfill the user's needs; that is, a deeper understanding of the content at the semantic level is required. Currently, manual techniques, which are inefficient, subjective and costly in time and limit the querying capabilities, are being used to bridge the gap between low-level representative features and high-level semantic content. Here, we propose a semantic content extraction system that allows the user to query and retrieve objects, events, and concepts that are extracted automatically. We introduce an ontology-based fuzzy video semantic content model that uses spatial/temporal relations in event and concept definitions. This metaontology definition provides a wide-domain applicable rule construction standard that allows the user to construct an ontology for a given domain. In addition to domain ontologies, we use additional rule definitions (without using ontology) to lower spatial relation computation cost and to be able to define some complex situations more effectively. The proposed framework has been fully implemented and tested on three different domains. We have obtained satisfactory precision and recall rates for object, event and concept extraction.
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

Suggestions

Automatic semantic content extraction in videos using a spatio-temporal ontology model
Yıldırım, Yakup; Yazıcı, Adnan; Department of Computer Engineering (2009)
Recent increase in the use of video in many applications has revealed the need for extracting the content in videos. Raw data and low-level features alone are not sufficient to fulfill the user's need; that is, a deeper understanding of the content at the semantic level is required. Currently, manual techniques are being used to bridge the gap between low-level representative features and high-level semantic content, which are inefficient, subjective and costly in time and have limitations on querying capab...
An ontology-based multimedia information management system
Tarakçı, Hilal; Çiçekli, Fehime Nihan; Department of Computer Engineering (2008)
In order to manage the content of multimedia data, the content must be annotated. Although any user-defined annotation is acceptable, it is preferable if systems agree on the same annotation format. MPEG-7 is a widely accepted standard for multimedia content annotation. However, in MPEG-7, semantically identical metadata can be represented in multiple ways due to lack of precise semantics in its XML-based syntax. Unfortunately this prevents metadata interoperability. To overcome this problem, MPEG-7 standar...
Fusing semantic information extracted from visual, auditory and textual data of videos
Gönül, Elvan; Yazıcı, Adnan; Department of Computer Engineering (2012)
In recent years, due to the increasing usage of videos, manual information extraction is becoming insufficient to users. Therefore, extracting semantic information automatically turns out to be a serious requirement. Today, there exists some systems that extract semantic information automatically by using visual, auditory and textual data separately but the number of studies that uses more than one data source is very limited. As some studies on this topic have already shown, using multimodal video data for...
ENHANCED SPATIO-TEMPORAL VIDEO COPY DETECTION BY COMBINING TRAJECTORY AND SPATIAL CONSISTENCY
Ozkan, Savas; Esen, Ersin; Akar, Gözde (2014-10-30)
The recent improvements on internet technologies and video coding techniques cause an increase in copyright infringements especially for video. Frequently, image-based approaches appear as an essential solution due to the fact that joint usage of quantization-based indexing and weak geometric consistency stages give a capability to compare duplicate videos quickly. However, exploiting purely spatial content ignores the temporal variation of video. In this work, we propose a system that combines the state-of...
Automatic image annotation by ensemble of visual descriptors
Akbaş, Emre; Yarman Vural, Fatoş Tunay; Department of Computer Engineering (2006)
Automatic image annotation is the process of automatically producing words to de- scribe the content for a given image. It provides us with a natural means of semantic indexing for content based image retrieval. In this thesis, two novel automatic image annotation systems targeting dierent types of annotated data are proposed. The rst system, called Supervised Ensemble of Visual Descriptors (SEVD), is trained on a set of annotated images with predened class labels. Then, the system auto- matically annotates...
Citation Formats
Y. Yildirim, A. Yazıcı, and T. Yilmaz, “Automatic Semantic Content Extraction in Videos Using a Fuzzy Ontology and Rule-Based Model,” IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, pp. 47–61, 2013, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/46709.