A semi-automatic text-based semantic video annotation system for Turkish facilitating multilingual retrieval

Kucuk, Dilek
Yazıcı, Adnan
It is commonly acknowledged that ever-increasing video archives should be indexed with the semantic information they convey to facilitate later video retrieval. Domain-independent semantic video indexing is usually carried out manually, which is too time-consuming and labor-intensive for practical settings. Fully automated approaches, on the other hand, are usually proposed for very specialized domains such as team sports videos. In this paper, we propose a generic text-based semi-automatic system for off-line semantic indexing and retrieval of news videos, since video texts such as speech transcripts are a plausible source of semantic information. The proposed system has a pipelined flow of execution in which the sole manual intervention takes place during text extraction; it can run in fully automated mode if the associated video text is already available or a suitable text extractor can be incorporated into the system. At the core of the system is an information extraction component, a named entity recognizer, which extracts representative semantic information from the video texts. Based on the proposed generic system, a novel semantic annotation and retrieval system for Turkish is designed, implemented, and evaluated on two distinct news video data sets. By equipping it with the necessary components, the final system is also turned into a multilingual video retrieval system and executed on a video data set in English, thereby facilitating multilingual semantic video retrieval.
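The pipelined flow described in the abstract (video text in, named entities out, entities used as annotations for later retrieval) can be illustrated with a minimal sketch. This is not the authors' implementation: the gazetteer lookup is a toy stand-in for a real named entity recognizer, and the names `GAZETTEER`, `Video`, `recognize_entities`, `annotate`, and `retrieve` are hypothetical, introduced only for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass, field

# Hypothetical toy gazetteer; a real recognizer would be far richer.
GAZETTEER = {
    "ankara": "LOCATION",
    "istanbul": "LOCATION",
    "unicef": "ORGANIZATION",
}


@dataclass
class Video:
    video_id: str
    text: str  # video text, e.g. a speech transcript
    entities: set = field(default_factory=set)


def recognize_entities(text):
    """Toy stand-in for a named entity recognizer: per-token gazetteer lookup."""
    found = set()
    for token in text.lower().split():
        token = token.strip(".,;:!?\"'()")
        if token in GAZETTEER:
            found.add((token, GAZETTEER[token]))
    return found


def annotate(videos):
    """Annotate each video with its entities and build an entity -> video ids index."""
    index = defaultdict(set)
    for video in videos:
        video.entities = recognize_entities(video.text)
        for name, _entity_type in video.entities:
            index[name].add(video.video_id)
    return index


def retrieve(index, query):
    """Return the ids of videos annotated with the queried entity name, sorted."""
    return sorted(index.get(query.lower(), set()))
```

In this sketch, annotation happens once, off-line, and retrieval is a plain index lookup; the text extraction step (the manual part of the described system) is assumed to have already produced `Video.text`.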


A Text-Based Fully Automated Architecture for the Semantic Annotation and Retrieval of Turkish News Videos
Kucuk, Dilek; Yazıcı, Adnan (2010-07-23)
Video texts are known to constitute an important source of information for semantic summaries of video archives. In this study, we propose a fully automated architecture for semantic annotation and later retrieval of Turkish news videos based on the corresponding video texts. At the core of the architecture is a named entity recognizer, the output of which on video texts is used as semantic annotations for the corresponding videos. The architecture also comprises components for news story segmentation, slid...
An ontology-driven video annotation and retrieval system
Demirdizen, Goncagül; Çiçekli, Fehime Nihan; Department of Computer Engineering (2010)
In this thesis, a system called the Ontology-Driven Video Annotation and Retrieval System (OntoVARS) is developed to provide a video management system for ontology-driven semantic content annotation and querying. The proposed system is based on the MPEG-7 ontology, which provides interoperability and a common communication platform with other MPEG-7 ontology compatible systems. The Rhizomik MPEG-7 ontology is used as the core ontology and domain specific ontologies are integrated to the core on...
An ontology-based multimedia information management system
Tarakçı, Hilal; Çiçekli, Fehime Nihan; Department of Computer Engineering (2008)
To manage the content of multimedia data, the content must be annotated. Although any user-defined annotation is acceptable, it is preferable if systems agree on the same annotation format. MPEG-7 is a widely accepted standard for multimedia content annotation. However, in MPEG-7, semantically identical metadata can be represented in multiple ways due to the lack of precise semantics in its XML-based syntax. Unfortunately, this prevents metadata interoperability. To overcome this problem, MPEG-7 standar...
Semi-automatic semantic video annotation tool
Aydınlılar, Merve; Yazıcı, Adnan; Department of Computer Engineering (2011)
Semantic annotation of video content is necessary for the indexing and retrieval tasks of video management systems. Currently, it is not possible to extract all high-level semantic information from video data automatically. Video annotation tools help users generate annotations that represent video data. Generated annotations can also be used for testing and evaluating content-based retrieval systems. In this study, a semi-automatic semantic video annotation tool is presented. Generated annotations are in...
A hybrid named entity recognizer for Turkish
Kucuk, Dilek; Yazıcı, Adnan (2012-02-15)
Named entity recognition is an important subfield of the broader research area of information extraction from textual data. Yet, named entity recognition research conducted on Turkish texts is still rare compared to related research carried out on other languages such as English, Spanish, Chinese, and Japanese. In this study, we present a hybrid named entity recognizer for Turkish, which is based on a manually engineered rule-based recognizer that we have proposed. Since rule-based systems for specific d...
Citation Formats
D. Kucuk and A. Yazıcı, “A semi-automatic text-based semantic video annotation system for Turkish facilitating multilingual retrieval,” Expert Systems with Applications, pp. 3398–3411, 2013, Accessed: 2020. [Online]. Available: https://hdl.handle.net/11511/47200.