A Text-Based Fully Automated Architecture for the Semantic Annotation and Retrieval of Turkish News Videos

2010-07-23
Kucuk, Dilek
Yazıcı, Adnan
Video texts are known to constitute an important source of information for semantic summaries of video archives. In this study, we propose a fully automated architecture for semantic annotation and later retrieval of Turkish news videos based on the corresponding video texts. At the core of the architecture is a named entity recognizer, the output of which on video texts is used as semantic annotations for the corresponding videos. The architecture also comprises components for news story segmentation, sliding text recognition, and video retrieval in addition to a news video database. The news story segmentation module makes use of the audio waveforms of the raw video files to detect the boundaries of individual news stories. The sliding text recognizer is then executed on the video segments corresponding to these news stories to extract their texts. The texts are then fed into the named entity recognizer for Turkish news texts to extract the named entities which are to be used as semantic annotations or index terms for the retrieval of these news videos. Finally, the retrieval interface of the overall architecture enables access to the annotated videos and video segments through boolean queries formed by using the previously extracted named entities. This study is significant for its proposing the first fully automated architecture for the semantic annotation and retrieval of Turkish news video archives.
2010 IEEE World Congress on Computational Intelligence

Suggestions

A semi-automatic text-based semantic video annotation system for Turkish facilitating multilingual retrieval
Kucuk, Dilek; Yazıcı, Adnan (2013-07-01)
It is commonly acknowledged that ever-increasing video archives should be conveniently indexed with the conveyed semantic information to facilitate later video retrieval. Domain-independent semantic video indexing is usually carried out through manual means which is too time-consuming and labor-intensive to be employed in practical settings. On the other hand, fully automated approaches are usually proposed for very specialized domains such as team sports videos. In this paper, we propose a generic text-bas...
Employing Named Entities for Semantic Retrieval of News Videos in Turkish
Kucuk, Dilek; Yazıcı, Adnan (2009-09-16)
Named entities are known to be important means for semantic annotation of news texts. Considerable work has been carried out for semantic indexing of both textual news and news videos especially in English through the employment of named entities extracted from textual news or transcriptions of the news videos. In this paper, we present our semantic retrieval architecture for news videos in Turkish based on prior semantic annotation of the videos with the corresponding named entities in the news transcripti...
Efficient Multimedia Information Retrieval with Query Level Fusion
Sattari, Saeid; Yazıcı, Adnan (2015-10-28)
Multimedia data particularly digital videos that contain various modalities (visual, audio, and text) are complex and time consuming to deal with. Therefore, managing a large volume of multimedia data reveals the necessity for efficient methods for modeling, processing, storing and retrieving such data. In this study, we investigate how to efficiently manage multimedia data, especially video data. In addition, we discuss various flexible query types including the combination of content as well as concept-ba...
A TV Content Augmentation System Exploiting Rule Based Named Entity Recognition Method
Isiklar, Yunus Emre; Cicekli, Nihan (2015-09-24)
This paper presents a TV content augmentation system that enhances the contents of TVprograms by retrieving context related data and presenting them to the viewers without the necessity of another device. The paper presents both the conceptual description of the system and a prototype implementation. The implementation utilizes program descriptions crawled from web resources in order to extract named entities such as person names, locations, organizations, etc. For this purpose, a rule based Named Entity Re...
Utilization of texture, contrast and color homogeneity for detecting and recognizing text from video frames
Tekinalp, S; Alatan, Abdullah Aydın (2003-09-17)
It is possible to index and manage large video archives in a more efficient manner by detecting and recognizing text within video frames. There are some inherent properties of videotext, such as distinguishing texture, higher contrast against background, and uniform color, making it detectable. By employing these properties, it is possible to detect text regions and binarize the image for character recognition. In this paper, a complete framework for detection and recognition of videotext is presented. The ...
Citation Formats
D. Kucuk and A. Yazıcı, “A Text-Based Fully Automated Architecture for the Semantic Annotation and Retrieval of Turkish News Videos,” presented at the 2010 IEEE World Congress on Computational Intelligence, Barcelona, Spain, 2010, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/54871.