A Text-Based Fully Automated Architecture for the Semantic Annotation and Retrieval of Turkish News Videos

Kucuk, Dilek
Yazıcı, Adnan
Video texts are known to constitute an important source of information for semantic summaries of video archives. In this study, we propose a fully automated architecture for semantic annotation and later retrieval of Turkish news videos based on the corresponding video texts. At the core of the architecture is a named entity recognizer, the output of which on video texts is used as semantic annotations for the corresponding videos. The architecture also comprises components for news story segmentation, sliding text recognition, and video retrieval in addition to a news video database. The news story segmentation module makes use of the audio waveforms of the raw video files to detect the boundaries of individual news stories. The sliding text recognizer is then executed on the video segments corresponding to these news stories to extract their texts. The texts are then fed into the named entity recognizer for Turkish news texts to extract the named entities which are to be used as semantic annotations or index terms for the retrieval of these news videos. Finally, the retrieval interface of the overall architecture enables access to the annotated videos and video segments through boolean queries formed by using the previously extracted named entities. This study is significant for its proposing the first fully automated architecture for the semantic annotation and retrieval of Turkish news video archives.
2010 IEEE World Congress on Computational Intelligence


A semi-automatic text-based semantic video annotation system for Turkish facilitating multilingual retrieval
Kucuk, Dilek; Yazıcı, Adnan (2013-07-01)
It is commonly acknowledged that ever-increasing video archives should be conveniently indexed with the conveyed semantic information to facilitate later video retrieval. Domain-independent semantic video indexing is usually carried out through manual means which is too time-consuming and labor-intensive to be employed in practical settings. On the other hand, fully automated approaches are usually proposed for very specialized domains such as team sports videos. In this paper, we propose a generic text-bas...
Employing Named Entities for Semantic Retrieval of News Videos in Turkish
Kucuk, Dilek; Yazıcı, Adnan (2009-09-16)
Named entities are known to be important means for semantic annotation of news texts. Considerable work has been carried out for semantic indexing of both textual news and news videos especially in English through the employment of named entities extracted from textual news or transcriptions of the news videos. In this paper, we present our semantic retrieval architecture for news videos in Turkish based on prior semantic annotation of the videos with the corresponding named entities in the news transcripti...
A TV Content Augmentation System Exploiting Rule Based Named Entity Recognition Method
Isiklar, Yunus Emre; Cicekli, Nihan (2015-09-24)
This paper presents a TV content augmentation system that enhances the contents of TVprograms by retrieving context related data and presenting them to the viewers without the necessity of another device. The paper presents both the conceptual description of the system and a prototype implementation. The implementation utilizes program descriptions crawled from web resources in order to extract named entities such as person names, locations, organizations, etc. For this purpose, a rule based Named Entity Re...
Utilization of texture, contrast and color homogeneity for detecting and recognizing text from video frames
Tekinalp, S; Alatan, Abdullah Aydın (2003-09-17)
It is possible to index and manage large video archives in a more efficient manner by detecting and recognizing text within video frames. There are some inherent properties of videotext, such as distinguishing texture, higher contrast against background, and uniform color, making it detectable. By employing these properties, it is possible to detect text regions and binarize the image for character recognition. In this paper, a complete framework for detection and recognition of videotext is presented. The ...
Exploiting information extraction techniques for automatic semantic annotation and retrieval of news videos in Turkish
Küçük, Dilek; Yazıcı, Adnan; Department of Computer Engineering (2011)
Information extraction (IE) is known to be an effective technique for automatic semantic indexing of news texts. In this study, we propose a text-based fully automated system for the semantic annotation and retrieval of news videos in Turkish which exploits several IE techniques on the video texts. The IE techniques employed by the system include named entity recognition, automatic hyperlinking, person entity extraction with coreference resolution, and event extraction. The system utilizes the outputs of th...
Citation Formats
D. Kucuk and A. Yazıcı, “A Text-Based Fully Automated Architecture for the Semantic Annotation and Retrieval of Turkish News Videos,” presented at the 2010 IEEE World Congress on Computational Intelligence, Barcelona, Spain, 2010, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/54871.