Automatic video text localization and recognition

2006-01-01
Saracoglu, Ahmet
Alatan, Abdullah Aydın
For the indexing and management of large scale video databases an important tool would be the text in the digital media. In this work, the localization performances of the overlay texts using different feature extraction methods with different classifiers are analyzed. Besides that in order to improve the text recognition rate by using multiple hipothesis obtained from multilevel segmentation and using statistical language model are investigated.

Suggestions

High efficiency combined - cycle gas polygenerator for ecological local generation (HEGEL)
Pınarcıoğlu, Mehmet Melih(2009-4-30)
Objective is to develop, demonstrate and assess an innovative, high efficiency concept of micro-cogeneration system applied to a real demand site under real operating conditions. The application concept is based on a combined cycle architecture (Combi system) constituted by two integrated cogenerators powered by different prime movers: an innovative reciprocating engine cogenerator and a Rankine engine system (bottoming cycle) operated on the exhaust gases of the reciprocating engine. The location will b...
An agent-based optimization framework for mobile-cloud computing
Angın, Pelin (2013-01-01)
The proliferation of cloud computing resources in the recent years offers a way for mobile devices with limited resources to achieve computationally intensive tasks in real-time. The mobile-cloud computing paradigm, which involves collaboration between mobile and cloud resources, is expected to become increasingly popular in mobile application development. Dynamic partitioning of applications between mobile and cloud platforms based on resource availability is crucial in achieving the best performance for a...
Image classification for content based indexing
Taner, Serdar; Severcan, Mete; Department of Electrical and Electronics Engineering (2003)
As the size of image databases increases in time, the need for content based image indexing and retrieval become important. Image classification is a key to content based image indexing. In this thesis supervised learning with feed forward back propagation artificial neural networks is used for image classification. Low level features derived from the images are used to classify the images to interpret the high level features that yield semantics. Features are derived using detail histogram correlations obt...
Fusion of multimodal information for multimedia information retrieval
Yılmaz, Turgay; Yazıcı, Adnan; Department of Computer Engineering (2014)
An effective retrieval of multimedia data is based on its semantic content. In order to extract the semantic content, the nature of multimedia data should be analyzed carefully and the information contained should be used completely. Multimedia data usually has a complex structure containing multimodal information. Noise in the data, non-universality of any single modality, and performance upper bound of each modality make it hard to rely on a single modality. Thus, multimodal fusion is a practical approach...
Optimization of an online course with web usage mining
Akman, LE; Akkan, B; Baykal, Nazife (2004-02-18)
The huge amount of information existing in the World Wide Web constitutes an ideal environment to implement data mining techniques. Web mining is the mining of web data. There are different applications of web mining: web content mining, web structure mining and web usage mining. In our study we analyzed an online course by web usage mining techniques in order to optimize the navigation paths, the duration of the time spend on each page and the number of visits throughout the semester of the course. Moreove...
Citation Formats
A. Saracoglu and A. A. Alatan, “Automatic video text localization and recognition,” 2006, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/36009.