Localization and recognition of text in digital media

Download
2007
Saracoğlu, Ahmet
Textual information within digital media can be used in many areas such as, indexing and structuring of media databases, in the aid of visually impaired, translation of foreign signs and many more. This said, mainly text can be separated into two categories in digital media as, overlay-text and scene-text. In this thesis localization and recognition of video text regardless of its category in digital media is investigated. As a necessary first step, framework of a complete system is discussed. Next, a comparative analysis of feature vector and classification method pairs is presented. Furthermore, multi-part nature of text is exploited by proposing a novel Markov Random Field approach for the classification of text/non-text regions. Additionally, better localization of text is achieved by introducing bounding-box extraction method. And for the recognition of text regions, a handprint based Optical Character Recognition system is thoroughly investigated. During the investigation of text recognition, multi-hypothesis approach for the segmentation of background is proposed by incorporating k-Means clustering. Furthermore, a novel dictionary-based ranking mechanism is proposed for recognition spelling correction. And overall system is simulated on a challenging data set. Also, a through survey on scene-text localization and recognition is presented. Furthermore, challenges are identified and discussed by providing related work on them. Scene-text localization simulations on a public competition data set are also provided. Lastly, in order to improve recognition performance of scene-text on signs that are affected from perspective projection distortion, a rectification method is proposed and simulated.

Suggestions

Localization in underwater acoustic sensor networks
Işık, Mehmet Talha; Akan, Özgür Barış; Department of Electrical and Electronics Engineering (2007)
Underwater Acoustic Sensor Networks (UW-ASNs) have the potential to enable many applications such as environmental monitoring, undersea exploration and distributed tactical surveillance. In order to realize the potential gains of these applications, it is essential that the sensor nodes can be accurately located in a three dimensional underwater sensor network topology. Although many localization protocols have been proposed recently for terrestrial sensor networks, the unique characteristics of the underwa...
Causal inference in graph text constellations Designing verbally annotated graphs
CHROSTOPHER, HABEL; Acartürk, Cengiz (2011-01-01)
Multimodal documents combining language and graphs are wide-spread in print media as well as in electronic media. One of the most important tasks to be solved in comprehending graph-text combinations is construction of causal chains among the meaning entities provided by modalities. In this study we focus on the role of annotation position and shape of graph lines in simple line graphs on causal attributions concerning the event presented by the annotation and the processes (i.e. increases and decreases) an...
Implement of three segmentation algorithms for CT images of torso
Öz, Sinan; Serinağaoğlu Doğrusöz, Yeşim; Department of Electrical and Electronics Engineering (2011)
Many practical applications in the field of medical image processing require valid and reliable segmentation of images. In this dissertation, we propose three different semi-automatic segmentation frameworks for 2D-upper torso medical images to construct 3D geometric model of the torso structures. In the first framework, an extended version of the Otsu’s method for three level thresholding and a recursive connected component algorithm are combined. The segmentation process is accomplished by first using Ext...
S-band hybrid 4 bit phase shifter using cots components
Erkek, Eser; Demir, Şimşek; Department of Electrical and Electronics Engineering (2009)
Microwave and millimeter-wave phase shifters are one of the most important structures of the antenna series that are used in communication and radar applications. They are used to form the main beam of the electronically scanned phase array antennas and generate the appropriate phase values for the antenna elements design while providing electronic beam steering. In this thesis, S-band hybrid 4 bit phase shifter of 22.5º phase resolution is designed, simulated, fabricated and measured. Bits are separately d...
Dense depth map estimation for object segmentation in multi-view video
Çığla, Cevahir; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2007)
In this thesis, novel approaches for dense depth field estimation and object segmentation from mono, stereo and multiple views are presented. In the first stage, a novel graph-theoretic color segmentation algorithm is proposed, in which the popular Normalized Cuts 59H[6] segmentation algorithm is improved with some modifications on its graph structure. Segmentation is obtained by the recursive partitioning of the weighted graph. The simulation results for the comparison of the proposed segmentation scheme w...
Citation Formats
A. Saracoğlu, “Localization and recognition of text in digital media,” M.S. - Master of Science, Middle East Technical University, 2007.