Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Localization and recognition of text in digital media
Download
index.pdf
Date
2007
Author
Saracoğlu, Ahmet
Metadata
Show full item record
Item Usage Stats
209
views
100
downloads
Cite This
Textual information within digital media can be used in many areas such as, indexing and structuring of media databases, in the aid of visually impaired, translation of foreign signs and many more. This said, mainly text can be separated into two categories in digital media as, overlay-text and scene-text. In this thesis localization and recognition of video text regardless of its category in digital media is investigated. As a necessary first step, framework of a complete system is discussed. Next, a comparative analysis of feature vector and classification method pairs is presented. Furthermore, multi-part nature of text is exploited by proposing a novel Markov Random Field approach for the classification of text/non-text regions. Additionally, better localization of text is achieved by introducing bounding-box extraction method. And for the recognition of text regions, a handprint based Optical Character Recognition system is thoroughly investigated. During the investigation of text recognition, multi-hypothesis approach for the segmentation of background is proposed by incorporating k-Means clustering. Furthermore, a novel dictionary-based ranking mechanism is proposed for recognition spelling correction. And overall system is simulated on a challenging data set. Also, a through survey on scene-text localization and recognition is presented. Furthermore, challenges are identified and discussed by providing related work on them. Scene-text localization simulations on a public competition data set are also provided. Lastly, in order to improve recognition performance of scene-text on signs that are affected from perspective projection distortion, a rectification method is proposed and simulated.
Subject Keywords
Electrical Engineering.
,
Electronics.
,
Nuclear Engineering.
URI
http://etd.lib.metu.edu.tr/upload/2/12609028/index.pdf
https://hdl.handle.net/11511/17419
Collections
Graduate School of Natural and Applied Sciences, Thesis
Suggestions
OpenMETU
Core
Localization in underwater acoustic sensor networks
Işık, Mehmet Talha; Akan, Özgür Barış; Department of Electrical and Electronics Engineering (2007)
Underwater Acoustic Sensor Networks (UW-ASNs) have the potential to enable many applications such as environmental monitoring, undersea exploration and distributed tactical surveillance. In order to realize the potential gains of these applications, it is essential that the sensor nodes can be accurately located in a three dimensional underwater sensor network topology. Although many localization protocols have been proposed recently for terrestrial sensor networks, the unique characteristics of the underwa...
Causal inference in graph text constellations Designing verbally annotated graphs
CHROSTOPHER, HABEL; Acartürk, Cengiz (2011-01-01)
Multimodal documents combining language and graphs are wide-spread in print media as well as in electronic media. One of the most important tasks to be solved in comprehending graph-text combinations is construction of causal chains among the meaning entities provided by modalities. In this study we focus on the role of annotation position and shape of graph lines in simple line graphs on causal attributions concerning the event presented by the annotation and the processes (i.e. increases and decreases) an...
Implement of three segmentation algorithms for CT images of torso
Öz, Sinan; Serinağaoğlu Doğrusöz, Yeşim; Department of Electrical and Electronics Engineering (2011)
Many practical applications in the field of medical image processing require valid and reliable segmentation of images. In this dissertation, we propose three different semi-automatic segmentation frameworks for 2D-upper torso medical images to construct 3D geometric model of the torso structures. In the first framework, an extended version of the Otsu’s method for three level thresholding and a recursive connected component algorithm are combined. The segmentation process is accomplished by first using Ext...
S-band hybrid 4 bit phase shifter using cots components
Erkek, Eser; Demir, Şimşek; Department of Electrical and Electronics Engineering (2009)
Microwave and millimeter-wave phase shifters are one of the most important structures of the antenna series that are used in communication and radar applications. They are used to form the main beam of the electronically scanned phase array antennas and generate the appropriate phase values for the antenna elements design while providing electronic beam steering. In this thesis, S-band hybrid 4 bit phase shifter of 22.5º phase resolution is designed, simulated, fabricated and measured. Bits are separately d...
Dense depth map estimation for object segmentation in multi-view video
Çığla, Cevahir; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2007)
In this thesis, novel approaches for dense depth field estimation and object segmentation from mono, stereo and multiple views are presented. In the first stage, a novel graph-theoretic color segmentation algorithm is proposed, in which the popular Normalized Cuts 59H[6] segmentation algorithm is improved with some modifications on its graph structure. Segmentation is obtained by the recursive partitioning of the weighted graph. The simulation results for the comparison of the proposed segmentation scheme w...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
A. Saracoğlu, “Localization and recognition of text in digital media,” M.S. - Master of Science, Middle East Technical University, 2007.