Shape descriptors based on intersection consistency and global binary patterns

Download
2012
Sivri, Erdal
Shape description is an important problem in computer vision because most vision tasks that require comparing or matching visual entities rely on shape descriptors. In this thesis, two novel shape descriptors are proposed, namely Intersection Consistency Histogram (ICH) and Global Binary Patterns (GBP). The former is based on a local regularity measure called Intersection Consistency (IC), which determines whether edge pixels in an image patch point towards the center or not. The second method, called Global Binary Patterns, represents the shape in binary along horizontal, vertical, diagonal or principal directions. These two methods are extensively analyzed on several databases, and retrieval and running time performances are presented. Moreover, these methods are compared with methods such as Shape Context, Histograms of Oriented Gradients, Local Binary Patterns and Fourier Descriptors. We report that our descriptors perform comparable to these methods.

Suggestions

Data-driven image captioning via salient region discovery
Kilickaya, Mert; Akkuş, Burak Kerim; Çakıcı, Ruket; Erdem, Aykut; Erdem, Erkut; İKİZLER CİNBİŞ, NAZLI (Institution of Engineering and Technology (IET), 2017-09-01)
n the past few years, automatically generating descriptions for images has attracted a lot of attention in computer vision and natural language processing research. Among the existing approaches, data-driven methods have been proven to be highly effective. These methods compare the given image against a large set of training images to determine a set of relevant images, then generate a description using the associated captions. In this study, the authors propose to integrate an object-based semantic image r...
Continuous dimensionality characterization of image structures
Felsberg, Michael; Kalkan, Sinan; Kruger, Norbert (Elsevier BV, 2009-05-04)
Intrinsic dimensionality is a concept introduced by statistics and later used in image processing to measure the dimensionality of a data set. In this paper, we introduce a continuous representation of the intrinsic dimension of an image patch in terms of its local spectrum or, equivalently, its gradient field. By making use of a cone structure and barycentric co-ordinates, we can associate three confidences to the three different ideal cases of intrinsic dimensions corresponding to homogeneous image patche...
Extraction of shape skeletons from grayscale images
Tarı, Zehra Sibel; Pien, H (Elsevier BV, 1997-05-01)
Shape skeletons have been used in computer vision to represent shapes and discover their salient features. Earlier attempts were based on morphological approach in which a shape is eroded successively and uniformly until it is reduced to its skeleton. The main difficulty with this approach is its sensitivity to noise and several approaches have been proposed for dealing with this problem. In this paper, we propose a new method based on diffusion to smooth out the noise and extract shape skeletons in a robus...
Visual object detection and tracking using local convolutional context features and recurrent neural networks
Kaya, Emre Can; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2018)
Visual object detection and tracking are two major problems in computer vision which have important real-life application areas. During the last decade, Convolutional Neural Networks (CNNs) have received significant attention and outperformed methods that rely on handcrafted representations in both detection and tracking. On the other hand, Recurrent Neural Networks (RNNs) are commonly preferred for modeling sequential data such as video sequences. A novel convolutional context feature extension is introduc...
Phase-space window and degrees of freedom of optical systems with multiple apertures
Ozaktas, Haldun M.; Öktem, Sevinç Figen (The Optical Society, 2013-04-01)
We show how to explicitly determine the space-frequency window (phase-space window) for optical systems consisting of an arbitrary sequence of lenses and apertures separated by arbitrary lengths of free space. If the space-frequency support of a signal lies completely within this window, the signal passes without information loss. When it does not, the parts that lie within the window pass and the parts that lie outside of the window are blocked, a result that is valid to a good degree of approximation for ...
Citation Formats
E. Sivri, “Shape descriptors based on intersection consistency and global binary patterns,” M.S. - Master of Science, Middle East Technical University, 2012.