Evaluation of Feature Channels for Correlation Filter Based Visual Object Tracking in Infrared Spectrum

2016-07-01
Gündoğdu, Erhan
SOLMAZ, Berkan
Koç, AYKUT
HAMMOUD, RI
Alatan, Abdullah Aydın
Correlation filters for visual object tracking in visible imagery has been well-studied. Most of the correlation-filter-based methods use either raw image intensities or feature maps of gradient orientations or color channels. However, well-known features designed for visible spectrum may not be ideal for infrared object tracking, since infrared and visible spectra have dissimilar characteristics in general. We assess the performance of two state-of-the-art correlation-filter-based object tracking methods on Link "oping Thermal InfraRed (LTIR) dataset of medium wave and longwave infrared videos, using deep convolutional neural networks (CNN) features as well as other traditional hand-crafted descriptors. The deep CNN features are trained on an infrared dataset consisting of 16K objects for a supervised classification task. The highest performance in terms of the overlap metric is achieved when these deep CNN features are utilized in a correlation-filter-based tracker.

Suggestions

Fine-grained object recognition and zero-shot learning in multispectral imagery
Sumbul, Gencer; Cinbiş, Ramazan Gökberk; AKSOY, SELİM (2018-05-05)
We present a method for fine-grained object recognition problem, that aims to recognize the type of an object among a large number of sub-categories, and zero-shot learning scenario on multispectral images. In order to establish a relation between seen classes and new unseen classes, a compatibility function between image features extracted from a convolutional neural network and auxiliary information of classes is learnt. Knowledge transfer for unseen classes is carried out by maximizing this function. Per...
Utilization of texture, contrast and color homogeneity for detecting and recognizing text from video frames
Tekinalp, S; Alatan, Abdullah Aydın (2003-09-17)
It is possible to index and manage large video archives in a more efficient manner by detecting and recognizing text within video frames. There are some inherent properties of videotext, such as distinguishing texture, higher contrast against background, and uniform color, making it detectable. By employing these properties, it is possible to detect text regions and binarize the image for character recognition. In this paper, a complete framework for detection and recognition of videotext is presented. The ...
Good Features to Correlate for Visual Tracking
Gundogdu, Erhan; Alatan, Abdullah Aydın (Institute of Electrical and Electronics Engineers (IEEE), 2018-05-01)
During the recent years, correlation filters have shown dominant and spectacular results for visual object tracking. The types of the features that are employed in this family of trackers significantly affect the performance of visual tracking. The ultimate goal is to utilize the robust features invariant to any kind of appearance change of the object, while predicting the object location as properly as in the case of no appearance change. As the deep learning based methods have emerged, the study of learni...
Investigation of the Effect of Face Regions on Local Shape Descriptor Based 3D Face Recognition
İNAN, TOLGA; Halıcı, Uğur (2013-04-26)
In this study, the effect of face regions on local shape descriptor based 3D face recognition was investigated. Our approach starts with calculation of the SIFT descriptors on the shape maps of the 3D faces. In the next phase, SIFT descriptors in the selected regions are concatenated to form feature vectors. Then these feature vectors are fed into linear discriminant analysis (LDA) for face recognition. In this study, faces are segmented into 47 regions and the descriptors in one or more regions are concate...
Towards finding optimal mixture of subspaces for data classification
Musa, Mohamed Elhafiz Mustafa; Atalay, Mehmet Volkan; Department of Computer Engineering (2003)
In pattern recognition, when data has different structures in different parts of the input space, fitting one global model can be slow and inaccurate. Learning methods can quickly learn the structure of the data in local regions, consequently, offering faster and more accurate model fitting. Breaking training data set into smaller subsets may lead to curse of dimensionality problem, as a training sample subset may not be enough for estimating the required set of parameters for the submodels. Increasing the ...
Citation Formats
E. Gündoğdu, B. SOLMAZ, A. Koç, R. HAMMOUD, and A. A. Alatan, “Evaluation of Feature Channels for Correlation Filter Based Visual Object Tracking in Infrared Spectrum,” 2016, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/37986.