A NOVEL LEARNING-BASED IMAGE MATCHING APPROACH BASED ON MUTUAL NEAREST NEIGHBOR SEARCH WITH RATIO TEST

2021-09-09
Efe, Ufuk
This thesis proposes a novel image matching method that uses learned features extracted by an off-the-shelf deep neural network to obtain promising performance. The proposed method simply uses a pre-trained VGG architecture as a feature extractor and does not require any additional training to improve matching. Inspired by well-established concepts in psychology, such as the Mental Rotation paradigm, an initial warping step is also performed with the help of a preliminary geometric transformation estimate. The matching estimates are based on dense matching using Mutual Nearest Neighbor Search with Bidirectional Ratio Test (MNNSwBRT) on the terminal-layer outputs of the VGG network for the two images. After this initial alignment, the same approach is repeated at every network level between the reference and aligned images in a hierarchical manner to reach good localization and matching performance. In comprehensive experiments, five classical and four learning-based methods from the literature are also compared while a single parameter is optimized, and it is shown that the proposed method achieves state-of-the-art performance. As a result of this fair comparison, the experimental results on the HPatches dataset reveal that the performance gap between classical and learning-based methods is not as significant as reported in most previous studies. Hence, one can conclude that the proposed method, which uses only a pre-trained network and the ratio test, outperforms most well-trained learning-based methods.
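As an illustration of the matching rule named in the abstract, the following is a minimal sketch of mutual nearest neighbor search with a ratio test applied in both directions. It is not the thesis implementation: the function names `nearest_two` and `mutual_nn_ratio_match`, the Euclidean distance, the default threshold of 0.9, and the use of plain Python lists as descriptors are all assumptions made for this example. In the thesis, the descriptors would come from VGG feature maps.

```python
import math


def nearest_two(query, candidates):
    """Return (best_index, best_dist, second_dist) for one descriptor.

    Hypothetical helper: brute-force search with Euclidean distance.
    Ties are broken in favor of the lower candidate index.
    """
    dists = sorted((math.dist(query, c), j) for j, c in enumerate(candidates))
    (d1, j1), (d2, _) = dists[0], dists[1]
    return j1, d1, d2


def mutual_nn_ratio_match(desc_a, desc_b, ratio=0.9):
    """Match two descriptor sets (assumed interface, not the thesis API).

    A pair (i, j) is kept only if:
      1. j is the nearest neighbor of i in desc_b, and
         i is the nearest neighbor of j in desc_a (mutual check);
      2. the nearest/second-nearest distance ratio is below `ratio`
         in BOTH directions (bidirectional ratio test).
    """
    # The ratio test needs a second-nearest neighbor on each side.
    if len(desc_a) < 2 or len(desc_b) < 2:
        return []

    a_to_b = [nearest_two(a, desc_b) for a in desc_a]
    b_to_a = [nearest_two(b, desc_a) for b in desc_b]

    matches = []
    for i, (j, d1, d2) in enumerate(a_to_b):
        i_back, e1, e2 = b_to_a[j]
        if i_back != i:
            continue  # not mutual nearest neighbors
        # Written as d1 < ratio * d2 to avoid dividing by zero.
        if d1 < ratio * d2 and e1 < ratio * e2:
            matches.append((i, j))
    return matches


if __name__ == "__main__":
    # Toy 2-D descriptors; real descriptors would be high-dimensional.
    desc_a = [(0.0, 0.0), (10.0, 0.0), (5.0, 5.0)]
    desc_b = [(0.0, 0.1), (10.0, 0.1), (100.0, 100.0)]
    print(mutual_nn_ratio_match(desc_a, desc_b))
```

In this toy input, the third descriptor in each set has no distinctive counterpart, so it is rejected. The brute-force search costs O(nm) distance computations, which is acceptable for a sketch but would need a vectorized or GPU implementation for dense feature maps.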

Suggestions

A temporal neural network model for constructing connectionist expert system knowledge bases
Alpaslan, Ferda Nur (Elsevier BV, 1996-04-01)
This paper introduces a temporal feedforward neural network model that can be applied to a number of neural network application areas, including connectionist expert systems. The neural network model has a multi-layer structure, i.e. the number of layers is not limited. Also, the model has the flexibility of defining output nodes in any layer. This is especially important for connectionist expert system applications.
A NOVEL BOVW MIMICKING END-TO-END TRAINABLE CNN CLASSIFICATION FRAMEWORK USING OPTIMAL TRANSPORT THEORY
Gürbüz, Yeti Ziya (2019-01-01)
An end-to-end trainable convolutional neural network (CNN) framework which mimics bag of visual words (BoVW) is proposed for image classification. To this end, a new paradigm for histogram-like image representation is introduced and optimal transport (OT) distance is utilized for the similarity assessment. Any patch of an image is considered as a unique visual word and the image is represented as the uniform histogram of the visual words with the histogram bins associated to embedding vectors according to t...
A heuristic algorithm for optical character recognition of Arabic script
Yarman Vural, Fatoş T.; Atici, A. Alper (1996-03-20)
In this paper, a heuristic method is developed for segmentation, feature extraction and recognition of the Arabic script. The study is part of a large project for the transcription of the documents in Ottoman Archives. A geometrical and topological feature analysis method is developed for segmentation and feature extraction stages. Chain code transformation is applied to main strokes of the characters which are then classified by the hidden Markov model (HMM) in the recognition stage. Experimental results i...
A Transformation Media Based Approach for Efficient Monte Carlo Analysis of Scattering From Rough Surfaces With Objects
Ozgun, Ozlem; Kuzuoğlu, Mustafa (2013-03-01)
This paper presents a computational model that utilizes transformation-based metamaterials to enhance the performance of numerical modeling methods for achieving the statistical characterization of two-dimensional electromagnetic scattering from objects on or above one-dimensional rough sea surfaces. Monte Carlo simulation of the rough surface scattering problem by means of differential equation-based finite methods (such as finite element or finite difference methods) usually places a heavy burden on compu...
A heuristic algorithm for optical character recognition of Arabic script
Atici, A. Alper; Yarman Vural, Fatoş T. (1997-10-01)
In this paper, a heuristic method is developed for segmentation, feature extraction and recognition of the Arabic script. The study is part of a large project for transcription of the documents in Ottoman Archives. A geometrical and topological feature analysis method is developed for segmentation and feature extraction stages. Chain code transformation is applied to main strokes of the characters, which are classified by the hidden Markov model (HMM) in the recognition stage. Experimental results indicate ...
Citation Formats
U. Efe, “A NOVEL LEARNING-BASED IMAGE MATCHING APPROACH BASED ON MUTUAL NEAREST NEIGHBOR SEARCH WITH RATIO TEST,” M.S. - Master of Science, Middle East Technical University, 2021.