Fine‐grained recognition of maritime vessels and land vehicles by deep feature embedding

Date

2018-12-01

Author

Solmaz, Berkan
Gundogdu, Erhan
Yucesoy, Veysel
Koc, Aykut
Alatan, Abdullah Aydın

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

281
views

0
downloads

Recent advances in large-scale image and video analysis have empowered the potential capabilities of visual surveillance systems. In particular, deep learning-based approaches bring in substantial benefits in solving certain computer vision problems such as fine-grained object recognition. Here, the authors mainly concentrate on classification and identification of maritime vessels and land vehicles, which are the key constituents of visual surveillance systems. Employing publicly available data sets for maritime vessels and land vehicles, the authors aim to improve visual recognition. Specifically, the authors focus on five tasks regarding visual recognition; coarse-grained classification, fine-grained classification, coarse-grained retrieval, fine-grained retrieval, and verification. To increase the performance in these tasks, the authors utilise a multi-task learning framework and present a novel loss function which simultaneously considers deep feature learning and classification by exploiting the available hierarchical labels of individual samples and the global statistics of distances between the data pairs. The authors observe that the proposed multi-task learning model improves the fine-grained recognition performance on MARVEL and Stanford Cars data sets, compared to training of a model targeting a single recognition task.

Subject Keywords

Marine vehicles, Stanford Cars data set, MARVEL data set, Data pairs, Hierarchical individual sample label, Global statistics, Loss function, Multitask learning framework, Verification task, Fine-grained retrieval task, Coarse-grained retrieval task, Fine-grained classification task, Coarse-grained classification task, Visual recognition, Land vehicle identification, Maritime vessel identification, Fine-grained object recognition, Computer vision problems, Deep learning-based approaches, Visual surveillance systems, Large-scale video analysis, Large-scale image analysis, Deep feature embedding, Fine-grained land vehicle recognition, Fine-grained maritime vessel recognition, Video retrieval, Traffic engineering computing, Statistical analysis, Learning (artificial intelligence), Object recognition, Image classification

URI

https://hdl.handle.net/11511/37334

Journal

IET COMPUTER VISION

DOI

https://doi.org/10.1049/iet-cvi.2018.5187

Collections

Department of Electrical and Electronics Engineering, Article

Suggestions

OpenMETU
Core

Comparison of 3D local and global descriptors for similarity retrieval of range data Bayramoglu, Neslihan; Alatan, Abdullah Aydın (2016-04-05) Recent improvements in scanning technologies such as consumer penetration of RGB-D cameras lead obtaining and managing range image databases practical. Hence, the need for describing and indexing such data arises. In this study, we focus on similarity indexing of range data among a database of range objects (range-to-range retrieval) by employing only single view depth information. We utilize feature based approaches both on local and global scales. However, the emphasis is on the local descriptors with the...
Automated crowd behavior analysis for video surveillance applications Güler, Püren; Temizel, Alptekin; Taşkaya Temizel, Tuğba; Department of Information Systems (2012) Automated analysis of a crowd behavior using surveillance videos is an important issue for public security, as it allows detection of dangerous crowds and where they are headed. Computer vision based crowd analysis algorithms can be divided into three groups; people counting, people tracking and crowd behavior analysis. In this thesis, the behavior understanding will be used for crowd behavior analysis. In the literature, there are two types of approaches for behavior understanding problem: analyzing behavi...
Comparison of approaches for mobile document image analysis using server supported smartphones Ozarslan, Suleyman; Eren, Pekin Erhan (2014-02-05) With the recent advances in mobile technologies, new capabilities are emerging, such as mobile document image analysis. However, mobile phones are still less powerful than servers, and they have some resource limitations. One approach to overcome these limitations is performing resource-intensive processes of the application on remote servers. In mobile document image analysis, the most resource consuming process is the Optical Character Recognition (OCR) process, which is used to extract text in mobile pho...
Fine-Grained Object Recognition and Zero-Shot Learning in Remote Sensing Imagery Sumbul, Gencer; Cinbiş, Ramazan Gökberk; Aksoy, Selim (2018-02-01) Fine-grained object recognition that aims to identify the type of an object among a large number of subcategories is an emerging application with the increasing resolution that exposes new details in image data. Traditional fully supervised algorithms fail to handle this problem where there is low betweenclass variance and high within-class variance for the classes of interest with small sample sizes. We study an even more extreme scenario named zero-shot learning (ZSL) in which no training example exists f...
3D POINT CLOUD CLASSIFICATION WITH GANs: ACGAN and VACWGAN-GP ERGÜN, ONUR; Sahillioğlu, Yusuf; Department of Computer Engineering (2022-2-11) With the developing technology and the power of sensors, 3D data has started to be used in almost every field. Point clouds detected with LIDAR sensors or obtained by sampling 3D meshes have begun to come to the fore in many areas from autonomous driving to data visualization, from generating new data and mesh to classifying detected 3D objects. Machine learning and deep learning techniques are widely used to make sense of this produced data and to implement various applications. In this work, we propose ne...

Citation Formats

B. Solmaz, E. Gundogdu, V. Yucesoy, A. Koc, and A. A. Alatan, “Fine‐grained recognition of maritime vessels and land vehicles by deep feature embedding,” IET COMPUTER VISION, pp. 1121–1132, 2018, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/37334.