Fine‐grained recognition of maritime vessels and land vehicles by deep feature embedding

Solmaz, Berkan
Gundogdu, Erhan
Yucesoy, Veysel
Koc, Aykut
Alatan, Abdullah Aydın
Recent advances in large-scale image and video analysis have expanded the capabilities of visual surveillance systems. In particular, deep learning-based approaches bring substantial benefits to computer vision problems such as fine-grained object recognition. Here, the authors concentrate on the classification and identification of maritime vessels and land vehicles, which are key constituents of visual surveillance systems. Employing publicly available data sets for maritime vessels and land vehicles, the authors aim to improve visual recognition. Specifically, the authors focus on five visual recognition tasks: coarse-grained classification, fine-grained classification, coarse-grained retrieval, fine-grained retrieval, and verification. To increase performance in these tasks, the authors utilise a multi-task learning framework and present a novel loss function that simultaneously considers deep feature learning and classification by exploiting the available hierarchical labels of individual samples and the global statistics of distances between data pairs. The authors observe that the proposed multi-task learning model improves fine-grained recognition performance on the MARVEL and Stanford Cars data sets compared with training a model that targets a single recognition task.
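The abstract does not state the form of the loss, so the following is only a minimal sketch of how a multi-task objective of this general kind might be assembled, not the authors' formulation. It assumes a cross-entropy term for each label level (coarse and fine) plus an embedding term computed from the means and variances of same-class and different-class pairwise distances. The function names, the `margin` and `embed_weight` parameters, and the exact combination are all hypothetical.

```python
import numpy as np


def softmax_cross_entropy(logits, labels):
    """Mean cross-entropy of integer labels under a row-wise softmax."""
    labels = np.asarray(labels)
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()


def pairwise_distance_statistics_loss(embeddings, labels, margin=1.0):
    """Embedding penalty from batch-wide distance statistics (assumed form).

    Penalises the variance of same-class and different-class squared
    distances, plus a hinge on the gap between their means.
    """
    labels = np.asarray(labels)
    diffs = embeddings[:, None, :] - embeddings[None, :, :]
    sq_dists = (diffs ** 2).sum(axis=2)
    upper = np.triu_indices(len(labels), k=1)  # each unordered pair once
    same = (labels[:, None] == labels[None, :])[upper]
    dists = sq_dists[upper]
    pos, neg = dists[same], dists[~same]
    if pos.size == 0 or neg.size == 0:
        return 0.0  # batch has no usable pairs of one kind
    hinge = max(0.0, pos.mean() - neg.mean() + margin)
    return pos.var() + neg.var() + hinge


def multi_task_loss(coarse_logits, fine_logits, embeddings,
                    coarse_labels, fine_labels,
                    embed_weight=1.0, margin=1.0):
    """Sum of coarse and fine classification terms and the embedding term."""
    coarse_ce = softmax_cross_entropy(coarse_logits, coarse_labels)
    fine_ce = softmax_cross_entropy(fine_logits, fine_labels)
    embed = pairwise_distance_statistics_loss(embeddings, fine_labels, margin)
    return coarse_ce + fine_ce + embed_weight * embed
```

In this sketch the embedding term uses the fine-grained labels to define matching pairs. A variant could draw pairs from the coarse labels as well, which is one way the label hierarchy mentioned in the abstract might enter the embedding objective.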


Comparison of 3D local and global descriptors for similarity retrieval of range data
Bayramoglu, Neslihan; Alatan, Abdullah Aydın (2016-04-05)
Recent improvements in scanning technologies, such as the consumer penetration of RGB-D cameras, have made obtaining and managing range image databases practical. Hence, the need for describing and indexing such data arises. In this study, we focus on similarity indexing of range data among a database of range objects (range-to-range retrieval) by employing only single-view depth information. We utilize feature-based approaches on both local and global scales. However, the emphasis is on the local descriptors with the...
Automated crowd behavior analysis for video surveillance applications
Güler, Püren; Temizel, Alptekin; Taşkaya Temizel, Tuğba; Department of Information Systems (2012)
Automated analysis of crowd behavior using surveillance videos is an important issue for public security, as it allows detection of dangerous crowds and where they are headed. Computer vision-based crowd analysis algorithms can be divided into three groups: people counting, people tracking, and crowd behavior analysis. In this thesis, behavior understanding is used for crowd behavior analysis. In the literature, there are two types of approaches to the behavior understanding problem: analyzing behavi...
Comparison of approaches for mobile document image analysis using server supported smartphones
Ozarslan, Suleyman; Eren, Pekin Erhan (2014-02-05)
With recent advances in mobile technologies, new capabilities are emerging, such as mobile document image analysis. However, mobile phones are still less powerful than servers, and they have some resource limitations. One approach to overcoming these limitations is to perform the application's resource-intensive processes on remote servers. In mobile document image analysis, the most resource-consuming process is Optical Character Recognition (OCR), which is used to extract text in mobile pho...
Fine-Grained Object Recognition and Zero-Shot Learning in Remote Sensing Imagery
Sumbul, Gencer; Cinbiş, Ramazan Gökberk; Aksoy, Selim (2018-02-01)
Fine-grained object recognition, which aims to identify the type of an object among a large number of subcategories, is an emerging application given the increasing resolution that exposes new details in image data. Traditional fully supervised algorithms fail to handle this problem, where there is low between-class variance and high within-class variance for the classes of interest with small sample sizes. We study an even more extreme scenario named zero-shot learning (ZSL) in which no training example exists f...
ERGÜN, ONUR; Sahillioğlu, Yusuf; Department of Computer Engineering (2022-2-11)
With the developing technology and the power of sensors, 3D data has started to be used in almost every field. Point clouds detected with LIDAR sensors or obtained by sampling 3D meshes have begun to come to the fore in many areas from autonomous driving to data visualization, from generating new data and mesh to classifying detected 3D objects. Machine learning and deep learning techniques are widely used to make sense of this produced data and to implement various applications. In this work, we propose ne...
Citation Formats
B. Solmaz, E. Gundogdu, V. Yucesoy, A. Koc, and A. A. Alatan, “Fine‐grained recognition of maritime vessels and land vehicles by deep feature embedding,” IET COMPUTER VISION, pp. 1121–1132, 2018, Accessed: 00, 2020. [Online]. Available: