Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Fine‐grained recognition of maritime vessels and land vehicles by deep feature embedding
Date
2018-12-01
Author
Solmaz, Berkan
Gundogdu, Erhan
Yucesoy, Veysel
Koc, Aykut
Alatan, Abdullah Aydın
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
260
views
0
downloads
Cite This
Recent advances in large-scale image and video analysis have empowered the potential capabilities of visual surveillance systems. In particular, deep learning-based approaches bring in substantial benefits in solving certain computer vision problems such as fine-grained object recognition. Here, the authors mainly concentrate on classification and identification of maritime vessels and land vehicles, which are the key constituents of visual surveillance systems. Employing publicly available data sets for maritime vessels and land vehicles, the authors aim to improve visual recognition. Specifically, the authors focus on five tasks regarding visual recognition; coarse-grained classification, fine-grained classification, coarse-grained retrieval, fine-grained retrieval, and verification. To increase the performance in these tasks, the authors utilise a multi-task learning framework and present a novel loss function which simultaneously considers deep feature learning and classification by exploiting the available hierarchical labels of individual samples and the global statistics of distances between the data pairs. The authors observe that the proposed multi-task learning model improves the fine-grained recognition performance on MARVEL and Stanford Cars data sets, compared to training of a model targeting a single recognition task.
Subject Keywords
Marine vehicles
,
Stanford Cars data set
,
MARVEL data set
,
Data pairs
,
Hierarchical individual sample label
,
Global statistics
,
Loss function
,
Multitask learning framework
,
Verification task
,
Fine-grained retrieval task
,
Coarse-grained retrieval task
,
Fine-grained classification task
,
Coarse-grained classification task
,
Visual recognition
,
Land vehicle identification
,
Maritime vessel identification
,
Fine-grained object recognition
,
Computer vision problems
,
Deep learning-based approaches
,
Visual surveillance systems
,
Large-scale video analysis
,
Large-scale image analysis
,
Deep feature embedding
,
Fine-grained land vehicle recognition
,
Fine-grained maritime vessel recognition
,
Video retrieval
,
Traffic engineering computing
,
Statistical analysis
,
Learning (artificial intelligence)
,
Object recognition
,
Image classification
URI
https://hdl.handle.net/11511/37334
Journal
IET COMPUTER VISION
DOI
https://doi.org/10.1049/iet-cvi.2018.5187
Collections
Department of Electrical and Electronics Engineering, Article
Suggestions
OpenMETU
Core
Comparison of 3D local and global descriptors for similarity retrieval of range data
Bayramoglu, Neslihan; Alatan, Abdullah Aydın (2016-04-05)
Recent improvements in scanning technologies such as consumer penetration of RGB-D cameras lead obtaining and managing range image databases practical. Hence, the need for describing and indexing such data arises. In this study, we focus on similarity indexing of range data among a database of range objects (range-to-range retrieval) by employing only single view depth information. We utilize feature based approaches both on local and global scales. However, the emphasis is on the local descriptors with the...
Automated crowd behavior analysis for video surveillance applications
Güler, Püren; Temizel, Alptekin; Taşkaya Temizel, Tuğba; Department of Information Systems (2012)
Automated analysis of a crowd behavior using surveillance videos is an important issue for public security, as it allows detection of dangerous crowds and where they are headed. Computer vision based crowd analysis algorithms can be divided into three groups; people counting, people tracking and crowd behavior analysis. In this thesis, the behavior understanding will be used for crowd behavior analysis. In the literature, there are two types of approaches for behavior understanding problem: analyzing behavi...
Comparison of approaches for mobile document image analysis using server supported smartphones
Ozarslan, Suleyman; Eren, Pekin Erhan (2014-02-05)
With the recent advances in mobile technologies, new capabilities are emerging, such as mobile document image analysis. However, mobile phones are still less powerful than servers, and they have some resource limitations. One approach to overcome these limitations is performing resource-intensive processes of the application on remote servers. In mobile document image analysis, the most resource consuming process is the Optical Character Recognition (OCR) process, which is used to extract text in mobile pho...
Fine-Grained Object Recognition and Zero-Shot Learning in Remote Sensing Imagery
Sumbul, Gencer; Cinbiş, Ramazan Gökberk; Aksoy, Selim (2018-02-01)
Fine-grained object recognition that aims to identify the type of an object among a large number of subcategories is an emerging application with the increasing resolution that exposes new details in image data. Traditional fully supervised algorithms fail to handle this problem where there is low betweenclass variance and high within-class variance for the classes of interest with small sample sizes. We study an even more extreme scenario named zero-shot learning (ZSL) in which no training example exists f...
3D POINT CLOUD CLASSIFICATION WITH GANs: ACGAN and VACWGAN-GP
ERGÜN, ONUR; Sahillioğlu, Yusuf; Department of Computer Engineering (2022-2-11)
With the developing technology and the power of sensors, 3D data has started to be used in almost every field. Point clouds detected with LIDAR sensors or obtained by sampling 3D meshes have begun to come to the fore in many areas from autonomous driving to data visualization, from generating new data and mesh to classifying detected 3D objects. Machine learning and deep learning techniques are widely used to make sense of this produced data and to implement various applications. In this work, we propose ne...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
B. Solmaz, E. Gundogdu, V. Yucesoy, A. Koc, and A. A. Alatan, “Fine‐grained recognition of maritime vessels and land vehicles by deep feature embedding,”
IET COMPUTER VISION
, pp. 1121–1132, 2018, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/37334.