A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D Images

Download

08839514.2022.pdf

Date

2022-02-01

Author

Ülkü, İrem
Akagündüz, Erdem

Metadata

Show full item record

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Item Usage Stats

215
views

265
downloads

Semantic segmentation is the pixel-wise labeling of an image. Boosted by the extraordinary ability of convolutional neural networks (CNN) in creating semantic, high-level and hierarchical image features; several deep learning-based 2D semantic segmentation approaches have been proposed within the last decade. In this survey, we mainly focus on the recent scientific developments in semantic segmentation, specifically on deep learning-based methods using 2D images. We started with an analysis of the public image sets and leaderboards for 2D semantic segmentation, with an overview of the techniques employed in performance evaluation. In examining the evolution of the field, we chronologically categorized the approaches into three main periods, namely pre-and early deep learning era, the fully convolutional era, and the post-FCN era. We technically analyzed the solutions put forward in terms of solving the fundamental problems of the field, such as fine-grained localization and scale invariance. Before drawing our conclusions, we present a table of methods from all mentioned eras, with a summary of each approach that explains their contribution to the field. We conclude the survey by discussing the current challenges of the field and to what extent they have been solved.

URI

https://hdl.handle.net/11511/96480

Journal

APPLIED ARTIFICIAL INTELLIGENCE

DOI

https://doi.org/10.1080/08839514.2022.2032924

Collections

Graduate School of Informatics, Article

Suggestions

OpenMETU
Core

Novel Optimization Models to Generalize Deep Metric Learning Gürbüz, Yeti Ziya; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2022-8-24) Deep metric learning (DML) aims to fit a parametric embedding function to data of semantic information (e.g. images) so that l2-distance between embedded samples is low whenever they share similar semantic entities. An embedding function of such behavior is attained by minimizing empirical expected pairwise loss that penalizes inter-/intra-class proximity violations in embedding space. Proxy-based methods which use a learnable embedding vector per class in their loss formulation are state-of-the-art. We fir...
A rule-based method for object segmentation in video sequences Alatan, Abdullah Aydın; Onural, L (1997-01-01) Object segmentation and tracking are problems within the scope of MPEG-4 and MPEG-7 standardization activities. A novel algorithm for both object segmentation and tracking is presented. The algorithm fuses motion, color, and accumulated previous segmentation data at 'region level', in contrast to conventional 'pixel level' approaches. The information fusion is achieved by a rule-based region processing unit which intelligently utilizes the motion information to locate the objects in the scene, the color inf...
A land-cover classification for landslide susceptibility mapping by using feature components YESİLNACAR, ERTAN; Süzen, Mehmet Lütfi (2006-06-20) Classifying original bands and/or image components may cause unsatisfactory results in fields that have heterogeneous reflectance. In such cases, the demand for accurate land-use, land-cover, vegetation, and forestry information may require more specific components. The components should represent peculiar information collected from several inputs for target land covers. In this study, a new technique of land-cover classification was explored to prepare an input which increases the success of landslide susc...
A Proposed Methodology for Evaluating HDR False Color Maps Akyüz, Ahmet Oğuz (Association for Computing Machinery (ACM), 2016-08-01) Color mapping, which involves assigning colors to the individual elements of an underlying data distribution, is a commonly used method for data visualization. Although color maps are used in many disciplines and for a variety of tasks, in this study we focus on its usage for visualizing luminance maps. Specifically, we ask ourselves the question of how to best visualize a luminance distribution encoded in a high-dynamic-range (HDR) image using false colors such that the resulting visualization is the most ...
Combining MPEG-7 based visual experts for reaching semantics Soysal, M; Alatan, Abdullah Aydın (2003-01-01) Semantic classification of images using low-level features is a challenging problem. Combining experts with different classifier structures, trained by MPEG-7 low-level color and texture descriptors is examined as a solution alternative. For combining different classifiers and features, two advanced decision mechanisms are proposed, one of which enjoys a significant classification performance improvement. Simulations are conducted on 8 different visual semantic classes, resulting in accuracy improvements be...

Citation Formats

İ. Ülkü and E. Akagündüz, “A Survey on Deep Learning-based Architectures for Semantic Segmentation on 2D Images,” APPLIED ARTIFICIAL INTELLIGENCE, pp. 0–0, 2022, Accessed: 00, 2022. [Online]. Available: https://hdl.handle.net/11511/96480.