Does estimated depth help object detection?

Download
2019
Çetinkaya, Bedrettin
With the widespread use of RGB-D cameras, depth information has improved solutions of many computer vision problems including object detection. Object detection can exploit depth information and different encodings obtained from the depth map. Although previous works proved that depth information can be used to improve object detection results, this thesis investigates the effects of depth map to object detection from different aspects in detailed experiments. To clarify these effects, we examine the following three questions: (i) Should depth be used in its raw form or should it be processed to obtain different encodings and color spaces? (ii) How and when should the depth information be integrated into the object detection pipeline? (iii) how does estimated depth affect object detection results? In addition, we propose a novel method to integrate depth features into the processing pipeline of a modern two-stage object detector. Compared to previous methods, our method produces better results and uses fewer parameters. In this thesis, we also explore new loss functions to better handle the consistency between RGB and depth discontinuities. We proposed both hand-crafted and learning-based loss functions which we call the" bound loss". Using the bound loss, we were able v to improve the mean average and absolute errors for depth estimation.

Suggestions

Deep convolutional neural networks for airport detection in remote sensing images
Budak, Umit; Sengur, Abdulkadir; Halıcı, Uğur (2018-05-05)
This study investigated the use of deep convolutional neural networks (CNNs) in providing a solution for the problem of airport detection in remote sensing images (RSIs). In recent years, Deep CNNs have gained much attention with numerous applications having been undertaken in the area of computer vision. Researchers generally approach airport detection as a pattern recognition problem, in which first various distinctive features are extracted, and then a classifier is adopted to detect airports. CNNs not o...
AN ANALYSIS ON THE EFFECT OF DYNAMIC RANGE ON OBJECT DETECTION WITH DEEP NEURAL NETWORKS
Koçdemir, İsmail Hakkı; Kalkan, Sinan; Alatan, Abdullah Aydın; Department of Computer Engineering (2021-10-8)
An important problem in computer vision, particularly in object detection, is being able to perceive objects even under challenging illumination conditions. Being robust to such conditions is especially important in applications, such as autonomous driving. Despite the significance of the problem, existing autonomous driving systems use deep object detection networks with low-dynamic range (LDR) images during both the training phase and the testing phase. In this thesis, we investigate whether high-dynamic ...
Does depth estimation help object detection?
Cetinkaya, Bedrettin; Kalkan, Sinan; Akbaş, Emre (2022-06-01)
Ground-truth depth, when combined with color data, helps improve object detection accuracy over baseline models that only use color. However, estimated depth does not always yield improvements. Many factors affect the performance of object detection when estimated depth is used. In this paper, we comprehensively investigate these factors with detailed experiments, such as using ground-truth vs. estimated depth, effects of different state-of-the-art depth estimation networks, effects of using different indoo...
Automatic target recognition and detection in infrared imagery under cluttered background
GÜNDOĞDU, ERHAN; KOÇ, AYKUT; Alatan, Abdullah Aydın (2017-09-14)
Visual object classification has long been studied in visible spectrum by utilizing conventional cameras. Since the labeled images has recently increased in number, it is possible to train deep Convolutional Neural Networks (CNN) with significant amount of parameters. As the infrared (IR) sensor technology has been improved during the last two decades, labeled images extracted from IR sensors have been started to be used for object detection and recognition tasks. We address the problem of infrared object ...
Visual object detection and tracking using local convolutional context features and recurrent neural networks
Kaya, Emre Can; Alatan, Abdullah Aydın; Department of Electrical and Electronics Engineering (2018)
Visual object detection and tracking are two major problems in computer vision which have important real-life application areas. During the last decade, Convolutional Neural Networks (CNNs) have received significant attention and outperformed methods that rely on handcrafted representations in both detection and tracking. On the other hand, Recurrent Neural Networks (RNNs) are commonly preferred for modeling sequential data such as video sequences. A novel convolutional context feature extension is introduc...
Citation Formats
B. Çetinkaya, “Does estimated depth help object detection?,” Thesis (M.S.) -- Graduate School of Natural and Applied Sciences. Computer Engineering., Middle East Technical University, 2019.