Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art
Download
Transformers in Small Object Detection.pdf
Date
2025-09-10
Author
Miri Rekavandi, Aref
Rashidi, Shima
Boussaid, Farid
Hoefs, Stephen
Akbaş, Emre
Bennamoun, Mohammed
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
103
views
3
downloads
Cite This
Transformers have rapidly gained popularity in computer vision, especially in the field of object detection. Upon examining the outcomes of state-of-the-art object detection methods, we noticed that transformers consistently outperformed well-established CNN-based detectors in almost every video or image dataset. Small objects have been identified as one of the most challenging object types in detection frameworks due to their low visibility. This article aims to explore the performance benefits offered by such extensive networks and identify potential reasons for their Small Object Detection (SOD) superiority. We aim to investigate potential strategies that could further enhance transformers’ performance in SOD. This survey presents a taxonomy of over 60 research studies on developed transformers for the task of SOD, spanning the years 2020 to 2023. These studies encompass a variety of detection applications, including SOD in generic images, aerial images, medical images, active millimeter images, underwater images, and videos. We also compile and present a list of 12 large-scale datasets suitable for SOD that were overlooked in previous studies and compare the performance of the reviewed studies using popular metrics such as mean Average Precision (mAP), Frames Per Second (FPS), and number of parameters.
Subject Keywords
attention
,
deep learning
,
MS COCO dataset
,
object localization
,
Object recognition
,
small object detection
,
vision transformers
URI
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105021924275&origin=inward
https://hdl.handle.net/11511/117269
Journal
ACM Computing Surveys
DOI
https://doi.org/10.1145/3758090
Collections
Department of Computer Engineering, Article
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
A. Miri Rekavandi, S. Rashidi, F. Boussaid, S. Hoefs, E. Akbaş, and M. Bennamoun, “Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art,”
ACM Computing Surveys
, vol. 58, no. 3, pp. 0–0, 2025, Accessed: 00, 2025. [Online]. Available: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105021924275&origin=inward.