Open Access iconOpen Access

ARTICLE

crossmark

Enhancing Dense Small Object Detection in UAV Images Based on Hybrid Transformer

Changfeng Feng1, Chunping Wang2, Dongdong Zhang1, Renke Kou1, Qiang Fu1,*

1Shijiazhuang Campus, People Liberation Army Engineering University, Shijiazhuang, 050003, China
2 School of Information and Intelligent Engineering, Sanya University, Sanya, 572000, China

* Corresponding Author: Qiang Fu. Email: email

Computers, Materials & Continua 2024, 78(3), 3993-4013. https://doi.org/10.32604/cmc.2024.048351

Abstract

Transformer-based models have facilitated significant advances in object detection. However, their extensive computational consumption and suboptimal detection of dense small objects curtail their applicability in unmanned aerial vehicle (UAV) imagery. Addressing these limitations, we propose a hybrid transformer-based detector, H-DETR, and enhance it for dense small objects, leading to an accurate and efficient model. Firstly, we introduce a hybrid transformer encoder, which integrates a convolutional neural network-based cross-scale fusion module with the original encoder to handle multi-scale feature sequences more efficiently. Furthermore, we propose two novel strategies to enhance detection performance without incurring additional inference computation. Query filter is designed to cope with the dense clustering inherent in drone-captured images by counteracting similar queries with a training-aware non-maximum suppression. Adversarial denoising learning is a novel enhancement method inspired by adversarial learning, which improves the detection of numerous small targets by counteracting the effects of artificial spatial and semantic noise. Extensive experiments on the VisDrone and UAVDT datasets substantiate the effectiveness of our approach, achieving a significant improvement in accuracy with a reduction in computational complexity. Our method achieves 31.9% and 21.1% AP on the VisDrone and UAVDT datasets, respectively, and has a faster inference speed, making it a competitive model in UAV image object detection.

Keywords


Cite This Article

APA Style
Feng, C., Wang, C., Zhang, D., Kou, R., Fu, Q. (2024). Enhancing dense small object detection in UAV images based on hybrid transformer. Computers, Materials & Continua, 78(3), 3993-4013. https://doi.org/10.32604/cmc.2024.048351
Vancouver Style
Feng C, Wang C, Zhang D, Kou R, Fu Q. Enhancing dense small object detection in UAV images based on hybrid transformer. Comput Mater Contin. 2024;78(3):3993-4013 https://doi.org/10.32604/cmc.2024.048351
IEEE Style
C. Feng, C. Wang, D. Zhang, R. Kou, and Q. Fu, “Enhancing Dense Small Object Detection in UAV Images Based on Hybrid Transformer,” Comput. Mater. Contin., vol. 78, no. 3, pp. 3993-4013, 2024. https://doi.org/10.32604/cmc.2024.048351



cc Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 576

    View

  • 258

    Download

  • 0

    Like

Share Link