JOURNAL ARTICLE

Multi-Scale Feature Fusion and Context-Enhanced Spatial Sparse Convolution Single-Shot Detector for Unmanned Aerial Vehicle Image Object Detection

Guimei QiZhihong YuJian Song

Year: 2025 Journal:   Applied Sciences Vol: 15 (2)Pages: 924-924   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Accurate and efficient object detection in UAV images is a challenging task due to the diversity of target scales and the massive number of small targets. This study investigates the enhancement in the detection head using sparse convolution, demonstrating its effectiveness in achieving an optimal balance between accuracy and efficiency. Nevertheless, the sparse convolution method encounters challenges related to the inadequate incorporation of global contextual information and exhibits network inflexibility attributable to its fixed mask ratios. To address the above issues, the MFFCESSC-SSD, a novel single-shot detector (SSD) with multi-scale feature fusion and context-enhanced spatial sparse convolution, is proposed in this paper. First, a global context-enhanced group normalization (CE-GN) layer is developed to address the issue of information loss resulting from the convolution process applied exclusively to the masked region. Subsequently, a dynamic masking strategy is designed to determine the optimal mask ratios, thereby ensuring compact foreground coverage that enhances both accuracy and efficiency. Experiments on two datasets (i.e., VisDrone and ARH2000; the latter dataset was created by the researchers) demonstrate that the MFFCESSC-SSD remarkably outperforms the performance of the SSD and numerous conventional object detection algorithms in terms of accuracy and efficiency.

Keywords:
Artificial intelligence Computer vision Single shot Computer science Object detection Convolution (computer science) Aerial image Detector Scale (ratio) Context (archaeology) Pattern recognition (psychology) Image (mathematics) Geography Cartography Physics Optics Artificial neural network Telecommunications

Metrics

5
Cited By
23.87
FWCI (Field Weighted Citation Impact)
39
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Infrared Target Detection Methodologies
Physical Sciences →  Engineering →  Aerospace Engineering
Advanced Image Processing Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Multi-scale Feature Fusion for Unmanned Aerial Vehicle Object Detection

Chunlong FanLanxin LiLinchao Zhu

Communications in computer and information science Year: 2025 Pages: 189-199
JOURNAL ARTICLE

Small Object Detection in Unmanned Aerial Vehicle Images Using Feature Fusion and Scaling-Based Single Shot Detector With Spatial Context Analysis

Liang XiJing ZhangZhuo LiYuzhao LiQi Tian

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2019 Vol: 30 (6)Pages: 1758-1770
JOURNAL ARTICLE

FFR-SSD: feature fusion and reconstruction single shot detector for multi-scale object detection

Xu ChengZhixiang WangChen SongZitong Yu

Journal:   Signal Image and Video Processing Year: 2023 Vol: 17 (6)Pages: 3145-3153
JOURNAL ARTICLE

A lightweight multi-feature fusion network for unmanned aerial vehicle infrared ray image object detection

Yunlei ChenZiyan LiuLihui ZhangYingyu WuQian ZhangXuhui Zheng

Journal:   The Egyptian Journal of Remote Sensing and Space Science Year: 2024 Vol: 27 (2)Pages: 268-276
© 2026 ScienceGate Book Chapters — All rights reserved.