JOURNAL ARTICLE

A Multi-Scale Feature Fusion Based Lightweight Vehicle Target Detection Network on Aerial Optical Images

Chengrui Yu, Xiaonan Jiang, Fanlu Wu, Yao Fu, Junyan Pei, Yu Zhang, Xiangzhi Li, Tianjiao Fu

Year: 2024   Journal: Remote Sensing   Vol: 16 (19)   Pages: 3637-3637   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Vehicle detection in optical remote sensing images has been widely applied in recent years, but several challenges remain unsolved. Vehicles are densely distributed at arbitrary angles, which makes them difficult to detect; large model parameter counts (Params) hinder real-time detection; the features of larger vehicles differ widely, which reduces detection precision; and the class distribution in vehicle datasets is unbalanced, which is not conducive to training. First, this paper constructs a small vehicle dataset, MiVehicle, which contains 3000 paired infrared and visible images and offers a more balanced distribution. In the infrared part of the dataset, the proportions of vehicle types are as follows: cars, 48%; buses, 19%; trucks, 15%; freight cars, 10%; and vans, 8%. Second, we adopt the rotated box mechanism for detection and build a new vehicle detector, ML-Det, with a novel multi-scale feature fusion triple cross-criss FPN (TCFPN), which effectively captures vehicle features at three different positions and improves mAP by 1.97%. Moreover, we propose LKC–INVO, which allows involution to couple the structure of multiple large kernel convolutions and increases mAP by 2.86%. We also introduce a novel C2F_ContextGuided module with global context perception, which enhances the model's perception over the global scope while minimizing Params. Finally, we propose an assemble–disperse attention module that aggregates local features to improve performance. Overall, ML-Det achieved a 3.22% improvement in accuracy while keeping Params almost unchanged. On the self-built small MiVehicle dataset, we achieved 70.44% on visible images and 79.12% on infrared images with 20.1 GFLOPS, 78.8 FPS, and 7.91 M Params. Additionally, we trained and tested our model on the public datasets UAS-AOD and DOTA, where ML-Det was ahead of many other advanced target detection algorithms.
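
The rotated box mechanism mentioned above can be illustrated with a minimal sketch. The abstract does not give ML-Det's box parameterization, so the `(cx, cy, w, h, theta)` form, the angle convention, and the function names `rotated_box_corners` and `polygon_area` below are assumptions chosen for illustration only, not the authors' implementation. The sketch converts one rotated box into its four corner points and checks that rotation leaves the box area unchanged.

```python
import math


def rotated_box_corners(cx, cy, w, h, theta):
    """Corners of a w x h box centred at (cx, cy), rotated by theta radians.

    Assumed convention: theta is counter-clockwise in standard x/y axes.
    In image coordinates with y pointing down, the same rotation appears
    clockwise on screen.
    """
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # Corner offsets of the axis-aligned box, relative to its centre.
    offsets = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    # Rotate each offset, then translate it to the box centre.
    return [
        (cx + dx * cos_t - dy * sin_t, cy + dx * sin_t + dy * cos_t)
        for dx, dy in offsets
    ]


def polygon_area(points):
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2


if __name__ == "__main__":
    corners = rotated_box_corners(10.0, 5.0, 4.0, 2.0, math.pi / 6)
    print(corners)
    print(polygon_area(corners))
```

Compared with an axis-aligned box, the extra angle lets a box hug a vehicle parked at an arbitrary orientation, which is the motivation the abstract gives for densely packed scenes.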

Keywords:
Computer science, Artificial intelligence, Computer vision, Remote sensing, Aerial image, Scale (ratio), Fusion, Feature (linguistics), Image (mathematics), Geology, Cartography, Geography

Metrics

Cited By: 6
FWCI (Field Weighted Citation Impact): 7.91
Refs: 50
Citation Normalized Percentile: 0.95

Topics

Infrared Target Detection Methodologies
Physical Sciences →  Engineering →  Aerospace Engineering
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Measurement and Detection Methods
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
