JOURNAL ARTICLE

Cross-modal multi-scale feature fusion-based RGB-T saliency object detection method

Guangyu ZhangLianqiang Niu

Year: 2023 Journal:   Journal of Physics Conference Series Vol: 2562 (1)Pages: 012032-012032   Publisher: IOP Publishing

Abstract

Abstract To cope with the challenge of significant target detection in complex scenes, this study proposes an RGB-T significant target detection method called CMFF. The method utilizes the complete potential of RGB and thermal infrared modal images and employs a codec structure and cross-modal multiscale feature fusion techniques. In the coding stage, two VGG16 backbone networks are used for multi-level feature extraction and CBAM attention module feature enhancement, and the enhanced features are fused using a stepwise fusion approach. Meanwhile, the weights of the two modalities are assigned using the L 1 -parametric fusion strategy to enhance the complementarity between them. In the decoding stage, global features are extracted from the high-level fused features by introducing the pyramid pooling module (PPM), and the low-level fused features are fused with multi-scale features in the up-sampling and encoding stages to enrich the global and local information of the feature map. Finally, this study conducted comparison experiments on the publicly available VT5000 dataset, and the method achieved an F-measure value of 0.863 and a mean absolute error (MAE) of 0.062, which significantly improved the overall detection performance relative to the six existing methods.

Keywords:
RGB color model Artificial intelligence Computer science Pattern recognition (psychology) Pyramid (geometry) Pooling Fusion Feature (linguistics) Feature extraction Computer vision Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
6
Refs
0.09
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Infrared Target Detection Methodologies
Physical Sciences →  Engineering →  Aerospace Engineering

Related Documents

JOURNAL ARTICLE

RGB-D Saliency Detection based on Cross-Modal and Multi-scale Feature Fusion

Xuxing ZhuJin WuLei Zhu

Journal:   2022 34th Chinese Control and Decision Conference (CCDC) Year: 2022 Pages: 6154-6160
JOURNAL ARTICLE

RGB-D Saliency Detection Based on Attention Mechanism and Multi-Scale Cross-Modal Fusion

Zhiqiang CuiZhengyong FengFeng WangQiang Liu

Journal:   Journal of Computer-Aided Design & Computer Graphics Year: 2023 Vol: 35 (6)Pages: 803-902
JOURNAL ARTICLE

MC-Net: A Multi-modal Cross-scale Feature Interaction Learning Method for RGB-T Salient Object Detection

Qi Qi

Journal:   IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences Year: 2025
JOURNAL ARTICLE

RGB-D Salient Object Detection Based on Cross-Modal and Cross-Level Feature Fusion

Yanbin PengZhinian ZhaiMingkun Feng

Journal:   IEEE Access Year: 2024 Vol: 12 Pages: 45134-45146
© 2026 ScienceGate Book Chapters — All rights reserved.