JOURNAL ARTICLE

Weighted Guided Optional Fusion Network for RGB-T Salient Object Detection

Jie WangGuoqiang LiJie ShiJinwen Xi

Year: 2023 Journal:   ACM Transactions on Multimedia Computing Communications and Applications Vol: 20 (5)Pages: 1-20   Publisher: Association for Computing Machinery

Abstract

There is no doubt that the rational and effective use of visible and thermal infrared image data information to achieve cross-modal complementary fusion is the key to improving the performance of RGB-T salient object detection (SOD). A meticulous analysis of the RGB-T SOD data reveals that it mainly consists of three scenarios in which both modalities (RGB and T) have a significant foreground and only a single modality (RGB or T) is disturbed. However, existing methods are obsessed with pursuing more effective cross-modal fusion based on treating both modalities equally. Obviously, the subjective use of equivalence has two significant limitations. Firstly, it does not allow for practical discrimination of which modality makes the dominant contribution to performance. While both modalities may have visually significant foregrounds, differences in their imaging properties will result in distinct performance contributions. Secondly, in a specific acquisition scenario, a pair of images with two modalities will contribute differently to the final detection performance due to their varying sensitivity to the same background interference. Intelligibly, for the RGB-T saliency detection task, it would be more reasonable to generate exclusive weights for the two modalities and select specific fusion mechanisms based on different weight configurations to perform cross-modal complementary integration. Consequently, we propose a weighted guided optional fusion network (WGOFNet) for RGB-T SOD. Specifically, a feature refinement module is first used to perform an initial refinement of the extracted multilevel features. Subsequently, a weight generation module (WGM) will generate exclusive network performance contribution weights for each of the two modalities, and an optional fusion module (OFM) will rely on this weight to perform particular integration of cross-modal information. Simple cross-level fusion is finally utilized to obtain the final saliency prediction map. Comprehensive experiments on three publicly available benchmark datasets demonstrate the proposed WGOFNet achieves superior performance compared with the state-of-the-art RGB-T SOD methods. The source code is available at: https://github.com/WJ-CV/WGOFNet .

Keywords:
RGB color model Modalities Computer science Modality (human–computer interaction) Artificial intelligence Salient Feature (linguistics) Fusion Sensor fusion Pattern recognition (psychology) Modal Image fusion Computer vision Image (mathematics)

Metrics

10
Cited By
1.82
FWCI (Field Weighted Citation Impact)
63
Refs
0.83
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Olfactory and Sensory Function Studies
Life Sciences →  Neuroscience →  Sensory Systems
Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Edge-guided feature fusion network for RGB-T salient object detection

Yuanlin ChenZhenan SunCheng YanMing Zhao

Journal:   Frontiers in Neurorobotics Year: 2024 Vol: 18 Pages: 1489658-1489658
JOURNAL ARTICLE

CGFNet: Cross-Guided Fusion Network for RGB-T Salient Object Detection

Jie WangKechen SongYanqi BaoLiming HuangYunhui Yan

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2021 Vol: 32 (5)Pages: 2949-2961
BOOK-CHAPTER

Cross-Collaboration Weighted Fusion Network for RGB-T Salient Detection

Yumei WangChanglei DongyeWenxiu Zhao

Lecture notes in computer science Year: 2024 Pages: 301-312
JOURNAL ARTICLE

Modal complementary fusion network for RGB-T salient object detection

Shuai MaKechen SongHongwen DongHongkun TianYunhui Yan

Journal:   Applied Intelligence Year: 2022 Vol: 53 (8)Pages: 9038-9055
© 2026 ScienceGate Book Chapters — All rights reserved.