JOURNAL ARTICLE

Transformer-based Adaptive Interactive Promotion Network for RGB-T Salient Object Detection

Abstract

RGB-Thermal salient object detection (RGB-T SOD) aims to better segment the most salient objects with the cooperation of visual and thermal infrared images. The addition of thermal infrared images helps to improve the accuracy of robot decision-making when performing complex visual tasks. How to exploit the potential of multi-modal complementarity, tap the dominant modal information, and better complete object location is still a problem worthy of exploration. In this paper, we propose an adaptive interaction promotion network (AIPNet). In specific, we design a modal interaction module (MIM) with two parallel units to fuse the two modal features extracted by the encoders. The spatial interaction unit (SIU) is responsible for directly completing modal interaction and integration. The self-reinforcement unit (SRU) is responsible for enhancing two single-mode features and amplifying the role of dominant modal features. Besides, we use a query-location module (QLM) for high-level features to accurately confirm the location of salient objects. Finally, we adopt a re-calibration dual branch decoder (RCDB) to integrate the output features. Sufficient experiments conducted on RGB-T and RGB-D SOD datasets demonstrate that the proposed method performs favorably against the other 13 state-of-the-art methods.

Keywords:
Computer science RGB color model Artificial intelligence Computer vision Salient Modal Pattern recognition (psychology)

Metrics

5
Cited By
0.35
FWCI (Field Weighted Citation Impact)
33
Refs
0.66
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Olfactory and Sensory Function Studies
Life Sciences →  Neuroscience →  Sensory Systems
Gaze Tracking and Assistive Technology
Physical Sciences →  Computer Science →  Human-Computer Interaction

Related Documents

JOURNAL ARTICLE

Adaptive interactive network for RGB-T salient object detection with double mapping transformer

Feng DongYuxuan WangJinchao ZhuYuehua Li

Journal:   Multimedia Tools and Applications Year: 2023 Vol: 83 (20)Pages: 59169-59193
JOURNAL ARTICLE

Transformer-Based Cross-Modal Integration Network for RGB-T Salient Object Detection

Chengtao LvXiaofei ZhouBin WanShuai WangYaoqi SunJiyong ZhangChenggang Yan

Journal:   IEEE Transactions on Consumer Electronics Year: 2024 Vol: 70 (2)Pages: 4741-4755
JOURNAL ARTICLE

Interactive context-aware network for RGB-T salient object detection

Yuxuan WangFeng DongJinchao ZhuJianren Chen

Journal:   Multimedia Tools and Applications Year: 2024 Vol: 83 (28)Pages: 72153-72174
JOURNAL ARTICLE

Dual Swin-transformer based mutual interactive network for RGB-D salient object detection

Chao ZengSam KwongHorace H. S. Ip

Journal:   Neurocomputing Year: 2023 Vol: 559 Pages: 126779-126779
© 2026 ScienceGate Book Chapters — All rights reserved.