JOURNAL ARTICLE

Learning Adaptive Fusion Bank for Multi-Modal Salient Object Detection

Kunpeng WangZhengzheng TuChenglong LiCheng ZhangBin Luo

Year: 2024 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 34 (8)Pages: 7344-7358   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Multi-modal salient object detection (MSOD) aims to boost saliency detection performance by integrating visible sources with depth or thermal infrared ones. Existing methods generally design different fusion schemes to handle certain issues or challenges. Although these fusion schemes are effective at addressing specific issues or challenges, they may struggle to handle multiple complex challenges simultaneously. To solve this problem, we propose a novel adaptive fusion bank that makes full use of the complementary benefits from a set of basic fusion schemes to handle different challenges simultaneously for robust MSOD. We focus on handling five major challenges in MSOD, namely center bias, scale variation, image clutter, low illumination, and thermal crossover or depth ambiguity. The fusion bank proposed consists of five representative fusion schemes, which are specifically designed based on the characteristics of each challenge, respectively. The bank is scalable, and more fusion schemes could be incorporated into the bank for more challenges. To adaptively select the appropriate fusion scheme for multi-modal input, we introduce an adaptive ensemble module that forms the adaptive fusion bank, which is embedded into hierarchical layers for sufficient fusion of different source data. Moreover, we design an indirect interactive guidance module to accurately detect salient hollow objects via the skip integration of high-level semantic information and low-level spatial details. Extensive experiments on three RGBT datasets and seven RGBD datasets demonstrate that the proposed method achieves the outstanding performance compared to the state-of-the-art methods.

Keywords:
Computer science Fusion Scalability Image fusion Sensor fusion Artificial intelligence Modal Salient Clutter Data mining Pattern recognition (psychology) Computer vision Machine learning Image (mathematics) Radar

Metrics

48
Cited By
25.45
FWCI (Field Weighted Citation Impact)
82
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Olfactory and Sensory Function Studies
Life Sciences →  Neuroscience →  Sensory Systems
Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Unified-Modal Salient Object Detection via Adaptive Prompt Learning

Kunpeng WangZhengzheng TuChenglong LiZhengyi LiuBin Luo

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2025 Vol: 35 (12)Pages: 12109-12124
JOURNAL ARTICLE

BMFNet: Bifurcated multi-modal fusion network for RGB-D salient object detection

Chenwang SunQing ZhangChenyu ZhuangMingqian Zhang

Journal:   Image and Vision Computing Year: 2024 Vol: 147 Pages: 105048-105048
JOURNAL ARTICLE

RGBD Salient Object Detection via Disentangled Cross-Modal Fusion

Hao ChenYongjian DengYoufu LiTzu-Yi HungGuosheng Lin

Journal:   IEEE Transactions on Image Processing Year: 2020 Vol: 29 Pages: 8407-8416
© 2026 ScienceGate Book Chapters — All rights reserved.