JOURNAL ARTICLE

Feature interaction and two-stage cross-modal fusion for RGB-D salient object detection

Ming YuJiali LiuYi LiuGang Yan

Year: 2023 Journal:   Journal of Intelligent & Fuzzy Systems Vol: 46 (2)Pages: 4543-4556   Publisher: IOS Press

Abstract

Most existing RGB-D salient object detection (SOD) methods extract features of both modalities in parallel or adopt depth features as supplementary information for unidirectional interaction from depth modality to RGB modality in the encoder stage. These methods ignore the influence of low-quality depth maps, and there is still room for improvement in effectively fusing RGB features and depth features. To address the above problems, this paper proposes a Feature Interaction Network (FINet), which performs bi-directional interaction through feature interaction module (FIM) in the encoder stage. The feature interaction module is divided into two parts: depth enhancement module (DEM) filters the noise in the depth features through the attention mechanism; and cross enhancement module (CEM) effectively interacts RGB features and depth features. In addition, this paper proposes a two-stage cross-modal fusion strategy: high-level fusion adopts the semantic information of high level for coarse localization of salient regions, and low-level fusion makes full use of the detailed information of low level through boundary fusion, and then we progressively refine high-level and low-level cross-modal features to obtain the final saliency prediction map. Extensive experiments show that the proposed model achieves better performance than eight state-of-the-art models on five standard datasets.

Keywords:
RGB color model Computer science Artificial intelligence Feature (linguistics) Encoder Fusion Salient Pattern recognition (psychology) Modal Fusion mechanism Computer vision Modality (human–computer interaction) Object (grammar)

Metrics

2
Cited By
0.36
FWCI (Field Weighted Citation Impact)
18
Refs
0.53
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face Recognition and Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience
Olfactory and Sensory Function Studies
Life Sciences →  Neuroscience →  Sensory Systems
© 2026 ScienceGate Book Chapters — All rights reserved.