JOURNAL ARTICLE

Cross-guided Cross-modal Feature Fusion Network for RGB-D Salient Object Detection

Abstract

RGB-D salient object detection has been one of the hottest research topics in the field of computer vision in recent years. The research in this field aims to achieve automatic detection and segmentation of salient targets in scenes by combining information from RGB images and depth images. The existing RGB-D salient object detection methods generally operate directly on different modules without considering the complementarity between different modes. And for multi-modal fusion, the differences between different modalities have not been fully explored. In order to better solve the above two problems, we propose a cross-guided cross-modal feature fusion network(CCFFNet). It is composed of a cross-guided feature enhancement (CFE) module and a multi-modal feature fusion (MFF) module. Specifically, in the cross-guided feature enhancement module, the representation of cross modal features is enhanced through guided learning of the mutual feature weights between RGB and depth, fully exploring the complementarity between RGB and depth. In addition, we also utilize multiple different levels of modal features to participate in fusion, enhancing the fusion features through attention, making the model significance prediction of RGB-D salient object detection more accurate. Finally, extensive experiments on five benchmark datasets have shown that our model outperforms the other seven state-of-the-art methods, while also demonstrating its superiority.

Keywords:
RGB color model Artificial intelligence Computer science Feature (linguistics) Computer vision Salient Modal Pattern recognition (psychology) Object detection Fusion Benchmark (surveying) Complementarity (molecular biology)

Metrics

1
Cited By
0.18
FWCI (Field Weighted Citation Impact)
7
Refs
0.46
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face Recognition and Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience
Ocular and Laser Science Research
Health Sciences →  Medicine →  Ophthalmology
© 2026 ScienceGate Book Chapters — All rights reserved.