RGB-D salient object detection has been one of the most active research topics in computer vision in recent years. It aims to automatically detect and segment salient objects in a scene by combining information from RGB images and depth images. Existing RGB-D salient object detection methods generally operate on each modality directly, without considering the complementarity between modalities, and their multi-modal fusion does not fully explore the differences between modalities. To address these two problems, we propose a cross-guided cross-modal feature fusion network (CCFFNet), composed of a cross-guided feature enhancement (CFE) module and a multi-modal feature fusion (MFF) module. Specifically, in the CFE module, cross-modal feature representations are enhanced by guided learning of mutual feature weights between the RGB and depth branches, fully exploiting the complementarity between the two modalities. In addition, the MFF module draws on modal features from multiple levels and refines the fused features with attention, making the saliency predictions of the model more accurate. Extensive experiments on five benchmark datasets show that our model outperforms seven state-of-the-art methods.
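The cross-guided weighting idea described above can be sketched as follows. This is a minimal, hypothetical illustration (not the authors' implementation): each modality produces channel weights via global average pooling and a sigmoid, and those weights re-scale the *other* modality's features, so RGB and depth guide each other's enhancement. The function name and the residual form are assumptions for illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cross_guided_enhance(f_rgb, f_depth):
    """Hypothetical sketch of cross-guided feature enhancement.

    f_rgb, f_depth: feature maps of shape (C, H, W).
    Each modality's channel statistics (global average pooling +
    sigmoid) re-weight the other modality's channels; a residual
    connection preserves the original features.
    """
    w_rgb = sigmoid(f_rgb.mean(axis=(1, 2)))      # weights derived from RGB
    w_depth = sigmoid(f_depth.mean(axis=(1, 2)))  # weights derived from depth
    # Cross guidance: apply each modality's weights to the other modality.
    f_rgb_enh = f_rgb * w_depth[:, None, None] + f_rgb
    f_depth_enh = f_depth * w_rgb[:, None, None] + f_depth
    return f_rgb_enh, f_depth_enh

rng = np.random.default_rng(0)
f_rgb = rng.standard_normal((4, 8, 8))
f_depth = rng.standard_normal((4, 8, 8))
f_rgb_enh, f_depth_enh = cross_guided_enhance(f_rgb, f_depth)
```

In a real network these weights would typically be produced by small learned layers rather than raw pooled statistics; the sketch only conveys the mutual-guidance structure.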
Zhenyu Zhang, Huiyan Chen, Qingzhen Xu, Qiang Chen
Shuaihui Wang, Fengyi Jiang, Boqian Xu
Bojian Chen, Wenbin Wu, Zhezhou Li, Tengfei Han, Zhuolei Chen, Weihao Zhang
Yanbin Peng, Zhinian Zhai, Mingkun Feng
Hongbo Bi, Jiayuan Zhang, Ranwan Wu, Yuyu Tong, Wei Jin