JOURNAL ARTICLE

Attention-guided cross-modal multiple feature aggregation network for RGB-D salient object detection

Bojian ChenWenbin WuZhezhou LiTengfei HanZhuolei ChenWeihao Zhang

Year: 2024 Journal:   Electronic Research Archive Vol: 32 (1)Pages: 643-669   Publisher: American Institute of Mathematical Sciences

Abstract

<abstract><p>The goal of RGB-D salient object detection is to aggregate the information of the two modalities of RGB and depth to accurately detect and segment salient objects. Existing RGB-D SOD models can extract the multilevel features of single modality well and can also integrate cross-modal features, but it can rarely handle both at the same time. To tap into and make the most of the correlations of intra- and inter-modality information, in this paper, we proposed an attention-guided cross-modal multi-feature aggregation network for RGB-D SOD. Our motivation was that both cross-modal feature fusion and multilevel feature fusion are crucial for RGB-D SOD task. The main innovation of this work lies in two points: One is the cross-modal pyramid feature interaction (CPFI) module that integrates multilevel features from both RGB and depth modalities in a bottom-up manner, and the other is cross-modal feature decoder (CMFD) that aggregates the fused features to generate the final saliency map. Extensive experiments on six benchmark datasets showed that the proposed attention-guided cross-modal multiple feature aggregation network (ACFPA-Net) achieved competitive performance over 15 state of the art (SOTA) RGB-D SOD methods, both qualitatively and quantitatively.</p></abstract>

Keywords:
RGB color model Feature (linguistics) Artificial intelligence Computer science Benchmark (surveying) Modal Pattern recognition (psychology) Modality (human–computer interaction) Salient Computer vision Geography

Metrics

3
Cited By
1.59
FWCI (Field Weighted Citation Impact)
114
Refs
0.72
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face Recognition and Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience
Olfactory and Sensory Function Studies
Life Sciences →  Neuroscience →  Sensory Systems

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.