JOURNAL ARTICLE

Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection

Abstract

By integrating complementary information from RGB image and depth map, the ability of salient object detection (SOD) for complex and challenging scenes can be improved. In recent years, the important role of Convolutional Neural Networks (CNNs) in feature extraction and cross-modality interaction has been fully explored, but it is still insufficient in modeling global long-range dependencies of self-modality and cross-modality. To this end, we introduce CNNs-assisted Transformer architecture and propose a novel RGB-D SOD network with Point-aware Interaction and CNN-induced Refinement (PICR-Net). On the one hand, considering the prior correlation between RGB modality and depth modality, an attention-triggered cross-modality point-aware interaction (CmPI) module is designed to explore the feature interaction of different modalities with positional constraints. On the other hand, in order to alleviate the block effect and detail destruction problems brought by the Transformer naturally, we design a CNN-induced refinement (CNNR) unit for content refinement and supplementation. Extensive experiments on five RGB-D SOD datasets show that the proposed network achieves competitive results in both quantitative and qualitative comparisons. Our code is publicly available at: https://github.com/rmcong/PICR-Net_ACMMM23.

Keywords:
Computer science RGB color model Modality (human–computer interaction) Convolutional neural network Artificial intelligence Pattern recognition (psychology) Feature extraction Block (permutation group theory) Computer vision Feature (linguistics) Salient Deep learning Mathematics

Metrics

52
Cited By
9.46
FWCI (Field Weighted Citation Impact)
35
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face Recognition and Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience
Gaze Tracking and Assistive Technology
Physical Sciences →  Computer Science →  Human-Computer Interaction

Related Documents

JOURNAL ARTICLE

Depth-aware inverted refinement network for RGB-D salient object detection

Lina GaoBing LiuPing FuMingzhu Xu

Journal:   Neurocomputing Year: 2022 Vol: 518 Pages: 507-522
JOURNAL ARTICLE

Modal-Aware Interaction Network for RGB-D Salient Object Detection

Longsheng WeiZiQiang Zhu

Journal:   IEEE Transactions on Instrumentation and Measurement Year: 2025 Vol: 74 Pages: 1-12
JOURNAL ARTICLE

Global-aware Interaction Network for RGB-D salient object detection

Zijian JiangLing YuHan YuJunru LiFanglin Niu

Journal:   Neurocomputing Year: 2024 Vol: 621 Pages: 129204-129204
JOURNAL ARTICLE

Context-aware network for RGB-D salient object detection

Fangfang LiangLijuan DuanWei MaYuanhua QiaoJun MiaoQixiang Ye

Journal:   Pattern Recognition Year: 2020 Vol: 111 Pages: 107630-107630
JOURNAL ARTICLE

Depth‐aware lightweight network for RGB‐D salient object detection

Liuyi LingYiwen WangChengjun WangShanyong XuYourui Huang

Journal:   IET Image Processing Year: 2023 Vol: 17 (8)Pages: 2350-2361
© 2026 ScienceGate Book Chapters — All rights reserved.