JOURNAL ARTICLE

Hierarchical Alternate Interaction Network for RGB-D Salient Object Detection

Gongyang LiZhi LiuMinyu ChenZhen BaiWeisi LinHaibin Ling

Year: 2021 Journal:   IEEE Transactions on Image Processing Vol: 30 Pages: 3528-3542   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Existing RGB-D Salient Object Detection (SOD) methods take advantage of depth cues to improve the detection accuracy, while pay insufficient attention to the quality of depth information. In practice, a depth map is often with uneven quality and sometimes suffers from distractors, due to various factors in the acquisition procedure. In this article, to mitigate distractors in depth maps and highlight salient objects in RGB images, we propose a Hierarchical Alternate Interactions Network (HAINet) for RGB-D SOD. Specifically, HAINet consists of three key stages: feature encoding, cross-modal alternate interaction, and saliency reasoning. The main innovation in HAINet is the Hierarchical Alternate Interaction Module (HAIM), which plays a key role in the second stage for cross-modal feature interaction. HAIM first uses RGB features to filter distractors in depth features, and then the purified depth features are exploited to enhance RGB features in turn. The alternate RGB-depth-RGB interaction proceeds in a hierarchical manner, which progressively integrates local and global contexts within a single feature scale. In addition, we adopt a hybrid loss function to facilitate the training of HAINet. Extensive experiments on seven datasets demonstrate that our HAINet not only achieves competitive performance as compared with 19 relevant state-of-the-art methods, but also reaches a real-time processing speed of 43 fps on a single NVIDIA Titan X GPU. The code and results of our method are available at https://github.com/MathLee/HAINet.

Keywords:
RGB color model Computer science Artificial intelligence Salient Computer vision Feature (linguistics) Encoding (memory) Feature extraction Pattern recognition (psychology) Key (lock)

Metrics

302
Cited By
24.84
FWCI (Field Weighted Citation Impact)
141
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Progressively Guided Alternate Refinement Network for RGB-D Salient Object Detection

Shuhan ChenYun Fu

Lecture notes in computer science Year: 2020 Pages: 520-538
JOURNAL ARTICLE

Asymmetric deep interaction network for RGB-D salient object detection

Feifei WangYongming LiLiejun WangPanpan Zheng

Journal:   Expert Systems with Applications Year: 2024 Vol: 266 Pages: 126083-126083
JOURNAL ARTICLE

Modal-Aware Interaction Network for RGB-D Salient Object Detection

Longsheng WeiZiQiang Zhu

Journal:   IEEE Transactions on Instrumentation and Measurement Year: 2025 Vol: 74 Pages: 1-12
© 2026 ScienceGate Book Chapters — All rights reserved.