Abstract

Efficiently exploiting multi-modal inputs for accurate RGB-D saliency detection is a topic of high interest. Most existing works leverage cross-modal interactions to fuse the two streams of RGB-D for intermediate features' enhancement. In this process, a practical aspect of the low quality of the available depths has not been fully considered yet. In this work, we aim for RGB-D saliency detection that is robust to the low-quality depths which primarily appear in two forms: inaccuracy due to noise and the misalignment to RGB. To this end, we propose a robust RGB-D fusion method that benefits from (1) layer-wise, and (2) trident spatial, attention mechanisms. On the one hand, layer-wise attention (LWA) learns the trade-off between early and late fusion of RGB and depth features, depending upon the depth accuracy. On the other hand, trident spatial attention (TSA) aggregates the features from a wider spatial context to address the depth misalignment problem. The proposed LWA and TSA mechanisms allow us to efficiently exploit the multi-modal inputs for saliency detection while being robust against low-quality depths. Our experiments on five bench-mark datasets demonstrate that the proposed fusion method performs consistently better than the state-of-the-art fusion alternatives. The source code is publicly available at: https://github.com/Zongwei97/RFnet.

Keywords:
RGB color model Computer science Artificial intelligence Leverage (statistics) Fuse (electrical) Computer vision Fusion Robustness (evolution) Context (archaeology) Pattern recognition (psychology) Engineering

Metrics

39
Cited By
4.58
FWCI (Field Weighted Citation Impact)
65
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Industrial Vision Systems and Defect Detection
Physical Sciences →  Engineering →  Industrial and Manufacturing Engineering

Related Documents

JOURNAL ARTICLE

RGB-D Image Saliency Detection via Background and Foreground Fusion

Qiang Wang

Journal:   DOAJ (DOAJ: Directory of Open Access Journals) Year: 2020
BOOK-CHAPTER

RGB-D Saliency Detection by Multi-stream Late Fusion Network

Hao ChenYoufu LiDan Su

Lecture notes in computer science Year: 2017 Pages: 459-468
JOURNAL ARTICLE

Attentive Cross-Modal Fusion Network for RGB-D Saliency Detection

Di LiuKao ZhangZhenzhong Chen

Journal:   IEEE Transactions on Multimedia Year: 2020 Vol: 23 Pages: 967-981
JOURNAL ARTICLE

Uncertainty Inspired RGB-D Saliency Detection

Jing ZhangDeng-Ping FanYuchao DaiSaeed AnwarFatemeh Sadat SalehSadegh AliakbarianNick Barnes

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2021 Vol: 44 (9)Pages: 1-1
JOURNAL ARTICLE

RGB-D Image Saliency Detection Based on Cross-Model Feature Fusion

Zheng ChenXiaoli ZhaoJiaying ZhangMingchen YinHanchen YeHaojun Zhou

Journal:   Journal of Computer-Aided Design & Computer Graphics Year: 2021 Vol: 33 (11)Pages: 1688-1697
© 2026 ScienceGate Book Chapters — All rights reserved.