Attention-Aware Cross-Modal Cross-Level Fusion Network for RGB-D Salient Object Detection

Hao Chen; Youfu Li; Dan Su

doi:10.1109/iros.2018.8594373

JOURNAL ARTICLE

Year: 2018 Pages: 6821-6826

Abstract

Convolutional neural networks have achieved wide success in RGB saliency detection. Recently, the advent of RGB-D sensors such as Kinect has provided additional geometric saliency cues. However, the key challenge for RGB-D salient object detection, namely how to fuse RGB and depth information sufficiently, remains under-studied. Traditional works mainly follow the two-stream architecture and combine RGB and depth features or decisions at an early or late stage. The multi-modal fusion stage is performed by directly concatenating the features from the two modalities without selection. In this work, we address this challenge by proposing a novel network built on one key insight: a selection module is significantly helpful for more informative and sufficient cross-modal cross-level combination. To this end, we introduce a top-down RGB-D fusion network that integrates an attention-aware cross-modal cross-level fusion block at each level to select discriminative features from each level and each modality. Extensive experiments on public datasets show that the proposed network addresses the key problems in RGB-D fusion and achieves state-of-the-art performance on RGB-D salient object detection.
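
The contrast the abstract draws between direct concatenation and attention-based selection can be illustrated with a small sketch. This is not the authors' fusion block: it assumes a generic squeeze-and-excitation-style channel gate, and every name in it (attention_fuse, channel_attention, the weights and bias arguments) is hypothetical. Each feature map is represented as a list of channels, and each channel as a flat list of activations.

```python
import math


def global_average(channel):
    """Squeeze one channel (a flat list of activations) to a single scalar."""
    return sum(channel) / len(channel)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(channels, weights, bias):
    """Return one gate in (0, 1) per channel.

    `weights` (C x C) and `bias` (length C) stand in for learned parameters;
    they are assumptions made for this illustration.
    """
    pooled = [global_average(c) for c in channels]
    return [
        sigmoid(sum(w * p for w, p in zip(row, pooled)) + b)
        for row, b in zip(weights, bias)
    ]


def attention_fuse(rgb, depth, top_down, weights, bias):
    """Concatenate the three sources along channels, then reweight each channel.

    Without the gates, `stacked` alone is the plain concatenation that the
    abstract describes as fusion "without selection".
    """
    stacked = rgb + depth + top_down
    gates = channel_attention(stacked, weights, bias)
    fused = [[g * v for v in ch] for g, ch in zip(gates, stacked)]
    return fused, gates


# Toy input: one channel per source, two spatial positions each.
rgb = [[1.0, 2.0]]
depth = [[3.0, 4.0]]
top_down = [[5.0, 6.0]]

# With all-zero parameters every gate is sigmoid(0) = 0.5, so nothing is selected yet.
zero_weights = [[0.0] * 3 for _ in range(3)]
fused, gates = attention_fuse(rgb, depth, top_down, zero_weights, [0.0] * 3)
print(gates)
print(fused)
```

In a trained network the gate parameters would be learned, so that channels from the less informative modality or level are suppressed for a given input; here they are fixed by hand only to show the mechanics.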

Keywords:

RGB color model; Computer science; Artificial intelligence; Convolutional neural network; Discriminative model; Computer vision; Key (lock); Fuse (electrical); Modal; Visualization; Object detection; Pattern recognition (psychology); Engineering

Metrics

Cited By

FWCI (Field Weighted Citation Impact): 3.18

Refs

Citation Normalized Percentile: 0.92

Topics

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image and Video Quality Assessment

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

CMA-SOD: cross-modal attention fusion network for RGB-D salient object detection

Three‐stream RGB‐D salient object detection network based on cross‐level and cross‐modal dual‐attention fusion

Cross-guided Cross-modal Feature Fusion Network for RGB-D Salient Object Detection

Boundary-Aware Cross-Level Multi-Scale Fusion Network for RGB-D Salient Object Detection

Progressive cross-level fusion network for RGB-D salient object detection