RGB-Guided Depth Feature Enhancement for RGB–Depth Salient Object Detection

Zhihong Zeng; Jiahao He; Yue Zhan; Haijun Liu; Xian Tan

doi:10.3390/electronics13244915

ScienceGate Book Chapters

JOURNAL ARTICLE

RGB-Guided Depth Feature Enhancement for RGB–Depth Salient Object Detection

Zhihong Zeng Jiahao He Yue Zhan Haijun Liu Xian Tan

Year: 2024 Journal: Electronics Vol: 13 (24)Pages: 4915-4915 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/electronics13244915

Get Full-Text PDF Get Analytical Report

Abstract

RGB-D (depth) Salient Object Detection (SOD) seeks to identify and segment the most visually compelling objects within a given scene. Depth data, known for their strong discriminative capability in spatial localization, provide an advantage in achieving accurate RGB-D SOD. However, recent research in this field has encountered significant challenges due to the poor visual qualities and disturbing cues in raw depth maps. This issue results in indistinct or ambiguous depth features, which consequently weaken the performance of RGB-D SOD. To address this problem, we propose a novel pseudo depth feature generation-based RGB-D SOD Network, named PDFNet, which can generate some new and more distinctive pseudo depth features as an extra supplement source to enhance the raw depth features. Specifically, we first introduce an RGB-guided pseudo depth feature generation subnet to synthesize more distinctive pseudo depth features for raw depth feature enhancement, since the discriminative power of depth features plays a pivotal role in providing effective contour and spatial cues. Then, we propose a cross-modal fusion mamba (CFM) to effectively merge RGB features, raw depth features, and generated pseudo depth features. We adopt a channel selection strategy within the CFM module to align the pseudo depth features with raw depth features, thereby enhancing the depth features. We test the proposed PDFNet on six commonly used RGB-D SOD benchmark datasets. Extensive experimental results validate that the proposed approach achieves superior performance. For example, compared to the previous cutting-edge method, AirSOD, our method improves the F-measure by 2%, 1.7%, 1.1%, and 2.2% on the STERE, DUTLF-D, NLPR, and NJU2K datasets, respectively.

Keywords:

RGB color model Artificial intelligence Computer vision Salient Feature (linguistics) Depth map Measured depth Computer science Object detection Depth perception Object (grammar) Pattern recognition (psychology) Geology Computer graphics (images) Image (mathematics) Psychology

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.25

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Visual Attention and Saliency Detection

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Video Surveillance and Tracking Methods

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Enhancement Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

RGB-Guided Depth Feature Enhancement for RGB–Depth Salient Object Detection

Abstract

Metrics

Topics

Related Documents

Depth Quality Enhancement and Feature Shrinking Guided RGB-D Salient Object Detection

RDFCNet: RGB-guided depth feature calibration network for RGB-D salient object detection

TranSal: Depth-guided Transformer for RGB-D Salient Object Detection

Adaptive Depth Enhancement Network for RGB-D Salient Object Detection

Guided residual network for RGB-D salient object detection with efficient depth feature learning