Zhihong Zeng, Haijun Liu, Fenglei Chen, Xiaoheng Tan
Salient object detection (SOD) aims to identify the most prominent regions in images. However, the large model sizes, high computational costs, and slow inference speeds of existing RGB-D SOD models hinder their deployment on real-world embedded devices. To address this issue, we propose AirSOD, a novel method for lightweight RGB-D SOD. Specifically, we first design a hybrid feature extraction network that combines the first three stages of MobileNetV2 with our Parallel Attention-Shift convolution (PAS) module. The PAS module captures both long-range dependencies and local information to enhance representation learning while significantly reducing the number of parameters and the computational complexity. Second, we propose a Multi-level and Multi-modal feature Fusion (MMF) module to facilitate feature fusion, and a Multi-path enhancement for Feature Refinement (MFR) decoder for feature integration. Compared with the state-of-the-art lightweight model MobileSal, the proposed method reduces the model size by 63%, decreases the computational complexity by 43%, and improves the inference speed by 43%. We evaluate AirSOD on six widely used RGB-D SOD datasets, and extensive experimental results demonstrate that our method achieves satisfactory performance. The source code will be made available.
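The abstract does not detail the PAS module's internals, but "shift convolution" conventionally refers to a parameter-free spatial shift (as in ShiftNet): channel groups are displaced by one pixel in different directions so that a following 1x1 convolution can mix spatial context at near-zero cost, which is how such designs cut parameters and FLOPs. A minimal NumPy sketch of that generic shift operation, with all names hypothetical and no claim about AirSOD's actual implementation:

```python
import numpy as np

def spatial_shift(x):
    """Parameter-free spatial shift (generic ShiftNet-style sketch).

    x: feature map of shape (C, H, W). Channels are split into four
    groups, each shifted by one pixel in a different direction;
    vacated positions are zero-filled. A subsequent 1x1 convolution
    can then aggregate spatial context with no parameters spent here.
    """
    c, h, w = x.shape
    g = c // 4
    out = np.zeros_like(x)
    out[0*g:1*g, 1:, :] = x[0*g:1*g, :-1, :]   # group 0: shift down
    out[1*g:2*g, :-1, :] = x[1*g:2*g, 1:, :]   # group 1: shift up
    out[2*g:3*g, :, 1:] = x[2*g:3*g, :, :-1]   # group 2: shift right
    out[3*g:, :, :-1] = x[3*g:, :, 1:]         # remainder: shift left
    return out

# Example: a 4-channel all-ones map; each group's vacated border row/column
# becomes zero after the shift.
y = spatial_shift(np.ones((4, 3, 3)))
```

Because the shift itself has no learnable weights, all capacity sits in the cheap pointwise convolutions around it, which is the usual rationale for pairing shifts with attention in lightweight backbones.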