JOURNAL ARTICLE

Swin Transformer-Based Edge Guidance Network for RGB-D Salient Object Detection

Shuaihui WangFengyi JiangBoqian Xu

Year: 2023 Journal:   Sensors Vol: 23 (21)Pages: 8802-8802   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Salient object detection (SOD), which is used to identify the most distinctive object in a given scene, plays an important role in computer vision tasks. Most existing RGB-D SOD methods employ a CNN-based network as the backbone to extract features from RGB and depth images; however, the inherent locality of a CNN-based network limits the performance of CNN-based methods. To tackle this issue, we propose a novel Swin Transformer-based edge guidance network (SwinEGNet) for RGB-D SOD in which the Swin Transformer is employed as a powerful feature extractor to capture the global context. An edge-guided cross-modal interaction module is proposed to effectively enhance and fuse features. In particular, we employed the Swin Transformer as the backbone to extract features from RGB images and depth maps. Then, we introduced the edge extraction module (EEM) to extract edge features and the depth enhancement module (DEM) to enhance depth features. Additionally, a cross-modal interaction module (CIM) was used to integrate cross-modal features from global and local contexts. Finally, we employed a cascaded decoder to refine the prediction map in a coarse-to-fine manner. Extensive experiments demonstrated that our SwinEGNet achieved the best performance on the LFSD, NLPR, DES, and NJU2K datasets and achieved comparable performance on the STEREO dataset compared to 14 state-of-the-art methods. Our model achieved better performance compared to SwinNet, with 88.4% parameters and 77.2% FLOPs. Our code will be publicly available.

Keywords:
RGB color model Artificial intelligence Computer science Transformer Feature extraction Computer vision Backbone network Pattern recognition (psychology) Salient Engineering Voltage

Metrics

3
Cited By
0.55
FWCI (Field Weighted Citation Impact)
53
Refs
0.62
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Gaze Tracking and Assistive Technology
Physical Sciences →  Computer Science →  Human-Computer Interaction
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

SwinNet: Swin Transformer Drives Edge-Aware RGB-D and RGB-T Salient Object Detection

Zhengyi LiuYacheng TanQian HeYun Xiao

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2021 Vol: 32 (7)Pages: 4486-4497
JOURNAL ARTICLE

Dual Swin-transformer based mutual interactive network for RGB-D salient object detection

Chao ZengSam KwongHorace H. S. Ip

Journal:   Neurocomputing Year: 2023 Vol: 559 Pages: 126779-126779
JOURNAL ARTICLE

Transformer-based difference fusion network for RGB-D salient object detection

Zhiqiang CuiFeng WangZhengyong Feng

Journal:   Journal of Electronic Imaging Year: 2022 Vol: 31 (06)
© 2026 ScienceGate Book Chapters — All rights reserved.