JOURNAL ARTICLE

TANet: Transformer‐based asymmetric network for RGB‐D salient object detection

Chang Liu, Gang Yang, Shuo Wang, Hangxu Wang, Yunhua Zhang, Yutao Wang

Year: 2023   Journal: IET Computer Vision   Vol: 17 (4)   Pages: 415-430   Publisher: Institution of Engineering and Technology

Abstract

Existing RGB‐D salient object detection methods mainly rely on a symmetric two‐stream network based on Convolutional Neural Networks (CNNs) to extract RGB and depth features separately. This conventional symmetric structure has two problems: first, CNNs have limited ability to learn global context; second, the symmetric two‐stream structure ignores the inherent differences between the modalities. In this study, a Transformer‐based asymmetric network is proposed to tackle both issues. The authors use the strong feature extraction capability of the Transformer to extract global semantic information from RGB data, and design a lightweight CNN backbone, which requires no pre‐training, to extract spatial structure information from depth data. The asymmetric hybrid encoder reduces the number of model parameters and increases speed without sacrificing performance. A cross‐modal feature fusion module is then designed to enhance and fuse RGB and depth features with each other. Finally, the authors add edge prediction as an auxiliary task and propose an edge enhancement module to generate sharper contours. Extensive experiments demonstrate that the proposed method outperforms 14 state‐of‐the‐art RGB‐D methods on six public datasets. The authors' code will be released at https://github.com/lc012463/TANet.
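The abstract names edge prediction as an auxiliary task but does not say how the edge labels or the loss are defined. The following is a minimal sketch of one common way such a multi-task objective can be set up, not the authors' implementation. It assumes that edge labels are derived from the binary saliency mask as foreground pixels with a background 4-neighbour, that both tasks use binary cross-entropy, and that the two terms are summed with a weight. The names `mask_to_edges`, `bce`, `multitask_loss`, and `edge_weight` are hypothetical.

```python
import math


def mask_to_edges(mask):
    """Derive an edge label map from a binary saliency mask (assumed rule):
    a foreground pixel is an edge pixel if any in-bounds 4-neighbour is background."""
    h, w = len(mask), len(mask[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if mask[y][x] != 1:
                continue
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] == 0:
                    edges[y][x] = 1
                    break
    return edges


def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy between a probability map and a 0/1 label map."""
    total, count = 0.0, 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
            total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
            count += 1
    return total / count


def multitask_loss(sal_pred, edge_pred, sal_mask, edge_weight=1.0):
    """Saliency loss plus a weighted auxiliary edge loss (weighting is an assumption)."""
    edge_mask = mask_to_edges(sal_mask)
    return bce(sal_pred, sal_mask) + edge_weight * bce(edge_pred, edge_mask)
```

In this sketch the edge branch is supervised only through labels computed from the saliency ground truth, so no extra annotation is needed. Whether TANet derives its edge labels this way is not stated in the abstract.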

Keywords:
RGB color model, Computer science, Artificial intelligence, Feature extraction, Transformer, Encoder, Convolutional neural network, Pattern recognition (psychology), Computer vision, Salient, Voltage, Engineering

Metrics

Cited By: 18
FWCI (Field Weighted Citation Impact): 3.28
Refs: 77
Citation Normalized Percentile: 0.90

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Face Recognition and Perception
Life Sciences →  Neuroscience →  Cognitive Neuroscience
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Transformer-based difference fusion network for RGB-D salient object detection

Zhiqiang Cui, Feng Wang, Zhengyong Feng

Journal: Journal of Electronic Imaging   Year: 2022   Vol: 31 (06)

JOURNAL ARTICLE

Swin Transformer-Based Edge Guidance Network for RGB-D Salient Object Detection

Shuaihui Wang, Fengyi Jiang, Boqian Xu

Journal: Sensors   Year: 2023   Vol: 23 (21)   Pages: 8802-8802

JOURNAL ARTICLE

Asymmetric deep interaction network for RGB-D salient object detection

Feifei Wang, Yongming Li, Liejun Wang, Panpan Zheng

Journal: Expert Systems with Applications   Year: 2024   Vol: 266   Pages: 126083-126083