JOURNAL ARTICLE

Adaptive Dual-Stream Sparse Transformer Network for Salient Object Detection in Optical Remote Sensing Images

Jie Zhao, Jia Yun, Lin Ma, Lidan Yu

Year: 2024; Journal: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing; Vol: 17; Pages: 5173-5192; Publisher: Institute of Electrical and Electronics Engineers

Abstract

Convolutional neural networks (CNNs) have demonstrated excellent performance in salient object detection for optical remote sensing images (ORSI-SOD). However, CNN feature extraction relies on a sliding-window approach, which hinders the capture of global representations. We therefore propose an end-to-end detection model for ORSI-SOD, the adaptive dual-stream sparse transformer network (ADSTNet), which is assisted by the vision transformer. It effectively addresses the problem of mutual compensation between global and local information in ORSI-SOD. In particular, an adaptive interaction encoder is devised that combines the multiscale sparse transformer and the pyramid atrous attention to form the adaptive dual-stream sparse encoder. This encoder collaborates with the CNN to enhance long-range dependency modeling and to preserve global information more effectively based on local features. In addition, a directional feature reconfiguration is constructed to extract texture details from multiple directional dimensions. Finally, we propose the adaptive feature cascade decoder, which synthesizes content information from the foreground, edges, and background to enhance the representational capacity of the image. Furthermore, a structural loss function, known as the weight compensation mechanism, is introduced to balance the boundary and saliency map (salmap) segmentation losses. Extensive experiments show that the proposed model outperforms 26 state-of-the-art ORSI-SOD methods across eight evaluation metrics on two standard datasets. To verify its robustness, we also report the generalization performance of the model on the latest challenging ORSI-4199 dataset.
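
As an illustration of the idea of balancing a boundary loss against a saliency map loss, the following is a minimal sketch only. The abstract does not give the form of the weight compensation mechanism, so everything here is an assumption: the use of binary cross-entropy for both terms, the fixed weights, and the names `bce`, `combined_loss`, `w_sal`, and `w_edge` are all hypothetical and are not taken from the paper.

```python
import math


def bce(pred, target, eps=1e-7):
    """Mean binary cross-entropy over two equal-length lists of values in [0, 1]."""
    total = 0.0
    for p, t in zip(pred, target):
        p = min(max(p, eps), 1.0 - eps)  # clamp so log() never sees 0
        total += -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
    return total / len(pred)


def combined_loss(sal_pred, sal_gt, edge_pred, edge_gt, w_sal=1.0, w_edge=1.0):
    """Weighted sum of a saliency map loss and a boundary loss.

    ASSUMPTION: fixed scalar weights and BCE for both terms; the paper's
    actual weight compensation mechanism may differ.
    """
    return w_sal * bce(sal_pred, sal_gt) + w_edge * bce(edge_pred, edge_gt)


# Toy flattened prediction and ground-truth maps (made-up values).
sal_pred = [0.9, 0.8, 0.2, 0.1]
sal_gt = [1.0, 1.0, 0.0, 0.0]
edge_pred = [0.6, 0.4, 0.5, 0.3]
edge_gt = [1.0, 0.0, 1.0, 0.0]

loss = combined_loss(sal_pred, sal_gt, edge_pred, edge_gt, w_sal=1.0, w_edge=0.5)
print(f"combined loss: {loss:.4f}")
```

Lowering `w_edge` relative to `w_sal` reduces the influence of the boundary term on the total, which is the kind of trade-off such a weighting is meant to control.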

Keywords:
Computer science; Dual (grammatical number); Object detection; Transformer; Artificial intelligence; Computer vision; Salient; Pattern recognition (psychology); Voltage; Physics

Metrics

Cited By: 37
FWCI (Field Weighted Citation Impact): 19.62
Refs: 112
Citation Normalized Percentile: 0.99
Is in top 1%
Is in top 10%

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Infrared Target Detection Methodologies
Physical Sciences →  Engineering →  Aerospace Engineering
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Transformer guidance dual-stream network for salient object detection in optical remote sensing images

Yi Zhang, Jichang Guo, Huihui Yue, Xiangjun Yin, Sida Zheng

Journal: Neural Computing and Applications; Year: 2023; Vol: 35 (24); Pages: 17733-17747

JOURNAL ARTICLE

Adaptive Spatial Tokenization Transformer for Salient Object Detection in Optical Remote Sensing Images

Lina Gao, Bing Liu, Ping Fu, Mingzhu Xu

Journal: IEEE Transactions on Geoscience and Remote Sensing; Year: 2023; Vol: 61; Pages: 1-15

JOURNAL ARTICLE

Salient Object Detection in Optical Remote Sensing Images Driven by Transformer

Gongyang Li, Zhen Bai, Zhi Liu, Xinpeng Zhang, Haibin Ling

Journal: IEEE Transactions on Image Processing; Year: 2023; Vol: 32; Pages: 5257-5269