JOURNAL ARTICLE

Adaptive Spatial Tokenization Transformer for Salient Object Detection in Optical Remote Sensing Images

Lina GaoBing LiuPing FuMingzhu Xu

Year: 2023 Journal:   IEEE Transactions on Geoscience and Remote Sensing Vol: 61 Pages: 1-15   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Convolutional neural network (CNN)-based salient object detection (SOD) models have achieved promising performance in optical remote sensing images (ORSIs) in recent years. However, the restriction concerning the local sliding window operation of CNN has caused many existing CNN-based ORSI SOD models to still struggle with learning long-range relationships. To this end, a novel transformer framework is proposed for ORSI SOD, which is inspired by the powerful global dependency relationships of transformer networks. This is the first attempt to explore global and local details using transformer architecture for SOD in ORSIs. Concretely, we design an adaptive spatial tokenization transformer encoder to extract global–local features, which can accurately sparsify tokens for each input image and achieve competitive performance in ORSI SOD tasks. Then, a specific dense token aggregation decoder (DTAD) is proposed to generate saliency results, including three cascade decoders to integrate the global–local tokens and contextual dependencies. Extensive experiments indicate that the proposed model greatly surpasses 20 state-of-the-art (SOTA) SOD approaches on two standard ORSI SOD datasets under seven evaluation metrics. We also report comparison results to demonstrate the generalization capacity on the latest challenging ORSI datasets. In addition, we validate the contributions of different modules through a series of ablation analyses, especially the proposed adaptive spatial tokenization module (ASTM), which can halve the computational budget.

Keywords:
Computer science Convolutional neural network Artificial intelligence Transformer Encoder Security token Pattern recognition (psychology) Object detection Machine learning Voltage

Metrics

34
Cited By
6.19
FWCI (Field Weighted Citation Impact)
82
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Salient Object Detection in Optical Remote Sensing Images Driven by Transformer

Gongyang LiZhen BaiZhi LiuXinpeng ZhangHaibin Ling

Journal:   IEEE Transactions on Image Processing Year: 2023 Vol: 32 Pages: 5257-5269
JOURNAL ARTICLE

Adaptive Dual-Stream Sparse Transformer Network for Salient Object Detection in Optical Remote Sensing Images

Jie ZhaoJia YunLin MaLidan Yu

Journal:   IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Year: 2024 Vol: 17 Pages: 5173-5192
JOURNAL ARTICLE

Bidirectional mutual guidance transformer for salient object detection in optical remote sensing images

Kan HuangChunwei TianGe Li

Journal:   International Journal of Remote Sensing Year: 2023 Vol: 44 (13)Pages: 4016-4033
JOURNAL ARTICLE

Transformer guidance dual-stream network for salient object detection in optical remote sensing images

Yi ZhangJichang GuoHuihui YueXiangjun YinSida Zheng

Journal:   Neural Computing and Applications Year: 2023 Vol: 35 (24)Pages: 17733-17747
© 2026 ScienceGate Book Chapters — All rights reserved.