JOURNAL ARTICLE

Bidirectional mutual guidance transformer for salient object detection in optical remote sensing images

Kan HuangChunwei TianGe Li

Year: 2023 Journal:   International Journal of Remote Sensing Vol: 44 (13)Pages: 4016-4033   Publisher: Taylor & Francis

Abstract

ABSTRACTSalient object detection in optical remote sensing images presents great challenges due to the characteristics of remote sensing images such as cluttered background, varying object scales, and unstable imaging conditions, etc. In this paper, we present a Bidirectional Mutual Guidance Transformer (BMGT), which mitigates the locality issue of CNN-based models, and exploits the mutual guidance between global context-aware object representations and fine-grained boundary structures. It contains a hierarchically structured Transformer encoder that extracts multi-level multi-scale token representations, and a dual-stream cross-task MLP decoder that performs joint salient object detection and salient boundary detection in an end-to-end manner. In particular, the dual-stream decoder consists of two sub-branch networks with symmetric architectures, that are connected by a newly proposed Mutual Guidance MLP layer (MG-MLP). Through MG-MLP, salient object features and salient boundary features interact with each other, facilitating complementary learning at multiple network levels. Extensive evaluations demonstrate that our proposed method outperforms other existing methods in two public remote sensing image benchmarks. It proves that our BMGT is advantageous in exploiting long-range context dependencies as well as preserving fine-grained boundary structures.KEYWORDS: Salient object detection, optical remote sensing imagesTransformer Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis work was supported in part by the National Science Foundation of China under Grant 62101316, in part by the China Postdoctoral Science Foundation Grant 2022TQ0259, in part by the Jiangsu Provincial Double–Innovation Doctor Program Grant JSSCBS20220942.

Keywords:
Computer science Salient Scalability Encoder Locality Artificial intelligence Transformer Object detection Computer vision Remote sensing Pattern recognition (psychology) Database Geography Engineering Electrical engineering

Metrics

13
Cited By
2.37
FWCI (Field Weighted Citation Impact)
49
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Visual Attention and Saliency Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.