JOURNAL ARTICLE

STransFuse: Fusing Swin Transformer and Convolutional Neural Network for Remote Sensing Image Semantic Segmentation

Liang Gao, Hui Liu, Minhang Yang, Long Chen, Yaling Wan, Zhengqing Xiao, Yurong Qian

Year: 2021   Journal: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing   Vol: 14   Pages: 10990-11003   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Convolutional neural networks (CNNs) have driven applied research on remote sensing images. Because their receptive field has a fixed size, CNNs cannot model global semantic relevance. Transformer models based on self-attention can model global semantic information. However, the patch-based computation that Transformers use for self-attention ignores the spatial information inside each patch. To address these issues, we propose STransFuse, a new semantic segmentation method for remote sensing images. The model combines the benefits of the Transformer and the CNN to improve the segmentation quality of various remote sensing images. Unlike earlier techniques based on Transformer model fusion, we employ a staged model to extract coarse-grained and fine-grained feature representations at various semantic scales. To take full advantage of the features acquired at different stages, we design an adaptive fusion module that uses a self-attention mechanism to adaptively fuse the semantic information between features at different scales. On the Vaihingen dataset, the overall accuracy (OA) of the proposed model is 1.36% higher than the baseline; on the Potsdam dataset, the OA improvement over the baseline is 1.27%. Compared with other advanced models, STransFuse also performs well.
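For readers unfamiliar with the overall accuracy (OA) metric quoted above, the following is a minimal sketch of how OA is conventionally computed from a pixel-level confusion matrix: correctly classified pixels divided by all pixels. The function name `overall_accuracy` and the pixel counts are hypothetical and purely illustrative; they are not taken from the paper. The abstract does not say whether the reported gains are absolute or relative, so the sketch reports the difference in percentage points as one possible reading.

```python
def overall_accuracy(confusion):
    """Return overall accuracy: correctly classified pixels / all pixels.

    `confusion` is a square list of lists where confusion[i][j] counts
    pixels of ground-truth class i that were predicted as class j.
    """
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix contains no pixels")
    # Diagonal entries are the pixels whose prediction matches the label.
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


# Hypothetical 3-class pixel counts, NOT results from the paper.
baseline_cm = [[50, 5, 5], [4, 30, 6], [3, 2, 45]]
proposed_cm = [[52, 4, 4], [3, 33, 4], [2, 2, 46]]

baseline_oa = overall_accuracy(baseline_cm)
proposed_oa = overall_accuracy(proposed_cm)

# Difference expressed in percentage points (assumed reading of "x% higher").
gain_points = 100.0 * (proposed_oa - baseline_oa)
print(f"baseline OA = {baseline_oa:.4f}, proposed OA = {proposed_oa:.4f}")
print(f"gain = {gain_points:.2f} percentage points")
```

OA weights every pixel equally, so it can be dominated by frequent classes; it is usually read alongside per-class metrics.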

Keywords:
Computer science; Convolutional neural network; Segmentation; Transformer; Artificial intelligence; Computation; Pattern recognition (psychology); Computer vision; Algorithm

Metrics

Cited By: 215
FWCI (Field Weighted Citation Impact): 15.95
Refs: 60
Citation Normalized Percentile: 0.99
Is in top 1%
Is in top 10%

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation

Xin He, Yong Zhou, Jiaqi Zhao, Di Zhang, Rui Yao, Yong Xue

Journal: IEEE Transactions on Geoscience and Remote Sensing   Year: 2022   Vol: 60   Pages: 1-15

JOURNAL ARTICLE

Combining Swin Transformer With UNet for Remote Sensing Image Semantic Segmentation

Lili Fan, Yu Zhou, Hongmei Liu, Yunjie Li, Dongpu Cao

Journal: IEEE Transactions on Geoscience and Remote Sensing   Year: 2023   Vol: 61   Pages: 1-11

JOURNAL ARTICLE

Semantic Segmentation of Remote Sensing Image Based on Convolutional Neural Network

双玲 朱

Journal: Computer Science and Application   Year: 2021   Vol: 11 (02)   Pages: 356-369