JOURNAL ARTICLE

Cascaded CNN and global–local attention transformer network-based semantic segmentation for high-resolution remote sensing image

Abstract

High-resolution remote sensing images (HRRSIs) contain rich local spatial information and long-distance location dependence, which play an important role in semantic segmentation tasks and have received more and more research attention. However, HRRSIs often exhibit large intraclass variance and small interclass variance due to the diversity and complexity of ground objects, thereby bringing great challenges to a semantic segmentation task. In most networks, there are numerous small-scale object omissions and large-scale object fragmentations in the segmentation results because of insufficient local feature extraction and low global information utilization. A network cascaded by convolution neural network and global–local attention transformer is proposed called CNN-transformer cascade network. First, convolution blocks and global–local attention transformer blocks are used to extract multiscale local features and long-range location information, respectively. Then a multilevel channel attention integration block is designed to fuse geometric features and semantic features of different depths and revise the channel weights through the channel attention module to resist the interference of redundant information. Finally, the smoothness of the segmentation is improved through the implementation of upsampling using a deconvolution operation. We compare our method with several state-of-the-art methods on the ISPRS Vaihingen and Potsdam datasets. Experimental results show that our method can improve the integrity and independence of multiscale objects segmentation results.

Keywords:
Computer science Image segmentation Artificial intelligence Image resolution Segmentation Computer vision Transformer Remote sensing Pattern recognition (psychology) Geology

Metrics

2
Cited By
1.23
FWCI (Field Weighted Citation Impact)
59
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Brain Tumor Detection and Classification
Life Sciences →  Neuroscience →  Neurology
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Remote Sensing Image Semantic Segmentation Based on Cascaded Transformer

Falin WangJian JiYuan Wang

Journal:   IEEE Transactions on Artificial Intelligence Year: 2024 Vol: 5 (8)Pages: 4136-4148
JOURNAL ARTICLE

GLSANet: Global-Local Self-Attention Network for Remote Sensing Image Semantic Segmentation

Xudong HuPenglin ZhangQi ZhangFeng Yuan

Journal:   IEEE Geoscience and Remote Sensing Letters Year: 2023 Vol: 20 Pages: 1-5
JOURNAL ARTICLE

RSSGLT: Remote Sensing Image Segmentation Network Based on Global–Local Transformer

Satyawant KumarAbhishek KumarDong-Gyu Lee

Journal:   IEEE Geoscience and Remote Sensing Letters Year: 2023 Vol: 21 Pages: 1-5
JOURNAL ARTICLE

GLMCNet: A Global-Local Multiscale Context Network for High-Resolution Remote Sensing Image Semantic Segmentation

Yanting ZhangQiang LiuChuanzhao TianXuewen LiNa YangFeng ZhangHongyue Zhang

Journal:   Computers, materials & continua/Computers, materials & continua (Print) Year: 2025 Vol: 86 (1)Pages: 1-25
© 2026 ScienceGate Book Chapters — All rights reserved.