Spatial structure-aware and cross-scale feature modeling network for remote sensing image semantic segmentation

Fang Huang; Yuxuan Guo

doi:10.3934/era.2025282

ScienceGate Book Chapters

JOURNAL ARTICLE

Spatial structure-aware and cross-scale feature modeling network for remote sensing image semantic segmentation

Fang Huang Yuxuan Guo

Year: 2025 Journal: Electronic Research Archive Vol: 33 (10)Pages: 6391-6417 Publisher: American Institute of Mathematical Sciences

DOI: 10.3934/era.2025282

Get Full-Text PDF Get Analytical Report

Abstract

Remote sensing images exhibit significant spatial geometric characteristics for ground objects such as buildings and roads, while targets within scenes show enormous scale variations, posing challenges to semantic segmentation algorithms' spatial structure modeling capabilities and cross-scale information processing abilities. Traditional methods lack specialized modeling mechanisms for spatial geometric features and suffer from information loss in multi-scale feature fusion. This paper proposes the SC-Net network, addressing these issues through three key technological innovations. First, we designed a feature attention layer where the spatial attention module captures spatial geometric patterns through directional feature decomposition, and the multi-scale attention module preserves feature information at different scales through adaptive pooling strategies. Second, we constructed a three-branch fusion transformer that employs cross-window attention and nine-group feature key-value pair interactions to achieve collaborative modeling of spatial, multi-scale, and global features. Finally, the multi-branch cascaded decoder enhances segmentation boundary accuracy through hierarchical feature fusion strategies. Comprehensive experiments on three standard remote sensing datasets validated the method's superiority. SC-Net achieved 63.04% mean intersection over union (MIOU) on Wuhan dense labeling dataset (WHDLD), 71.57% on Potsdam dataset, and 81.57% on Vaihingen dataset, outperforming state-of-the-art methods such as AerialFormer and SERNet by 0.67–2.12% MIOU. The method particularly demonstrated outstanding performance in scenarios with complex spatial structures and dense multi-scale targets, providing an effective solution for precise remote sensing image interpretation.

Keywords:

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Spatial structure-aware and cross-scale feature modeling network for remote sensing image semantic segmentation

Abstract

Metrics

Topics

Related Documents

Dual-Path Feature Aware Network for Remote Sensing Image Semantic Segmentation

A boundary-aware cross-scale feature enhancement network for the semantic segmentation of remote-sensing images

DCANet: A Dual-Branch Cross-Scale Feature Aggregation Network for Remote Sensing Image Semantic Segmentation

Remote sensing image semantic segmentation network based on multi-scale feature enhancement fusion

Cross-Scale Feature Propagation Network for Semantic Segmentation of High-Resolution Remote Sensing Images