JOURNAL ARTICLE

DiffMamba: semantic diffusion guided feature modeling network for semantic segmentation of remote sensing images

Zhen WangNan XuZhuhong YouShanwen Zhang

Year: 2025 Journal:   GIScience & Remote Sensing Vol: 62 (1)   Publisher: Taylor & Francis

Abstract

With the rapid development of remote sensing technology, the application scope of high-resolution remote sensing images (HR-RSIs) has been continuously expanding. The emergence of convolutional neural networks and Transformer models has significantly enhanced the accuracy of semantic segmentation. However, these methods primarily focus on local feature extraction and long-range dependency modeling of global information, neglecting the spatial correlation of local features, which leads to poor segmentation of small-scale regions. To address this issue, based on Diffusion Model and State Space Model (SSM), we propose a semantic diffusion guided feature modeling network (DiffMamba) for HR-RSI semantic segmentation. DiffMamba uses a hybrid CNNs-Transformer as the encoder structure, and is equipped with the efficient phase sensing module (EPSM), the multi-view transformer module (MVTrans), the semantic diffusion alignment module (SDAM), and the coordinate state space model (CAMamba). EPSM focuses on enhancing local feature representation in the channel dimension, using the phase information of object region features to improve local information interaction and filter out clutter noise interference. MVTrans can observe the spatial location information of the object region from various perspectives to obtain refined global context details. SDAM utilizes the diffusion propagation process to fuse local and global information, alleviating the feature redundancy caused by semantic information differences. CAMamba employs state space transformation to construct the correlation of enhanced local features, and guides the model to achieve feature decoding to obtain refined semantic segmentation results. Extensive experiments on the widely used ISPRS 2-D Semantic Labeling dataset and the 15-Class Gaofen Image dataset confirm the superior efficiency of DiffMamba over several state-of-the-art methods.

Keywords:
Feature (linguistics) Segmentation Semantic feature Computer science Semantic network Remote sensing Artificial intelligence Geography Pattern recognition (psychology) Cartography Linguistics

Metrics

1
Cited By
4.77
FWCI (Field Weighted Citation Impact)
68
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Medical Image Segmentation Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Hidden Feature-Guided Semantic Segmentation Network for Remote Sensing Images

Zhen WangShanwen ZhangChuanlei ZhangBuhong Wang

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2023 Vol: 61 Pages: 1-17
JOURNAL ARTICLE

A Frequency Domain Feature-Guided Network for Semantic Segmentation of Remote Sensing Images

Xin LiFeng XuHongmin GaoFan LiuXin Lyu

Journal:   IEEE Signal Processing Letters Year: 2024 Vol: 31 Pages: 1369-1373
JOURNAL ARTICLE

Multilevel Feature Interaction Network for Remote Sensing Images Semantic Segmentation

Hongkun ChenHuilan Luo

Journal:   IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Year: 2024 Vol: 17 Pages: 19831-19852
JOURNAL ARTICLE

Frequency-Guided Denoising Network for Semantic Segmentation of Remote Sensing Images

Xin LiFeng XuJue ZhangHongsheng ZhangXin LyuFan LiuHongmin GaoAndré Kaup

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2025 Vol: 64 Pages: 1-17
© 2026 ScienceGate Book Chapters — All rights reserved.