GLFFNet: Global–Local Feature Fusion Network for High-Resolution Remote Sensing Image Semantic Segmentation

Shengqi Zhu; Liaoying Zhao; Qingjiang Xiao; Jigang Ding; Xiaorun Li

doi:10.3390/rs17061019

JOURNAL ARTICLE

Year: 2025

Journal: Remote Sensing

Vol: 17 (6)

Pages: 1019-1019

Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Although hybrid models that combine convolutional neural networks (CNNs) and Transformers can extract features encompassing both global and local information, they still face two challenges in the semantic segmentation of high-resolution remote sensing (HR2S) images. First, they are limited by the loss of detailed information during encoding, resulting in inadequate utilization of features. Second, the ineffective fusion of local and global context information leads to unsatisfactory segmentation performance. To simultaneously address these two challenges, we propose a dual-branch network named global–local feature fusion network (GLFFNet) for HR2S image semantic segmentation. Specifically, we use the residual network (ResNet) as the main branch to extract local features. Recently, the Mamba architecture, which is based on state space models, has shown significant potential in image semantic segmentation tasks. Given that Mamba is capable of handling long-range relationships with linear computational complexity and relatively high speed, we introduce VMamba as an auxiliary branch encoder to provide global information for the main branch. Meanwhile, in order to utilize global information efficiently, we propose a multi-scale feature refinement (MSFR) module to reduce the loss of details during global feature extraction. Additionally, we develop a semantic bridging fusion (SBF) module to promote the full integration of global and local features, resulting in more comprehensive and refined feature representations. Comparative experiments on three public datasets demonstrate the segmentation accuracy and application potential of GLFFNet. Specifically, GLFFNet achieves mIoU scores of 84.01% on ISPRS Vaihingen, 87.54% on ISPRS Potsdam, and 54.73% on LoveDA, as well as mF1 scores of 91.11%, 93.23%, and 70.07% on these respective datasets.
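The abstract reports results as mIoU and mF1. As a rough illustration of how these two metrics are commonly computed from a class confusion matrix, here is a minimal NumPy sketch. The function names `confusion_matrix` and `miou_and_mf1` are hypothetical, and the handling of absent classes is an assumption; the paper's exact evaluation protocol (for example, which classes are ignored on each dataset) is not stated on this page.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, num_classes):
    """Build a num_classes x num_classes matrix (rows: ground truth, cols: prediction)."""
    y_true = np.asarray(y_true).ravel()
    y_pred = np.asarray(y_pred).ravel()
    # Encode each (true, pred) pair as a single index and count occurrences.
    idx = y_true * num_classes + y_pred
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)


def miou_and_mf1(cm):
    """Return (mean IoU, mean F1) averaged over classes that occur at all.

    Assumption: classes with no ground-truth and no predicted pixels are
    skipped rather than counted as zero.
    """
    cm = cm.astype(np.float64)
    tp = np.diag(cm)
    fp = cm.sum(axis=0) - tp  # predicted as the class, but labeled otherwise
    fn = cm.sum(axis=1) - tp  # labeled as the class, but predicted otherwise
    iou_den = tp + fp + fn
    f1_den = 2 * tp + fp + fn
    present = iou_den > 0
    iou = tp[present] / iou_den[present]
    f1 = 2 * tp[present] / f1_den[present]
    return iou.mean(), f1.mean()


if __name__ == "__main__":
    # Toy example with two classes and four pixels.
    cm = confusion_matrix([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
    miou, mf1 = miou_and_mf1(cm)
    print(f"mIoU={miou:.4f}, mF1={mf1:.4f}")
```

In practice the confusion matrix would be accumulated over every tile of the test set before the per-class scores are averaged, so that large and small images contribute in proportion to their pixel counts.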

Keywords:

Computer science; Feature (linguistics); Artificial intelligence; High resolution; Remote sensing; Segmentation; Image fusion; Computer vision; Pattern recognition (psychology); Image (mathematics); Geology

Metrics

FWCI (Field Weighted Citation Impact): 28.14

Citation Normalized Percentile: 0.98

Topics

Remote-Sensing Image Classification

Physical Sciences → Engineering → Media Technology

Advanced Image Fusion Techniques

Physical Sciences → Engineering → Media Technology

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

GLFFNet: A Global and Local Features Fusion Network with Biencoder for Remote Sensing Image Segmentation

Global-Local Feature Cross Fusion Network for Semantic Segmentation of Remote Sensing Images

Global-local semantic fusion for wetland remote sensing image segmentation

GLMCNet: A Global-Local Multiscale Context Network for High-Resolution Remote Sensing Image Semantic Segmentation

DGLFNet: A Dual-Branch Global-Local Fusion Network for Remote Sensing Image Semantic Segmentation