J. F. Qiu, Wei Chang, Wei Ren, Shanshan Hou, Ronghao Yang
Accurate semantic segmentation of high-resolution remote sensing imagery is challenged by substantial intra-class variability, inter-class similarity, and the limitations of single-modality data. This paper proposes MMFNet, a novel multimodal fusion network that leverages the Mamba architecture to efficiently capture long-range dependencies for semantic segmentation. MMFNet adopts a dual-encoder design, combining ResNet-18 for local detail extraction with VMamba for global contextual modelling, striking a balance between segmentation accuracy and computational efficiency. A Multimodal Feature Fusion Block (MFFB) is introduced to integrate complementary information from optical imagery and digital surface models (DSMs), thereby strengthening multimodal feature interaction and improving segmentation accuracy. Furthermore, a frequency-aware upsampling module (FreqFusion) is incorporated into the decoder to sharpen boundary delineation and recover fine spatial detail. Extensive experiments on the ISPRS Vaihingen and Potsdam benchmarks demonstrate that MMFNet achieves mean IoU scores of 83.50% and 86.06%, respectively, outperforming eight state-of-the-art methods while maintaining relatively low computational complexity. These results highlight MMFNet’s potential for efficient and accurate multimodal semantic segmentation in remote sensing applications.
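To make the dual-encoder idea concrete, below is a minimal PyTorch sketch of the data flow the abstract describes: an optical image passes through a ResNet-18 local branch, a DSM passes through a global-context branch, and the two feature maps are fused before decoding. This is an illustration under stated assumptions, not the authors' implementation: the MFFB internals are guessed as a simple gated fusion (the abstract does not specify them), a plain convolutional stem stands in for the VMamba state-space branch, and the decoder head omits FreqFusion in favor of bilinear upsampling.

```python
# Hedged sketch of MMFNet's dual-encoder design, reconstructed from the
# abstract alone. All module internals below are hypothetical placeholders.
import torch
import torch.nn as nn
from torchvision.models import resnet18


class MFFB(nn.Module):
    """Hypothetical multimodal feature fusion block: a gated merge of
    optical and DSM features (the paper's actual MFFB is not specified here)."""

    def __init__(self, channels: int):
        super().__init__()
        self.proj = nn.Conv2d(2 * channels, channels, kernel_size=1)
        self.gate = nn.Sequential(
            nn.Conv2d(2 * channels, channels, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, f_opt, f_dsm):
        x = torch.cat([f_opt, f_dsm], dim=1)
        return self.proj(x) * self.gate(x)


class DualEncoderSeg(nn.Module):
    def __init__(self, num_classes: int = 6):
        super().__init__()
        # Local-detail branch on the optical image: ResNet-18 stem + layer1.
        r = resnet18(weights=None)
        self.local = nn.Sequential(r.conv1, r.bn1, r.relu, r.maxpool, r.layer1)
        # Stand-in for the VMamba global-context branch on the DSM; the real
        # branch uses 2D selective-scan state-space blocks, not convolutions.
        self.global_ctx = nn.Sequential(
            nn.Conv2d(1, 64, kernel_size=7, stride=4, padding=3),
            nn.BatchNorm2d(64),
            nn.ReLU(inplace=True),
        )
        self.fuse = MFFB(64)
        # Simple decoder head; MMFNet uses FreqFusion upsampling instead.
        self.head = nn.Sequential(
            nn.Conv2d(64, num_classes, kernel_size=1),
            nn.Upsample(scale_factor=4, mode="bilinear", align_corners=False),
        )

    def forward(self, optical, dsm):
        f_opt = self.local(optical)   # (B, 64, H/4, W/4)
        f_dsm = self.global_ctx(dsm)  # (B, 64, H/4, W/4)
        return self.head(self.fuse(f_opt, f_dsm))


if __name__ == "__main__":
    model = DualEncoderSeg(num_classes=6)
    logits = model(torch.randn(2, 3, 256, 256),   # optical RGB patch
                   torch.randn(2, 1, 256, 256))   # single-channel DSM patch
    print(logits.shape)  # torch.Size([2, 6, 256, 256])
```

The gated fusion keeps both branches at the same channel width so the merge adds little overhead, which is consistent with the abstract's emphasis on balancing accuracy against computational cost; the actual trade-offs depend on the paper's MFFB and FreqFusion designs.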