JOURNAL ARTICLE

MMFNet: A Mamba-Based Multimodal Fusion Network for Remote Sensing Image Semantic Segmentation

J. F. Qiu, Wei Chang, Wei Ren, Shanshan Hou, Ronghao Yang

Year: 2025   Journal: Sensors   Vol: 25 (19)   Pages: 6225-6225   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Accurate semantic segmentation of high-resolution remote sensing imagery is challenged by substantial intra-class variability, inter-class similarity, and the limitations of single-modality data. This paper proposes MMFNet, a novel multimodal fusion network that leverages the Mamba architecture to efficiently capture long-range dependencies for semantic segmentation tasks. MMFNet adopts a dual-encoder design that combines ResNet-18 for local detail extraction with VMamba for global contextual modelling, striking a balance between segmentation accuracy and computational efficiency. A Multimodal Feature Fusion Block (MFFB) is introduced to integrate complementary information from optical imagery and digital surface models (DSMs), thereby enhancing multimodal feature interaction and improving segmentation accuracy. Furthermore, a frequency-aware upsampling module (FreqFusion) is incorporated in the decoder to enhance boundary delineation and recover fine spatial details. Extensive experiments on the ISPRS Vaihingen and Potsdam benchmarks demonstrate that MMFNet achieves mean IoU scores of 83.50% and 86.06%, respectively, outperforming eight state-of-the-art methods while maintaining relatively low computational complexity. These results highlight MMFNet’s potential for efficient and accurate multimodal semantic segmentation in remote sensing applications.
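
The abstract reports results as mean IoU (mean intersection over union). As a reminder of what that metric measures, the sketch below computes it from a confusion matrix on toy labels. It is an illustration only, not the authors' evaluation code: the function names `confusion_matrix` and `mean_iou` and the toy labels are hypothetical, and benchmark-specific conventions (such as which classes are excluded from the average) are not modelled.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count label pairs: cm[t][p] is the number of pixels with true class t predicted as p."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_iou(cm):
    """Average per-class IoU = TP / (TP + FP + FN), skipping classes absent from both inputs."""
    ious = []
    for c in range(len(cm)):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                    # true class c, predicted as something else
        fp = sum(row[c] for row in cm) - tp     # predicted c, true class is something else
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and six "pixels" (flattened label maps).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
score = mean_iou(cm)  # per-class IoU: 1/3, 2/3, 1/2
print(f"mean IoU = {score:.4f}")
```

In this toy case the per-class IoU values are 1/3, 2/3 and 1/2, which average to 0.5. On real label maps the same computation is applied to the flattened ground-truth and predicted masks.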

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 39
Citation Normalized Percentile: 0.42

Topics

Advanced Neural Network Applications
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Remote-Sensing Image Classification
Physical Sciences → Engineering → Media Technology
