JOURNAL ARTICLE

ACMFNet: Attention-Based Cross-Modal Fusion Network for Building Extraction of Remote Sensing Images

Baiyu ChenZongxu PanJianwei YangHui Long

Year: 2024 Journal:   IEEE Transactions on Geoscience and Remote Sensing Vol: 62 Pages: 1-14   Publisher: Institute of Electrical and Electronics Engineers

Abstract

In recent years, significant progress has been made in extracting buildings from high spatial resolution (HSR) remote sensing images due to the rapid development of deep learning (DL). However, existing methods still have some limitations in maintaining the detail integrity of building footprint. Firstly, skip connections typically involve the direct concatenation of feature maps from adjacent levels, which inevitably leads to misalignment due to semantic differences. Second, the integration of building-related details remains a challenging task in the context of cross-modal remote sensing image. Third, the oversimplified upsampling structure used in previous methods may lead to loss of spatial details. In this paper, we propose a novel building extraction method ACMFNet based on cross-modal HSR remote sensing images using an encoder-decoder structure. First, we propose a global and local feature refinement module (GL-FRM) to refine features and establish contextual dependencies at multiple scales and levels, mitigating the spatial discrepancy among multi-level features. Meanwhile, a cross-modal fusion module (CFM) is utilized to integrate complementary features extracted from multispectral (MS) data and normalized digital surface model (nDSM) data. Additionally, we employed a lightweight residual upsampling module (RUM) for feature resolution recovery. We conducted complete experiments on two benchmark datasets, and the results indicate that our proposed ACMFNet achieves state-of-the-art (SOTA) performance without bells and whistles.

Keywords:
Computer science Remote sensing Modal Extraction (chemistry) Feature extraction Artificial intelligence Sensor fusion Computer vision Fusion Geology Materials science

Metrics

3
Cited By
1.84
FWCI (Field Weighted Citation Impact)
70
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Remote Sensing and Land Use
Physical Sciences →  Earth and Planetary Sciences →  Atmospheric Science
Automated Road and Building Extraction
Physical Sciences →  Engineering →  Ocean Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.