JOURNAL ARTICLE

MultiTrans-LC: Multimodal Fusion Transformer for Remote Sensing Land Cover Classification

Qixuan WangNing LiYiheng ChenHai Zhu

Year: 2025 Journal:   ˜The œinternational archives of the photogrammetry, remote sensing and spatial information sciences/International archives of the photogrammetry, remote sensing and spatial information sciences Vol: XLVIII-1/W5-2025 Pages: 133-138   Publisher: Copernicus Publications

Abstract

Abstract. The use of remote sensing images for land cover classification is crucial for environmental monitoring, urban planning, and sustainable resource management. Despite advances in deep learning, existing methods suffer from blurred boundaries in complex landscapes and perform poorly in identifying small or overlapping land cover categories. This article introduces MultiTrans LC, a novel multimodal fusion framework that integrates visual language interaction and boundary perception optimization to address these challenges. The proposed architecture utilizes a hierarchical Transformer encoder to extract global visual features from high-resolution images and aligns them with semantic embeddings in text prompts through cross modal attention. The visual language decoder further refines the multi-scale feature representation through progressive fusion, while the edge aware loss function jointly optimizes pixel level classification and boundary localization. Experiments on three benchmark datasets (GID-15, LoveDA, RSSCN7) have demonstrated state-of-the-art performance, achieving an overall accuracy of 90.7% and a Kappa coefficient of 0.901 on GID-15, which is 1.6% higher than the leading method in OA. Visualization confirms that MultiTrans LC performs well compared to CNN and Transformer baselines. By bridging visual and textual semantics, MultiTrans LC improves the accuracy of large-scale land cover mapping and provides a powerful solution for geospatial intelligence applications. Discussed the limitations and future directions of open vocabulary classification and edge device deployment.

Keywords:

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Related Documents

JOURNAL ARTICLE

Land Cover Classification Based on Multimodal Remote Sensing Fusion

Wei ChenJiage ChenYuewu WanXining LiuMengya CaiJingguo XuHongbo CuiMengdie Duan

Journal:   ISPRS annals of the photogrammetry, remote sensing and spatial information sciences Year: 2024 Vol: X-1-2024 Pages: 35-40
JOURNAL ARTICLE

Multimodal Fusion Transformer for Remote Sensing Image Classification

Swalpa Kumar RoyAnkur DeriaDanfeng HongBehnood RastiAntonio PlazaJocelyn Chanussot

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2023 Vol: 61 Pages: 1-20
JOURNAL ARTICLE

Multimodal Remote Sensing Benchmark Datasets for Land Cover Classification

Jing YaoDanfeng HongLianru GaoJocelyn Chanussot

Journal:   IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium Year: 2022 Pages: 4807-4810
JOURNAL ARTICLE

Cross‐Transformer Fusion Network for Multimodal Remote Sensing Image Classification

Huiqing WangZhongyu LiLinfeng Wu

Journal:   The Photogrammetric Record Year: 2025 Vol: 40 (191)
© 2026 ScienceGate Book Chapters — All rights reserved.