JOURNAL ARTICLE

EST-STFM: An Efficient Deep-Learning-Based Spatiotemporal Fusion Method for Remote Sensing Images

Qiyuan Zhang, Xiaodan Zhang, Chen Quan, Tong Zhao, Wei Huo, Yuanchen Huang

Year: 2025 Journal: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Vol: 18 Pages: 18633-18655 Publisher: Institute of Electrical and Electronics Engineers

Abstract

Spatiotemporal fusion methods address the limitation that a single satellite cannot simultaneously provide imagery with both high spatial and high temporal resolution. By integrating images with different spatial and temporal characteristics, they can generate remote sensing data with enhanced spatial detail and temporal frequency. However, existing methods face the following challenges: 1) traditional approaches rely on linear assumptions; 2) convolutional neural networks struggle to capture global context; 3) generative adversarial networks suffer from mode collapse; and 4) Transformers excel at modeling global dependencies but are computationally intensive. To overcome these limitations, we propose an efficient hierarchical Transformer-based spatiotemporal fusion method, named the efficient sparse Transformer spatiotemporal fusion model (EST-STFM). This is the first model to introduce a Top-$K$ sparse attention mechanism into spatiotemporal fusion for remote sensing. The EST-STFM consists of a feature extraction network and a multibranch feature fusion network. The extraction network includes the TopSparseNet (TSN) and a multibranch feedforward neural network (MFNN). The fusion network is built on the multibranch feature fusion block (MFFB), which integrates multiple TSNs to combine multiscale features. The TSN adopts a Top-$K$ sparse self-attention mechanism, which reduces computational overhead while preserving critical local features; the MFNN improves multiscale representation learning; and the MFFB improves the fusion process by integrating features of different resolutions and semantic levels through four independent attention branches. Experimental results on three public datasets demonstrate that the EST-STFM outperforms existing methods in fusion performance. The effectiveness of each module is validated through ablation studies, while the model’s robustness and practical utility are further confirmed through efficiency analysis and a clustering task.
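
The Top-$K$ sparse self-attention idea mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' TSN implementation: the function name `topk_sparse_attention`, the single-head layout, and the tensor shapes are assumptions made for illustration. Each query keeps only its `top_k` largest attention scores and masks out the rest before the softmax. Note that this dense sketch still computes the full score matrix, so it shows the selection step only, not the computational savings reported for the model.

```python
import numpy as np

def topk_sparse_attention(q, k, v, top_k):
    """Single-head attention in which each query keeps only its top_k scores.

    Hypothetical helper for illustration; not the EST-STFM code.
    q: (n_q, d), k: (n_k, d), v: (n_k, d_v).
    """
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)                      # (n_q, n_k) scaled dot products
    top_k = min(top_k, scores.shape[-1])
    # Per-row threshold: the top_k-th largest score of each query.
    kth = np.partition(scores, -top_k, axis=-1)[:, -top_k][:, None]
    # Scores below the threshold are masked out (exact ties may keep extra keys).
    masked = np.where(scores >= kth, scores, -np.inf)
    # Numerically stable softmax over the surviving scores.
    masked = masked - masked.max(axis=-1, keepdims=True)
    weights = np.exp(masked)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights

# Toy usage: 6 tokens with 4-dimensional features, keeping 2 keys per query.
rng = np.random.default_rng(0)
q, k, v = (rng.normal(size=(6, 4)) for _ in range(3))
out, w = topk_sparse_attention(q, k, v, top_k=2)
print(out.shape, (w > 0).sum(axis=1))
```

Setting `top_k` equal to the number of keys recovers ordinary dense softmax attention, which makes the sparsity level an explicit tuning knob in this sketch.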

Keywords:
Computer science, Artificial intelligence, Fusion, Sensor fusion, Remote sensing, Deep learning, Geology

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 59
Citation Normalized Percentile: 0.32

Topics

Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Image and Signal Denoising Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Spatiotemporal Fusion of Remote Sensing Image Based on Deep Learning

Xiaofei Wang, Xiaoyi Wang

Journal: Journal of Sensors Year: 2020 Vol: 2020 Pages: 1-11

JOURNAL ARTICLE

A Two-Stage Spatiotemporal Fusion Method for Remote Sensing Images

Yue Sun, Zhang Hua

Journal: Photogrammetric Engineering & Remote Sensing Year: 2019 Vol: 85 (12) Pages: 907-914

BOOK-CHAPTER

Adaptive Remote Sensing Image Fusion Method Based on Deep Learning

Tongdi He, Shunhu Wang

Lecture notes in electrical engineering Year: 2023 Pages: 503-511

JOURNAL ARTICLE

An object-based spatiotemporal fusion model for remote sensing images

Zhang Hua, Yue Sun, Wenzhong Shi, Dizhou Guo, Nanshan Zheng

Journal: European Journal of Remote Sensing Year: 2021 Vol: 54 (1) Pages: 86-101