Shuo Wang, Breanna Shi, Ninglian Wang, Yuzhu Zhang, Yan Zhu
Fully exploiting spectral correlation to improve spatial resolution is a central focus of current hyperspectral image super-resolution (HSI-SR) research. Existing approaches either combine deep models with advanced attention mechanisms in an end-to-end framework or concentrate on modeling prior estimates of the spectral bands and the spatial domain. Most of these methods are designed for supervised learning with paired labels, yet they may also benefit from self-supervised techniques such as masked autoencoders (MAE). This work addresses the single hyperspectral image super-resolution problem and develops a multiscale masked hybrid convolution-transformer framework. Its starting point is to apply a random mask to the input signal to reduce the redundancy of the original features, which the model combines with multiscale representation inference to improve its learning and generalization capabilities. However, we found that naively deploying MAE for HSI-SR leads to subpar performance. To address this and coordinate with the multiscale network, we propose a multiscale interchannel masking fusion strategy that saves computational overhead and bridges the gap between spectral and spatial resolution. Extensive evaluations on three benchmark datasets demonstrate that the proposed method outperforms state-of-the-art methods.
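The abstract does not specify the exact masking scheme, but the interchannel masking at multiple scales it describes can be illustrated with a minimal sketch. The function names (`random_channel_mask`, `multiscale_channel_masks`), the mask ratios, and the zero-fill convention below are assumptions for illustration, not the authors' implementation:

```python
import numpy as np

def random_channel_mask(hsi, mask_ratio=0.5, rng=None):
    """Zero out a random subset of spectral channels of an HSI cube.

    hsi: array of shape (C, H, W); mask_ratio: fraction of channels hidden.
    Returns the masked cube and a boolean array marking visible channels.
    """
    rng = np.random.default_rng(rng)
    c = hsi.shape[0]
    n_masked = int(round(c * mask_ratio))
    # Choose which spectral channels to hide, without replacement.
    masked_idx = rng.choice(c, size=n_masked, replace=False)
    visible = np.ones(c, dtype=bool)
    visible[masked_idx] = False
    out = hsi.copy()
    out[~visible] = 0.0  # masked channels are filled with zeros
    return out, visible

def multiscale_channel_masks(hsi, ratios=(0.75, 0.5, 0.25), rng=None):
    """One masked view per scale: heavier masking for coarser branches.

    A multiscale network could feed each view to its matching branch and
    fuse the resulting features, in the spirit of the proposed strategy.
    """
    rng = np.random.default_rng(rng)
    return [random_channel_mask(hsi, r, rng) for r in ratios]
```

Masking whole spectral channels (rather than spatial patches, as in image MAE) exploits the high inter-band redundancy of hyperspectral data: the network must reconstruct hidden bands from their spectral neighbors, which directly encourages learning the spectral correlations that HSI-SR relies on.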