JOURNAL ARTICLE

Hybrid Vision Transformer Model for Hyperspectral Image Classification

Jiaqi YangBo DuChen Wu

Year: 2022 Journal:   IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium Pages: 1388-1391

Abstract

Due to the local connectivity property, convolutional neural network (CNN) can effectively extract contextual detailed information. Therefore, a large number of CNN-based methods are introduced to hyperspectral image (HSI) classification. However, receptive fields of these methods are greatly limited, and information extraction process is usually inadequate. Recently, transformer structure has attracted extensive attention owing to its ability to capture global dependency. With a self-attention mechanism, transformer can extract long-tail distribution and model global features to enhance the representation of data. Consequently, it is a natural idea to combine CNN and transformer to obtain both local detail and global distribution. In this paper, we propose a hybrid vision transformer model (Hybrid ViT) to jointly learn global and local information of HSI, including a convolution block and a vision transformer block. With the unified architecture, Hybrid ViT model can not only access detailed features of narrow targets but also extract the global distribution of large objects. Experimental results on benchmark HSI datasets demonstrate that the proposed Hybrid ViT can outperform other methods with higher classification accuracy and finer classification maps.

Keywords:
Computer science Artificial intelligence Convolutional neural network Transformer Pattern recognition (psychology) Hyperspectral imaging Engineering

Metrics

10
Cited By
2.48
FWCI (Field Weighted Citation Impact)
6
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Remote Sensing and Land Use
Physical Sciences →  Earth and Planetary Sciences →  Atmospheric Science
Advanced Image Fusion Techniques
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Masked Vision Transformer for Fast Hyperspectral Image Classification

Liguo WangHeng WangShoulin YinLifeng Wang

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2025 Vol: 63 Pages: 1-16
JOURNAL ARTICLE

BinaryViT: Binary Vision Transformer for Hyperspectral Image Classification

Xiang HuTaolin LiuZhe GuoYuxiang TangYuanxi PengTong Zhou

Journal:   IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Year: 2025 Vol: 18 Pages: 20469-20486
© 2026 ScienceGate Book Chapters — All rights reserved.