Wei Huang, Dazhan Zhou, Le Sun, Qiqiang Chen, Junru Yin
Significant progress has been achieved in hyperspectral image (HSI) classification research through the application of transformer blocks. Although transformers possess strong long-range dependency modeling capabilities, they primarily extract nonlocal information from patches and often fail to fully capture global information, leading to incomplete spectral–spatial feature extraction. In contrast, graph convolutional networks (GCNs) can effectively extract features from the global structure. This article proposes an adaptive pixel-level and superpixel-level feature fusion transformer (APSFFT). The network comprises two branches: a CNN-and-transformer network (CNTN) and a GCN-and-transformer network (GNTN), which extract pixel-level and superpixel-level feature information from the HSI, respectively. The CNTN leverages the strength of convolutional neural networks (CNNs) in extracting spectral–spatial information, combined with the transformer's ability to establish long-range dependencies through self-attention (SA). The GNTN fully extracts superpixel-level features while also establishing long-range dependencies. To adaptively fuse the features from these two branches, an adaptive cross-token attention fusion (ACTAF) encoder is employed. The ACTAF encoder fuses the classification tokens from both branches through SA, thereby enhancing the model's ability to capture interactions between pixel-level and superpixel-level features. We compared APSFFT with seven advanced HSI classification algorithms, and the experiments show that it outperforms these state-of-the-art methods.
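The abstract describes, but does not specify, how the ACTAF encoder fuses the two branches' classification tokens via self-attention. Below is a minimal, non-authoritative PyTorch sketch of this kind of cross-token attention fusion; the class name, dimensions, class count, and the two-way query arrangement are all assumptions for illustration, not the paper's implementation.

```python
import torch
import torch.nn as nn

class CrossTokenAttentionFusion(nn.Module):
    """Hypothetical ACTAF-style fusion: each branch's classification token
    attends over the other branch's full token sequence via attention."""
    def __init__(self, dim: int = 64, num_heads: int = 4, num_classes: int = 16):
        super().__init__()
        self.attn_p2s = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.attn_s2p = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.classifier = nn.Linear(2 * dim, num_classes)  # class count is a placeholder

    def forward(self, pixel_tokens, superpixel_tokens):
        # Both inputs: (B, N+1, dim), with token 0 as the classification token
        cls_p = pixel_tokens[:, :1]        # pixel-level classification token
        cls_s = superpixel_tokens[:, :1]   # superpixel-level classification token
        # Cross attention: the pixel cls token queries the superpixel sequence,
        # and the superpixel cls token queries the pixel sequence
        fused_p, _ = self.attn_p2s(cls_p, superpixel_tokens, superpixel_tokens)
        fused_s, _ = self.attn_s2p(cls_s, pixel_tokens, pixel_tokens)
        fused = torch.cat([fused_p.squeeze(1), fused_s.squeeze(1)], dim=-1)
        return self.classifier(fused)

# Usage with random features standing in for the two branch outputs
model = CrossTokenAttentionFusion(dim=64, num_heads=4, num_classes=16)
pix = torch.randn(8, 17, 64)   # batch of 8: 16 patch tokens + 1 cls token
spx = torch.randn(8, 9, 64)    # 8 superpixel tokens + 1 cls token
logits = model(pix, spx)
print(logits.shape)            # torch.Size([8, 16])
```

Querying each branch's classification token against the other branch's sequence is one plausible way to let pixel-level and superpixel-level features interact before classification, which is the interaction the abstract attributes to the ACTAF encoder.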