Multi-Modal Domain Generalization for Cross-Scene Hyperspectral Image Classification

Yuxiang Zhang; Mengmeng Zhang; Wei Li; Ran Tao

doi:10.1109/icassp49357.2023.10095723

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-Modal Domain Generalization for Cross-Scene Hyperspectral Image Classification

Yuxiang Zhang Mengmeng Zhang Wei Li Ran Tao

Year: 2023 Pages: 1-5

DOI: 10.1109/icassp49357.2023.10095723

Get Full-Text PDF Get Analytical Report

Abstract

The large-scale pre-training image-text foundation models have excelled in a number of downstream applications. The majority of domain generalization techniques, however, have never focused on mining linguistic modal knowledge to enhance model generalization performance. Additionally, text information has been ignored in hyperspectral image classification (HSI) tasks. To address the aforementioned shortcomings, a Multi-modal Domain Generalization Network (MDG) is proposed to learn cross-domain invariant representation from cross-domain shared semantic space. Only the source domain (SD) is used for training in the proposed method, after which the model is directly transferred to the target domain (TD). Visual and linguistic features are extracted using the dual-stream architecture, which consists of an image encoder and a text encoder. A generator is designed to obtain extended domain (ED) samples that are different from SD. Furthermore, linguistic features are used to construct a cross-domain shared semantic space, where visual-linguistic alignment is accomplished by supervised contrastive learning. Extensive experiments on two datasets show that the proposed method outperforms state-of-the-art approaches.

Keywords:

Computer science Artificial intelligence Generalization Domain (mathematical analysis) Hyperspectral imaging Image (mathematics) Pattern recognition (psychology) Representation (politics) Modal Generator (circuit theory) Encoder Contextual image classification Autoencoder Natural language processing Artificial neural network Mathematics

Metrics

Cited By

1.09

FWCI (Field Weighted Citation Impact)

Refs

0.76

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Remote-Sensing Image Classification

Physical Sciences → Engineering → Media Technology

Domain Adaptation and Few-Shot Learning

Physical Sciences → Computer Science → Artificial Intelligence

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Multi-Modal Domain Generalization for Cross-Scene Hyperspectral Image Classification

Abstract

Metrics

Citation History

Topics

Related Documents

Language-Aware Domain Generalization Network for Cross-Scene Hyperspectral Image Classification

Adversarial decoupling domain generalization network for cross-scene hyperspectral image classification

Invariant semantic domain generalization shuffle network for cross-scene hyperspectral image classification

Disentanglement-inspired single-source domain-generalization network for cross-scene hyperspectral image classification

ULDGN: Uncertainty-aware language-guided domain generalization network for cross-scene hyperspectral image classification