JOURNAL ARTICLE

A Self-Supervised Cross-Modal Remote Sensing Foundation Model with Multi-Domain Representation and Cross-Domain Fusion

Abstract

Building a foundation model that extracts generalized features from large volumes of multimodal data is a new challenge in remote sensing. Unlike natural scene images, remote sensing data are acquired by multiple sensors in complex application scenarios, so models tailored to a specific task are difficult to generalize to new scenarios. In this paper, we propose a model architecture based on the concepts of multi-domain representation and cross-domain fusion. By extracting highly generalizable features from massive multimodal data, a single foundation model can perform interpretation across multiple downstream tasks. Experimental results show that the proposed model performs well on multiple downstream tasks, validating the feasibility of a cross-modal remote sensing foundation model for interpretation tasks.
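
The abstract describes the architecture only at a conceptual level. Below is a minimal, hypothetical sketch of how "multi-domain representation" (separate per-modality encoders) and "cross-domain fusion" (cross-attention between modality token streams) could be wired together in PyTorch. All class names, dimensions, and the choice of optical/SAR inputs are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch: per-modality encoders ("multi-domain representation")
# followed by bidirectional cross-attention ("cross-domain fusion").
# Names and shapes are assumptions for illustration only.
import torch
import torch.nn as nn


class CrossDomainFusion(nn.Module):
    """Fuse two modality token sequences with bidirectional cross-attention."""

    def __init__(self, dim: int, num_heads: int = 4):
        super().__init__()
        self.attn_a = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.attn_b = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, tokens_a, tokens_b):
        # Each modality attends to the other; residual streams are concatenated.
        fused_a, _ = self.attn_a(tokens_a, tokens_b, tokens_b)
        fused_b, _ = self.attn_b(tokens_b, tokens_a, tokens_a)
        return self.norm(torch.cat([tokens_a + fused_a, tokens_b + fused_b], dim=1))


class ToyCrossModalModel(nn.Module):
    """Separate encoders per modality, then a shared fused feature space."""

    def __init__(self, dim: int = 64, patch: int = 8):
        super().__init__()
        # Patch-embedding convolutions stand in for the modality-specific
        # representation branches (e.g. optical RGB vs. single-channel SAR).
        self.enc_optical = nn.Conv2d(3, dim, kernel_size=patch, stride=patch)
        self.enc_sar = nn.Conv2d(1, dim, kernel_size=patch, stride=patch)
        self.fusion = CrossDomainFusion(dim)

    def forward(self, optical, sar):
        a = self.enc_optical(optical).flatten(2).transpose(1, 2)  # (B, N, dim)
        b = self.enc_sar(sar).flatten(2).transpose(1, 2)          # (B, N, dim)
        return self.fusion(a, b)  # shared tokens for downstream task heads


if __name__ == "__main__":
    model = ToyCrossModalModel()
    feats = model(torch.randn(2, 3, 64, 64), torch.randn(2, 1, 64, 64))
    print(feats.shape)  # torch.Size([2, 128, 64])
```

In a self-supervised setting such as the one the title suggests, the fused tokens would feed a pretext objective (e.g. masked reconstruction or contrastive alignment across modalities) rather than a supervised head; that choice is outside what the abstract specifies.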

Keywords:
Remote sensing, Sensor fusion, Multi-domain representation, Cross-domain fusion, Generalization, Machine learning, Artificial intelligence, Data mining, Computer science, Engineering

Metrics

Cited by: 8
FWCI (Field-Weighted Citation Impact): 1.74
References: 18
Citation Normalized Percentile: 0.84

Topics

Remote-Sensing Image Classification (Physical Sciences → Engineering → Media Technology)
Advanced Image and Video Retrieval Techniques (Physical Sciences → Computer Science → Computer Vision and Pattern Recognition)
Remote Sensing and Land Use (Physical Sciences → Earth and Planetary Sciences → Atmospheric Science)