Few-shot remote sensing image scene classification: Recent advances, new baselines, and future trends

Chunping Qiu; Xiaoyu Zhang; Xiaochong Tong; Naiyang Guan; Xiaodong Yi; Ke Yang; Junjie Zhu; Anzhu Yu

doi:10.1016/j.isprsjprs.2024.02.005

JOURNAL ARTICLE

Year: 2024 Journal: ISPRS Journal of Photogrammetry and Remote Sensing Vol: 209 Pages: 368-382 Publisher: Elsevier BV

Abstract

Remote sensing image scene classification (RSI-SC) is crucial for various high-level applications, including RSI retrieval, image captioning, and object detection. Deep learning-based methods can accurately predict scene categories. However, these approaches often require numerous labeled samples for training, limiting their practicality in real-world RS applications with scarce label resources. In contrast, few-shot remote sensing image scene classification (FS-RSI-SC) has garnered substantial research interest owing to its potential to mitigate the need for extensive training samples. In recent years, there has been a surge in studies on FS-RSI-SC. This paper presents a comprehensive overview of FS-RSI-SC research, categorizing existing methods into two groups. The first group comprises approaches based on data augmentation, transfer learning, metric learning, and meta-learning. Our analysis reveals that most existing FS-RSI-SC methods fall into the meta-learning category, employing attention mechanisms, self-supervised learning (SSL), and feature fusion techniques for enhanced performance. Additionally, transfer learning-based methods consistently outperform other approaches in this category. The second group is centered around large-scale pre-training, which has demonstrated remarkable competitiveness across various tasks, including FS-RSI-SC. This special group of methods has shown considerable potential and is expected to attract more attention with the increasing popularity of large-scale pre-training and of unimodal and multimodal foundation models. Moreover, we proposed a pipeline that harnesses the capabilities of powerful large vision-language models (VLMs) as image encoders, establishing new baselines for FS-RSI-SC on commonly used datasets under standard experimental settings. Our empirical results validated the effectiveness of utilizing large VLMs and highlighted their potential for FS-RSI-SC. Through a joint analysis of state-of-the-art methods and our experiments with VLMs, we identified the prevailing challenges in FS-RSI-SC and outlined promising directions for future research.
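The abstract does not say how the features from the VLM image encoder are turned into few-shot predictions. As a rough illustration only, the sketch below assumes a nearest-prototype classifier over frozen embeddings, which is one common choice and not necessarily the authors' pipeline. The function names, the three scene labels, and the synthetic vectors standing in for encoder outputs are all hypothetical.

```python
import math
import random


def l2_normalize(vec):
    """Scale a vector to unit length (a zero vector is returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def class_prototypes(support):
    """Average the K support embeddings of each class into one unit-length prototype.

    `support` maps a class label to a list of embedding vectors (the "shots").
    """
    protos = {}
    for label, vecs in support.items():
        dim = len(vecs[0])
        mean = [sum(v[i] for v in vecs) / len(vecs) for i in range(dim)]
        protos[label] = l2_normalize(mean)
    return protos


def classify(query, protos):
    """Return the label whose prototype has the highest cosine similarity to the query."""
    q = l2_normalize(query)
    return max(protos, key=lambda label: sum(a * b for a, b in zip(q, protos[label])))


def fake_embedding(center, rng, noise=0.1):
    """Hypothetical stand-in for a frozen image encoder: a class center plus Gaussian noise."""
    return [c + rng.gauss(0.0, noise) for c in center]


# A toy 3-way 5-shot episode built from synthetic embeddings (assumption, not real data).
rng = random.Random(0)
centers = {
    "airport": [1.0, 0.0, 0.0],
    "forest": [0.0, 1.0, 0.0],
    "harbor": [0.0, 0.0, 1.0],
}
support = {label: [fake_embedding(c, rng) for _ in range(5)] for label, c in centers.items()}
protos = class_prototypes(support)
prediction = classify(fake_embedding(centers["forest"], rng), protos)
print(prediction)
```

In a real experiment, `fake_embedding` would be replaced by features extracted once from a pre-trained encoder, and accuracy would be averaged over many randomly sampled episodes.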

Keywords:

Shot (pellet); Remote sensing; Computer science; One shot; Artificial intelligence; Image (mathematics); Single shot; Computer vision; Geography; Engineering

Metrics

Cited By

FWCI (Field Weighted Citation Impact): 27.06

Refs: 119

Citation Normalized Percentile: 0.99

Is in top 1%

Is in top 10%

Topics

Remote-Sensing Image Classification

Physical Sciences → Engineering → Media Technology

Remote Sensing in Agriculture

Physical Sciences → Environmental Science → Ecology

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Few Shot Metric Learning for Remote Sensing Image Scene Classification

TeAw: Text-Aware Few-Shot Remote Sensing Image Scene Classification

DLA-MatchNet for Few-Shot Remote Sensing Image Scene Classification

Self-supervised learning based few-shot remote sensing scene image classification

Unsupervised Few-Shot Continual Learning for Remote Sensing Image Scene Classification