Chunping QiuXiaoyu ZhangXiaochong TongNaiyang GuanXiaodong YiKe YangJunjie ZhuAnzhu Yu
Remote sensing image scene classification (RSI-SC) is crucial for various high-level applications, including RSI retrieval, image captioning, and object detection. Deep learning-based methods can accurately predict scene categories. However, these approaches often require numerous labeled samples for training, limiting their practicality in real-world RS applications with scarce label resources. In contrast, few-shot remote sensing image scene classification (FS-RSI-SC) has garnered substantial research interest owing to its potential to mitigate the need for extensive training samples. In recent years, there has been a surge in studies on FS-RSI-SC. This paper presents a comprehensive overview of FS-RSI-SC research, categorizing existing methods into two groups. The first group comprises approaches based on data augmentation, transfer learning, metric learning, and meta-learning. Our analysis reveals that most existing FS-RSI-SC methods fall into the meta-learning category, employing attention mechanisms, self-supervised learning (SSL), and feature fusion techniques for enhanced performance. Additionally, transfer learning-based methods consistently outperform other approaches in this category. The second group is centered around large-scale pre-training, which has demonstrated remarkable competitiveness across various tasks, including FS-RSI-SC. This special group of methods has shown considerable potential and is expected to attract more attention with the increasing popularity of large-scale pre-training and the unimodal and multimodal foundation models. Moreover, we proposed a pipeline that harnesses the capabilities of powerful large vision-language models (VLMs) as image encoders, establishing new baselines for FS-RSI-SC on commonly used datasets under standard experimental settings. Our empirical results validated the effectiveness of utilizing large VLMs and highlighted their potential for FS-RSI-SC. Through a joint analysis of state-of-the-art methods and our experiments with VLMs, we identified the prevailing challenges in FS-RSI-SC and outlined promising directions for future research.
Kaihui ChengChule YangZunlin FanDayan WuNaiyang Guan
Lingjun LiJunwei HanXiwen YaoGong ChengLei Guo
Rui ZhangYixin YangYang LiJiabao WangZhuang MiaoHang LiZiqi Wang
M. Anwar Ma’sumMahardhika PratamaSavitha RamasamyLin LiuH. HabibullahRyszard Kowalczyk