JOURNAL ARTICLE

Contrastive learning based remote sensing text-to-image generation for few-shot remote sensing image captioning

Haonan ZhouHui TangXiangchun LiuXiaoxiao ShiLurui Xia

Year: 2025 Journal:   International Journal of Digital Earth Vol: 18 (1)   Publisher: Taylor & Francis

Abstract

In few-shot scenarios, the lack of caption-labeled samples and prior knowledge leads to insufficient training and performance degradation of remote sensing image captioning (RC) models. We propose an iterative remote sensing image captioning method named IRIC to promote RC model performance iteration and generate higher quality captions. The IRIC first constructs a remote sensing text-to-image model CRTI based on contrastive learning, which can generate remote sensing images with the same semantic content from text and achieve text-driven remote sensing image transformation; Subsequently, caption-labeled sample amplification with prior knowledge introduction is implemented, which incorporates prior knowledge into the text-driven remote sensing image transformation to achieve caption-labeled sample amplification; Finally, the amplified caption-labeled samples are added to the original train set, and the RC model is retrained to achieve iterative performance improvement. The experimental results show that the IRIC is highly effective in few-shot scenarios and can iteratively improve the CIDEr scores of the latest few-shot RC model by 8.5%.

Keywords:
Closed captioning Image (mathematics) Computer science Remote sensing Artificial intelligence Shot (pellet) Computer vision Geography

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
44
Refs
0.21
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Self-Learning for Few-Shot Remote Sensing Image Captioning

Haonan ZhouXiaoping DuLurui XiaSen Li

Journal:   Remote Sensing Year: 2022 Vol: 14 (18)Pages: 4606-4606
JOURNAL ARTICLE

Attention-Based Contrastive Learning for Few-Shot Remote Sensing Image Classification

Yulong XuHanbo BiHongfeng YuWanxuan LuPeifeng LiXinming LiXian Sun

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2024 Vol: 62 Pages: 1-17
JOURNAL ARTICLE

FRIC: a framework for few-shot remote sensing image captioning

Haonan ZhouLurui XiaXiaoping DuSen Li

Journal:   International Journal of Digital Earth Year: 2024 Vol: 17 (1)
JOURNAL ARTICLE

Task-specific contrastive learning for few-shot remote sensing image scene classification

Qingjie ZengJie Geng

Journal:   ISPRS Journal of Photogrammetry and Remote Sensing Year: 2022 Vol: 191 Pages: 143-154
JOURNAL ARTICLE

Foreground–Background Contrastive Learning for Few-Shot Remote Sensing Image Scene Classification

Jie GengBohan XueWen Jiang

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2023 Vol: 61 Pages: 1-12
© 2026 ScienceGate Book Chapters — All rights reserved.