JOURNAL ARTICLE

Multiscale Salient Alignment Learning for Remote-Sensing Image–Text Retrieval

Yaxiong ChenJinghao HuangXiaoyu LiShengwu XiongXiaoqiang Lu

Year: 2023 Journal:   IEEE Transactions on Geoscience and Remote Sensing Vol: 62 Pages: 1-13   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Remote-sensing image–text (RSIT) retrieval involves the use of either textual descriptions or remote-sensing images (RSI) as queries to retrieve relevant RSIs or corresponding text descriptions. Many traditional cross-modal RSIT retrieval methods tend to overlook the importance of capturing salient information and establishing the prior similarity between RSIs and texts, leading to a decline in cross-modal retrieval performance. In this article, we address these challenges by introducing a novel approach known as multiscale salient image-guided text alignment (MSITA). This approach is designed to learn salient information by aligning text with images for effective cross-modal RSIT retrieval. The MSITA approach first incorporates a multiscale fusion module and a salient learning module to facilitate the extraction of salient information. In addition, it introduces an image-guided text alignment (IGTA) mechanism that uses image information to guide the alignment of texts, enabling the effective capture of fine-grained correspondences between RSI regions and textual descriptions. In addition to these components, a novel loss function is devised to enhance the similarity across different modalities and reinforce the prior similarity between RSIs and texts. Extensive experiments conducted on four widely adopted RSIT datasets affirm that the MSITA approach significantly enhances cross-modal RSIT retrieval performance in comparison to other state-of-the-art methods.

Keywords:
Salient Computer science Remote sensing Image retrieval Artificial intelligence Computer vision Image (mathematics) Information retrieval Geology

Metrics

20
Cited By
3.64
FWCI (Field Weighted Citation Impact)
61
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology

Related Documents

JOURNAL ARTICLE

Visual Global-Salient-Guided Network for Remote Sensing Image-Text Retrieval

Yangpeng HeXin XuHongjia ChenJinwen LiFangling Pu

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2024 Vol: 62 Pages: 1-14
JOURNAL ARTICLE

Context-Aware Local–Global Semantic Alignment for Remote Sensing Image–Text Retrieval

Xiumei ChenXiangtao ZhengXiaoqiang Lu

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2025 Vol: 63 Pages: 1-12
© 2026 ScienceGate Book Chapters — All rights reserved.