JOURNAL ARTICLE

Attention-Driven Cross-Modal Remote Sensing Image Retrieval

Abstract

In this work, we address a cross-modal retrieval problem in remote sensing (RS) data. A cross-modal retrieval problem is more challenging than the conventional uni-modal data retrieval frameworks as it requires learning of two completely different data representations to map onto a shared feature space. For this purpose, we chose a photo-sketch RS database. We exploit the data modality comprising more spatial information (sketch) to extract the other modality features (photo) with cross-attention networks. This sketch-attended photo features are more robust and yield better retrieval results. We validate our proposal by performing experiments on the benchmarked Earth on Canvas dataset. We show a boost in the overall performance in comparison to the existing literature. Besides, we also display the Grad-CAM visualizations of the trained model's weights to highlight the framework's efficacy.

Keywords:
Sketch Computer science Modal Exploit Modality (human–computer interaction) Information retrieval Feature (linguistics) Image retrieval Data retrieval Artificial intelligence Data mining Pattern recognition (psychology) Image (mathematics) Algorithm

Metrics

9
Cited By
0.82
FWCI (Field Weighted Citation Impact)
17
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Remote-Sensing Image Classification
Physical Sciences →  Engineering →  Media Technology
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Deep Cross-Modal Image–Voice Retrieval in Remote Sensing

Yaxiong ChenXiaoqiang LuShuai Wang

Journal:   IEEE Transactions on Geoscience and Remote Sensing Year: 2020 Vol: 58 (10)Pages: 7049-7061
JOURNAL ARTICLE

Remote Sensing Cross-Modal Retrieval by Deep Image-Voice Hashing

Yichao ZhangXiangtao ZhengXiaoqiang Lu

Journal:   IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Year: 2022 Vol: 15 Pages: 9327-9338
© 2026 ScienceGate Book Chapters — All rights reserved.