In this work, we address a cross-modal retrieval problem in remote sensing (RS) data. Cross-modal retrieval is more challenging than conventional uni-modal retrieval because it requires learning two completely different data representations and mapping them onto a shared feature space. For this purpose, we use a photo-sketch RS database. We exploit the modality that carries richer spatial information (the sketch) to guide the extraction of features from the other modality (the photo) through cross-attention networks. These sketch-attended photo features are more robust and yield better retrieval results. We validate our proposal by performing experiments on the benchmark Earth on Canvas dataset and show a boost in overall performance compared to the existing literature. Besides, we also present Grad-CAM visualizations of the trained model to highlight the framework's efficacy.
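The abstract does not give implementation details, but the core idea (photo features aggregated under attention weights derived from the sketch) can be illustrated with a minimal PyTorch sketch. Everything below is a hypothetical reading, not the authors' code: the class name SketchPhotoCrossAttention, the choice of embed_dim=256, num_heads=4, the residual-plus-norm layout, and the mean-pooled retrieval embedding are all assumptions for illustration.

import torch
import torch.nn as nn

class SketchPhotoCrossAttention(nn.Module):
    # One plausible form of the sketch-to-photo cross-attention the abstract
    # describes: sketch tokens act as queries over photo tokens, so the
    # output is a sketch-weighted aggregation of photo features.
    def __init__(self, embed_dim=256, num_heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(embed_dim)

    def forward(self, sketch_feats, photo_feats):
        # sketch_feats: (batch, n_sketch_tokens, embed_dim)  -- queries
        # photo_feats:  (batch, n_photo_tokens,  embed_dim)  -- keys/values
        attended, _ = self.attn(query=sketch_feats, key=photo_feats,
                                value=photo_feats)
        # Standard residual on the query stream, followed by layer norm.
        fused = self.norm(sketch_feats + attended)
        # Pool tokens into a single vector for a shared retrieval space.
        return fused.mean(dim=1)

# Toy usage: 8 sketch/photo pairs, 49 spatial tokens each (e.g., a 7x7 CNN map).
sketch = torch.randn(8, 49, 256)
photo = torch.randn(8, 49, 256)
embedding = SketchPhotoCrossAttention()(sketch, photo)  # shape: (8, 256)

In a retrieval setup of this kind, such embeddings for both modalities would typically be trained with a contrastive or triplet objective so that matching photo-sketch pairs lie close in the shared space; the abstract does not specify which loss is used.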