Rui Yang, Di Zhang, Yanhe Guo, Shuang Wang
Cross-modal remote sensing image-text retrieval (CMRSITR), which retrieves images of interest from vast archives of remote sensing images, has received significant attention in recent years. However, when extracting image features, existing methods do not consider saliency and texture information, both of which are essential for remote sensing images. Therefore, this paper proposes a novel texture- and saliency-enhanced image learning method for CMRSITR. In this method, we construct a multi-task image feature extractor. For each remote sensing (RS) image, a texture map and a saliency map are generated by texture extraction and saliency detection, respectively. Both maps serve as supervision during training, so that the extracted texture and saliency features are gradually reconstructed into the texture map and the saliency map, respectively. Meanwhile, the retrieval features of each RS image are obtained from the image's retrieval feature branch. Experiments on two commonly used CMRSITR datasets, RSICD and UCM, show that the proposed method effectively improves retrieval performance and achieves state-of-the-art results compared with existing methods.
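The multi-task training described above can be sketched as a combined objective: a retrieval loss plus reconstruction terms that push the texture and saliency branches toward their supervision maps. The following is a minimal NumPy sketch under assumptions not stated in the abstract: the texture map is approximated here by a gradient-magnitude operator, the reconstruction terms use mean-squared error, and the loss weights `w_tex` and `w_sal` are illustrative placeholders, not the paper's actual settings.

```python
import numpy as np

def texture_map(img):
    """Gradient-magnitude proxy for texture extraction.

    Assumption: the abstract does not specify the texture operator;
    this stands in for whatever descriptor the paper actually uses.
    """
    gy, gx = np.gradient(img.astype(float))
    return np.hypot(gx, gy)

def multitask_loss(retrieval_loss, tex_pred, tex_target,
                   sal_pred, sal_target, w_tex=0.1, w_sal=0.1):
    """Combine the retrieval loss with the two reconstruction terms.

    The texture and saliency branches are supervised by their maps via
    MSE; the weights are hypothetical and would be tuned in practice.
    """
    l_tex = np.mean((tex_pred - tex_target) ** 2)
    l_sal = np.mean((sal_pred - sal_target) ** 2)
    return retrieval_loss + w_tex * l_tex + w_sal * l_sal
```

With perfect reconstruction of both maps, the objective reduces to the retrieval loss alone, so the auxiliary branches only contribute gradient signal while their outputs still deviate from the supervision maps.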