JOURNAL ARTICLE

Enhancing Semantic Clarity: Discriminative and Fine-grained Information Mining for Remote Sensing Image-Text Retrieval

Abstract

Remote sensing image-text retrieval is a fundamental task in remote sensing multimodal analysis, promoting the alignment of visual and language representations. The mainstream approaches commonly focus on capturing shared semantic representations between visual and textual modalities. However, the inherent characteristics of remote sensing image-text pairs lead to a semantic confusion problem, stemming from redundant visual representations and high inter-class similarity. To tackle this problem, we propose a novel Discriminative and Fine-grained Information Mining (DFIM) model, which aims to enhance semantic clarity by reducing visual redundancy and increasing the semantic gap between different classes. Specifically, the Dynamic Visual Enhancement (DVE) module adaptively enhances the visual discriminative features under the guidance of multimodal fusion information. Meanwhile, the Fine-grained Semantic Matching (FSM) module cleverly models the matching relationship between image regions and text words as an optimal transport problem, thereby refining intra-instance matching. Extensive experiments on two benchmark datasets justify the superiority of DFIM in terms of retrieval accuracy and visual interpretability over the leading methods.

Keywords:

Metrics

1
Cited By
4.77
FWCI (Field Weighted Citation Impact)
0
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Discriminative feature mining hashing for fine-grained image retrieval

Wenxi LangHan SunCan XuNingzhong LiuHuiyu Zhou

Journal:   Journal of Visual Communication and Image Representation Year: 2022 Vol: 87 Pages: 103592-103592
JOURNAL ARTICLE

Fine-Grained Information Supplementation and Value-Guided Learning for Remote Sensing Image-Text Retrieval

Zihui ZhouYong FengAgen QiuGuangyao DuanMingliang Zhou

Journal:   IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Year: 2024 Vol: 17 Pages: 19194-19210
JOURNAL ARTICLE

Fine-Grained Image-Text Retrieval via Discriminative Latent Space Learning

Min ZhengWen WangQingyong Li

Journal:   IEEE Signal Processing Letters Year: 2021 Vol: 28 Pages: 643-647
© 2026 ScienceGate Book Chapters — All rights reserved.