JOURNAL ARTICLE

Deep Ranking Distribution Preserving Hashing for Robust Multi-Label Cross-Modal Retrieval

Ge SongKai HuangHanwen SuFengyi SongMing Yang

Year: 2024 Journal:   IEEE Transactions on Multimedia Vol: 26 Pages: 7027-7042   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Deep supervised hashing techniques have exhibited remarkable efficiency in cross-modal retrieval tasks, because they enable the transformation of data from different modalities into compact binary codes that preserve semantic similarity structures. Nonetheless, existing methods often rely on pairwise or triplet relationships within known (or in-distribution) semantics during training, failing to capture the comprehensive ranking information inherent in web data that encompasses diverse concepts. In addition, these methods are vulnerable to out-of-distribution (OOD) semantic data when applied in realistic scenarios, resulting in suboptimal performance. In this paper, we propose ranking distribution preserving hashing (RDPH) to address these problems. We present a novel ranking loss, a differentiable surrogate that maximizes the NDCG metric for cross-modal retrieval. This loss incorporates two target ranking distributions derived from the ideal NDCG scores of samples and the cosine similarity of features. These distributions encourage RDPH to generate hash codes that approximate the desired inter-modal and intra-modal ranking distributions. To enhance the robustness of the hash codes against OOD data, RDPH leverages the CLIP paradigm to acquire OOD-resilient intermediate representations. Besides, we utilize the outlier exposure strategy to enhance the discriminative ability of OOD for hash codes under supervision by constructing auxiliary pseudo-OOD data from known data in feature space. Experiments on three datasets demonstrate that the proposed method achieves state-ofthe-art performance on regular retrieval tasks and good results on simulated real-world retrieval tasks.

Keywords:
Computer science Hash function Data mining Ranking (information retrieval) Learning to rank Discriminative model Artificial intelligence Binary code Outlier Machine learning Pattern recognition (psychology) Information retrieval Binary number Mathematics

Metrics

24
Cited By
12.72
FWCI (Field Weighted Citation Impact)
60
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Deep self-enhancement hashing for robust multi-label cross-modal retrieval

Ge SongHanwen SuKai HuangFengyi SongMing Yang

Journal:   Pattern Recognition Year: 2023 Vol: 147 Pages: 110079-110079
JOURNAL ARTICLE

Deep robust multilevel semantic hashing for multi-label cross-modal retrieval

Ge SongXiaoyang TanJun ZhaoMing Yang

Journal:   Pattern Recognition Year: 2021 Vol: 120 Pages: 108084-108084
JOURNAL ARTICLE

Multi-label semantics preserving based deep cross-modal hashing

Xitao ZouXinzhi WangErwin M. BakkerSong Wu

Journal:   Signal Processing Image Communication Year: 2021 Vol: 93 Pages: 116131-116131
JOURNAL ARTICLE

Deep Class-Guided Hashing for Multi-Label Cross-Modal Retrieval

Hao ChenZhuoyang ZouYiqiang LiuXinghui Zhu

Journal:   Applied Sciences Year: 2025 Vol: 15 (6)Pages: 3068-3068
JOURNAL ARTICLE

Deep synthetic-proxy hashing for multi-label cross-modal retrieval

Kun LiQibing QinJinkui HouWenfeng ZhangChunlei ChenLei Huang

Journal:   Neurocomputing Year: 2026 Vol: 670 Pages: 132561-132561
© 2026 ScienceGate Book Chapters — All rights reserved.