JOURNAL ARTICLE

Deep Adversarial Discrete Hashing for Cross-Modal Retrieval

Abstract

Cross-modal hashing has received widespread attentions on cross-modal retrieval task due to its superior retrieval efficiency and low storage cost. However, most existing cross-modal hashing methods learn binary codes directly from multimedia data, which cannot fully utilize the semantic knowledge of the data. Furthermore, they cannot learn the ranking based similarity relevance of data points with multi-label. And they usually use a relax constraint of hash code which causes non-negligible quantization loss in the optimization. In this paper, a hashing method called Deep Adversarial Discrete Hashing (DADH) is proposed to address these issues for cross-modal retrieval. The proposed method uses adversarial training to learn features across modalities and ensure the distribution consistency of feature representations across modalities. We also introduce a weighted cosine triplet constraint which can make full use of semantic knowledge from the multi-label to ensure the precise ranking relevance of item pairs. In addition, we use a discrete hashing strategy to learn the discrete binary codes without relaxation, by which the semantic knowledge from label in the hash codes can be preserved while the quantization loss can be minimized. Ablation experiments and comparison experiments on two cross-modal databases show that the proposed DADH improves the performance and outperforms several state-of-the-art hashing methods for cross-modal retrieval.

Keywords:
Computer science Hash function Binary code Quantization (signal processing) Theoretical computer science Artificial intelligence Data mining Binary number Information retrieval Machine learning Pattern recognition (psychology) Algorithm Mathematics

Metrics

108
Cited By
5.35
FWCI (Field Weighted Citation Impact)
20
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Discrete Fusion Adversarial Hashing for cross-modal retrieval

Jing LiEn YuJianhua MaXiaojun ChangHuaxiang ZhangJiande Sun

Journal:   Knowledge-Based Systems Year: 2022 Vol: 253 Pages: 109503-109503
JOURNAL ARTICLE

Deep Discrete Cross-Modal Hashing for Cross-Media Retrieval

Fangming ZhongZhikui ChenGeyong Min

Journal:   Pattern Recognition Year: 2018 Vol: 83 Pages: 64-77
JOURNAL ARTICLE

Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval

Tianshi WangLei ZhuZheng ZhangHuaxiang ZhangJunwei Han

Journal:   IEEE Transactions on Circuits and Systems for Video Technology Year: 2023 Vol: 33 (10)Pages: 6159-6172
JOURNAL ARTICLE

Deep semantic similarity adversarial hashing for cross-modal retrieval

Haopeng QiangYuan WanLun XiangXiaojing Meng

Journal:   Neurocomputing Year: 2020 Vol: 400 Pages: 24-33
BOOK-CHAPTER

Attention-Aware Deep Adversarial Hashing for Cross-Modal Retrieval

Xi ZhangHanjiang LaiJiashi Feng

Lecture notes in computer science Year: 2018 Pages: 614-629
© 2026 ScienceGate Book Chapters — All rights reserved.