JOURNAL ARTICLE

Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval

Donglin ZhangXiao‐Jun WuTianyang XuJosef Kittler

Year: 2022 Journal:   IEEE Transactions on Systems Man and Cybernetics Systems Vol: 52 (11)Pages: 7014-7026   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Recently, hashing-based multimodal learning systems have received increasing attention due to their query efficiency and parsimonious storage costs. However, impeded by the quantization loss caused by numerical optimization, the existing cross-media hashing approaches are unable to capture all the discriminative information present in the original multimodal data. Besides, most cross-modal methods belong to the one-step paradigm, which learn the binary codes and hash function simultaneously, increasing the complexity of optimization. To address these issues, we propose a novel two-stage approach, named the two-stage supervised discrete hashing (TSDH) method. In particular, in the first phase, TSDH generates a latent representation for each modality. These representations are then mapped to a common Hamming space to generate the binary codes. In addition, TSDH directly endows the hash codes with the semantic labels, enhancing the discriminatory power of the learned binary codes. A discrete hash optimization approach is developed to learn the binary codes without relaxation, avoiding the large quantization loss. The proposed hash function learning scheme reuses the semantic information contained by the embeddings, endowing the hash functions with enhanced discriminability. Extensive experiments on several databases demonstrate the effectiveness of the developed TSDH, outperforming several recent competitive cross-media algorithms.

Keywords:
Dynamic perfect hashing Hash function Double hashing Feature hashing Universal hashing Computer science Locality-sensitive hashing Binary code Hash table Discriminative model Quantization (signal processing) Theoretical computer science Hamming space K-independent hashing Discrete optimization Binary number Artificial intelligence Pattern recognition (psychology) Algorithm Hamming code Optimization problem Mathematics Block code Decoding methods

Metrics

26
Cited By
3.22
FWCI (Field Weighted Citation Impact)
62
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.