JOURNAL ARTICLE

Self-Supervised Cluster-Contrast Distillation Hashing Network for Cross-Modal Retrieval

Haoxuan SunYudong CaoGuangyuan Liu

Year: 2023 Journal:   IEEE Access Vol: 11 Pages: 96584-96593   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Traditional cross-modal hash models enable efficient and fast retrieval between multimodal data by training high-quality hash representations. The key to the cross-modal hashing model is feature extraction. However, the quality of the features largely depends on the semantic similarity between the multi-modal data, and the existing methods do not effectively utilize the semantic information between the data. In this paper, we attempt to explore the semantic information inherent within the data using contrastive learning. Specifically, we propose a end-to-end cluster-level contrastive learning method (SCCDH) for cross-modal hashing. The method utilizes the clustering results to guide feature learning in an appropriately designed contrast framework. In SCCDH, feature-level and hash cluster-level contrastive learning are used to help the model learn discriminative features among multimodal data. In addition, we propose a distillation filtering method to filter out a large amount of noise in the data. Extensive experiments were conducted on the MIRFLICKR-25K, NUS-WIDE, and MS-COCO datasets, and the results demonstrate that the proposed method outperformed several state-of-the-art methods.

Keywords:
Computer science Hash function Artificial intelligence Cluster analysis Pattern recognition (psychology) Feature extraction Feature hashing Feature learning Discriminative model Feature (linguistics) Data mining Machine learning Hash table Double hashing

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
54
Refs
0.10
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Self-supervised incomplete cross-modal hashing retrieval

Shouyong PengTao YaoYing LiGang WangLili WangZhiming Yan

Journal:   Expert Systems with Applications Year: 2024 Vol: 262 Pages: 125592-125592
JOURNAL ARTICLE

Autoencoder-based self-supervised hashing for cross-modal retrieval

Yi-Fan LiXuan WangLei CuiJiajia ZhangCheng-Kai HuangXuan LuoShuhan Qi

Journal:   Multimedia Tools and Applications Year: 2020 Vol: 80 (11)Pages: 17257-17274
JOURNAL ARTICLE

Semi-Supervised Knowledge Distillation for Cross-Modal Hashing

Mingyue SuGuanghua GuXianlong RenHao FuYao Zhao

Journal:   IEEE Transactions on Multimedia Year: 2021 Vol: 25 Pages: 662-675
BOOK-CHAPTER

Supervised Discriminative Discrete Hashing for Cross-Modal Retrieval

Xingyu LuChi‐Man Pun

Lecture notes in computer science Year: 2023 Pages: 599-613
© 2026 ScienceGate Book Chapters — All rights reserved.