JOURNAL ARTICLE

Contrastive hashing with vision transformer for image retrieval

Xiuxiu RenXiangwei ZhengHuiyu ZhouWeilong LiuXiao Dong

Year: 2022 Journal:   International Journal of Intelligent Systems Vol: 37 (12)Pages: 12192-12211   Publisher: Wiley

Abstract

Hashing techniques have attracted considerable attention owing to their advantages of efficient computation and economical storage. However, it is still a challenging problem to generate more compact binary codes for promising performance. In this paper, we propose a novel contrastive vision transformer hashing method, which seamlessly integrates contrastive learning and vision transformers (ViTs) with hash technology into a well-designed model to learn informative features and compact binary codes simultaneously. First, we modify the basic contrastive learning framework by designing several hash layers to meet the specific requirement of hash learning. In our hash network, ViTs are applied as backbones for feature learning, which is rarely performed in existing hash learning methods. Then, we design a multiobjective loss function, in which contrastive loss explores discriminative features by maximizing agreement between different augmented views from the same image, similarity preservation loss performs pairwise semantic preservation to enhance the representative capabilities of hash codes, and quantization loss controls the quantitative error. Hence, we can facilitate end-to-end joint training to improve the retrieval performance. The encouraging experimental results on three widely used benchmark databases demonstrate the superiority of our algorithm compared with several state-of-the-art hashing algorithms.

Keywords:
Hash function Feature hashing Computer science Universal hashing Artificial intelligence Discriminative model Image retrieval Hash table Double hashing Binary code Theoretical computer science Machine learning Dynamic perfect hashing Pattern recognition (psychology) Computer engineering Binary number Image (mathematics) Mathematics Arithmetic Programming language

Metrics

8
Cited By
0.99
FWCI (Field Weighted Citation Impact)
46
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Vision Transformer Hashing for Image Retrieval

Shiv Ram DubeySatish Kumar SinghWei-Ta Chu

Journal:   2022 IEEE International Conference on Multimedia and Expo (ICME) Year: 2022 Pages: 1-6
JOURNAL ARTICLE

HashFormer: Vision Transformer Based Deep Hashing for Image Retrieval

Tao LiZheng ZhangLishen PeiYan Gan

Journal:   IEEE Signal Processing Letters Year: 2022 Vol: 29 Pages: 827-831
JOURNAL ARTICLE

Attention-guided Contrastive Hashing for Long-tailed Image Retrieval

Xuan KouChenghao XuXu YangCheng Deng

Journal:   Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence Year: 2022 Pages: 1017-1023
© 2026 ScienceGate Book Chapters — All rights reserved.