JOURNAL ARTICLE

Unsupervised Generative Adversarial Cross-Modal Hashing

Jian ZhangYuxin PengMingkuan Yuan

Year: 2018 Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Vol: 32 (1)   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Cross-modal hashing aims to map heterogeneous multimedia data into a common Hamming space, which can realize fast and flexible retrieval across different modalities. Unsupervised cross-modal hashing is more flexible and applicable than supervised methods, since no intensive labeling work is involved. However, existing unsupervised methods learn hashing functions by preserving inter and intra correlations, while ignoring the underlying manifold structure across different modalities, which is extremely helpful to capture meaningful nearest neighbors of different modalities for cross-modal retrieval. To address the above problem, in this paper we propose an Unsupervised Generative Adversarial Cross-modal Hashing approach (UGACH), which makes full use of GAN's ability for unsupervised representation learning to exploit the underlying manifold structure of cross-modal data. The main contributions can be summarized as follows: (1) We propose a generative adversarial network to model cross-modal hashing in an unsupervised fashion. In the proposed UGACH, given a data of one modality, the generative model tries to fit the distribution over the manifold structure, and select informative data of another modality to challenge the discriminative model. The discriminative model learns to distinguish the generated data and the true positive data sampled from correlation graph to achieve better retrieval accuracy. These two models are trained in an adversarial way to improve each other and promote hashing function learning. (2) We propose a correlation graph based approach to capture the underlying manifold structure across different modalities, so that data of different modalities but within the same manifold can have smaller Hamming distance and promote retrieval accuracy. Extensive experiments compared with 6 state-of-the-art methods on 2 widely-used datasets verify the effectiveness of our proposed approach.

Keywords:
Computer science Hash function Discriminative model Hamming space Artificial intelligence Generative model Pattern recognition (psychology) Unsupervised learning Machine learning Generative grammar Hamming code Algorithm

Metrics

218
Cited By
13.98
FWCI (Field Weighted Citation Impact)
38
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Surveillance and Tracking Methods
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval

Jian ZhangYuxin Peng

Journal:   IEEE Transactions on Multimedia Year: 2019 Vol: 22 (1)Pages: 174-187
JOURNAL ARTICLE

Attention-based Generative Adversarial Hashing for Cross-modal Retrieval

Jianqiong XiaoXiaoqing Zhou

Journal:   2022 IEEE 10th Joint International Information Technology and Artificial Intelligence Conference (ITAIC) Year: 2022
© 2026 ScienceGate Book Chapters — All rights reserved.