JOURNAL ARTICLE

MULTI-LEVEL CONTRASTIVE LEARNING FOR HYBRID CROSS-MODAL RETRIEVAL

Abstract

Hybrid image retrieval is an important task with a wide range of applications. In this setting, the hybrid query used to search for images consists of a reference image and a text modifier. The reference image provides vital visual context and conveys some semantic details, while the text modifier specifies the modifications to be made to the reference image. To address such hybrid cross-modal retrieval, we propose a multi-level contrastive learning (MLCL) method that combines the hybrid query features into a fused feature through cross-modal contrastive learning with multi-level semantic alignment. We additionally apply self-supervised contrastive learning to enhance the semantic correlation of the features at different levels of the combiner network. Extensive experiments on three public datasets (i.e., FashionIQ, Shoes, and CIRR) demonstrate that the proposed MLCL significantly outperforms state-of-the-art methods under the hybrid cross-modal retrieval setting.
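The abstract does not give the loss itself, so the following is only a minimal sketch of how a batch-based contrastive objective between fused query features and target image features might look. It uses the generic InfoNCE formulation, which is an assumption rather than the paper's stated loss. The names `info_nce_loss` and `multi_level_loss`, the temperature value of 0.07, and the plain averaging across levels are all hypothetical choices made for illustration.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

def info_nce_loss(fused_queries, target_images, temperature=0.07):
    """Generic InfoNCE loss: row i of fused_queries should match row i of target_images.

    Hypothetical helper, not the paper's loss; the temperature is an assumed default.
    """
    queries = [l2_normalize(q) for q in fused_queries]
    targets = [l2_normalize(t) for t in target_images]
    total = 0.0
    for i, q in enumerate(queries):
        # Cosine similarity of query i to every target in the batch, scaled by temperature.
        logits = [sum(a * b for a, b in zip(q, t)) / temperature for t in targets]
        # Numerically stable log-sum-exp over the batch.
        max_logit = max(logits)
        log_denominator = max_logit + math.log(sum(math.exp(z - max_logit) for z in logits))
        # Negative log-probability assigned to the matching (positive) target.
        total += log_denominator - logits[i]
    return total / len(queries)

def multi_level_loss(levels, temperature=0.07):
    """Average the contrastive loss over (fused, target) feature pairs from several levels.

    Plain averaging across levels is an assumption made for this sketch.
    """
    return sum(info_nce_loss(f, t, temperature) for f, t in levels) / len(levels)
```

With this formulation, a batch in which each fused query points in the same direction as its own target yields a loss near zero, while a batch whose targets are all identical yields a loss of log(N) for batch size N, since no target can be told apart from the others. The sketch assumes non-zero feature vectors.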

Keywords:
Feature (linguistics); Semantic feature; Context (archaeology); Pattern recognition (psychology); Image retrieval; Task (project management); Range (aeronautics); Semantics (computer science)

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.43

Topics

Advanced Image and Video Retrieval Techniques
Multimodal Machine Learning Applications
Image Retrieval and Classification Techniques

Related Documents

JOURNAL ARTICLE

Contrastive Learning for Cross-Modal Artist Retrieval

Andres Ferraro, Jaehun Kim, Sergio Oramas, Andreas Ehmann, Fabien Gouyon

Journal: Zenodo (CERN European Organization for Nuclear Research), Year: 2023
JOURNAL ARTICLE

Category-Level Contrastive Learning for Unsupervised Hashing in Cross-Modal Retrieval

Mengying Xu, Linyin Luo, Hanjiang Lai, Jian Yin

Journal: Data Science and Engineering, Year: 2024, Vol: 9 (3), Pages: 251-263
JOURNAL ARTICLE

Multi-level cross-modal contrastive learning for review-aware recommendation

Yibiao Wei, Yang Xu, Lei Zhu, Jingwei Ma, Chengmei Peng

Journal: Expert Systems with Applications, Year: 2024, Vol: 247, Pages: 123341-123341