MULTI-LEVEL CONTRASTIVE LEARNING FOR HYBRID CROSS-MODAL RETRIEVAL

doi:10.60864/3j14-6y84

ScienceGate Book Chapters

JOURNAL ARTICLE

MULTI-LEVEL CONTRASTIVE LEARNING FOR HYBRID CROSS-MODAL RETRIEVAL

Year: 2024 Journal: IEEE SIGPORT

DOI: 10.60864/3j14-6y84

Get Full-Text PDF Get Analytical Report

Abstract

Hybrid image retrieval is a significant task for a wide range of applications. In this scenario, the hybrid query for searching images consists of a reference image and a text modifier. The reference image provides a vital visual context and displays some semantic details, while the text modifier specifies the modifications to the reference image. To address such hybrid cross-modal retrieval, we propose a multi-level contrastive learning (MLCL) method for combining the hybrid query features into a fused feature by cross-modal contrastive learning with multi-level semantic alignment. Meanwhile, we additionally consider self-supervised contrastive learning to enhance the semantic correlation of the features at different levels of the combiner network. Extensive results on three public datasets (i.e., FashionIQ, Shoes, and CIRR) demonstrate that our proposed MLCL significantly outperforms the state-of-the-art methods under the hybrid cross-modal retrieval setting.

Keywords:

Feature (linguistics) Semantic feature Context (archaeology) Pattern recognition (psychology) Image retrieval Task (project management) Range (aeronautics) Semantics (computer science)

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.43

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Image Retrieval and Classification Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

MULTI-LEVEL CONTRASTIVE LEARNING FOR HYBRID CROSS-MODAL RETRIEVAL

Abstract

Metrics

Topics

Related Documents

Multi-Level Contrastive Learning For Hybrid Cross-Modal Retrieval

Contrastive Learning for Cross-Modal Artist Retrieval

Contrastive Learning for Cross-Modal Artist Retrieval

Category-Level Contrastive Learning for Unsupervised Hashing in Cross-Modal Retrieval

Multi-level cross-modal contrastive learning for review-aware recommendation