JOURNAL ARTICLE

Interaction-Assisted Multi-Modal Representation Learning for Recommendation

Abstract

Personalized recommender systems have attracted significant attention from both industry and academia. Recent studies have shed light on incorporating multi-modal side information into recommender systems to further boost performance. Meanwhile, Transformer-based multi-modal representation learning has brought substantial gains on downstream visual and textual tasks. However, these self-supervised pre-training methods are not tailored for recommendation and may lead to suboptimal representations. To this end, we propose Interaction-Assisted Multi-Modal Representation Learning for Recommendation (IRL) to inject user interaction information into item multi-modal representation learning. Specifically, we extract item graph embeddings from user-item interactions and then use them to formulate a novel triplet IRL training objective, which serves as a behavior-aware pre-training task for the representation learning model. Extensive experiments on several real-world datasets demonstrate the effectiveness of IRL.
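The abstract does not spell out the exact form of the triplet objective, so the following is only a minimal sketch of what such a behavior-aware triplet loss might look like, written in PyTorch. The function name irl_triplet_loss, the use of cosine distance, and the margin value are illustrative assumptions rather than the paper's formulation: the anchor is an item's multi-modal representation, the positive is that item's interaction-derived graph embedding, and the negative is another item's graph embedding.

    import torch
    import torch.nn.functional as F

    def irl_triplet_loss(mm_repr, pos_graph_emb, neg_graph_emb, margin=0.2):
        # Anchor: multi-modal item representation from the pre-trained encoder.
        # Positive: the same item's graph embedding learned from user-item interactions.
        # Negative: a sampled (e.g. in-batch) other item's graph embedding.
        anchor = F.normalize(mm_repr, dim=-1)
        pos = F.normalize(pos_graph_emb, dim=-1)
        neg = F.normalize(neg_graph_emb, dim=-1)
        pos_dist = 1.0 - (anchor * pos).sum(dim=-1)   # cosine distance to positive
        neg_dist = 1.0 - (anchor * neg).sum(dim=-1)   # cosine distance to negative
        # Hinge: the anchor should be closer to its own graph embedding than to
        # the negative item's by at least `margin`.
        return F.relu(pos_dist - neg_dist + margin).mean()

In a full pipeline, mm_repr would come from the Transformer encoder over an item's image and text features, pos_graph_emb from a graph embedding method run on the user-item interaction graph, and the loss would be added as a pre-training task alongside the usual self-supervised objectives.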

Keywords:
Recommender systems; Multi-modal representation learning; Feature learning; Graph embedding; Transformer; Machine learning; Information retrieval


Topics

Recommender Systems and Techniques
Multimodal Machine Learning Applications
Advanced Image and Video Retrieval Techniques