Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation

Yan Zhou; Jie Guo; Hao Sun; Bin Song; F. Richard Yu

doi:10.1145/3539618.3591950

ScienceGate Book Chapters

JOURNAL ARTICLE

Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation

Yan Zhou Jie Guo Hao Sun Bin Song F. Richard Yu

Year: 2023 Pages: 1816-1820

DOI: 10.1145/3539618.3591950

Get Full-Text PDF Get Analytical Report

Abstract

The main idea of multimodal recommendation is the rational utilization of the item's multimodal information to improve the recommendation performance. Previous works directly integrate item multimodal features with item ID embeddings, ignoring the inherent semantic relations contained in the multimodal features. In this paper, we propose a novel and effective aTtention-guided Multi-step FUsion Network for multimodal recommendation, named TMFUN. Specifically, our model first constructs modality feature graph and item feature graph to model the latent item-item semantic structures. Then, we use the attention module to identify inherent connections between user-item interaction data and multimodal data, evaluate the impact of multimodal data on different interactions, and achieve early-step fusion of item features. Furthermore, our model optimizes item representation through the attention-guided multi-step fusion strategy and contrastive learning to improve recommendation performance. The extensive experiments on three real-world datasets show that our model has superior performance compared to the state-of-the-art models.

Keywords:

Computer science Artificial intelligence Graph Feature (linguistics) Feature learning Machine learning Sensor fusion Semantic feature Modality (human–computer interaction) Theoretical computer science

Metrics

Cited By

8.66

FWCI (Field Weighted Citation Impact)

Refs

0.97

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Recommender Systems and Techniques

Physical Sciences → Computer Science → Information Systems

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Graph Neural Networks

Physical Sciences → Computer Science → Artificial Intelligence

Attention-guided Multi-step Fusion: A Hierarchical Fusion Network for Multimodal Recommendation

Abstract

Metrics

Citation History

Topics

Related Documents

Discrepancy Learning Guided Hierarchical Fusion Network for Multi-modal Recommendation

A hierarchical attention neural network with multi-view fusion for online course recommendation

Attention Guided Network for Multi Exposure Image Fusion

Hierarchical Attention‐Based Multimodal Fusion Network for Video Emotion Recognition

Hierarchical Multimodal Fusion Network with Dynamic Multi-task Learning