JOURNAL ARTICLE

Multi-modal Graph Attention Network for Video Recommendation

Abstract

In view of the problems of cold start and data interaction in recommendation systems, and most current recommendation algorithms ignore the diversity of data types, the combination of multimodal data and knowledge graph is bound to improve the pertinence of video recommendation. In this paper, we propose Multi-modal Knowledge Graph Attention Network (MMKGV) model, and all the entity nodes of the knowledge graph are innovatively introduced into multimodal information. The high-order recursive node information dissemination and information aggregation are carried out on the multimodal knowledge graph through the graph attention network. In the model, the triplet function of the knowledge graph is used to construct the triplet inference relationship, and the vector representation generated by the final aggregation is used for recommendation. Through extensive experiments on two public datasets TikTok and Kwai, the results show that the MMKGV can effectively improve the effect of video recommendation compared with other comparison algorithms.

Keywords:
Computer science Graph Modal Inference Theoretical computer science Graph database Recommender system Construct (python library) Information retrieval Artificial intelligence

Metrics

2
Cited By
0.76
FWCI (Field Weighted Citation Impact)
17
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Recommender Systems and Techniques
Physical Sciences →  Computer Science →  Information Systems
Advanced Graph Neural Networks
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.