JOURNAL ARTICLE

Knowledge-Enhanced Scene Graph Generation with Multimodal Relation Alignment (Student Abstract)

Ze FuJunhao FengChangmeng ZhengYi Cai

Year: 2022 Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Vol: 36 (11)Pages: 12947-12948   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Existing scene graph generation methods suffer the limitations when the image lacks of sufficient visual contexts. To address this limitation, we propose a knowledge-enhanced scene graph generation model with multimodal relation alignment, which supplements the missing visual contexts by well-aligned textual knowledge. First, we represent the textual information into contextualized knowledge which is guided by the visual objects to enhance the contexts. Furthermore, we align the multimodal relation triplets by co-attention module for better semantics fusion. The experimental results show the effectiveness of our method.

Keywords:
Computer science Relation (database) Knowledge graph Scene graph Graph Semantics (computer science) Artificial intelligence Information retrieval Natural language processing Theoretical computer science Data mining

Metrics

2
Cited By
0.14
FWCI (Field Weighted Citation Impact)
9
Refs
0.39
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.