Knowledge-Enhanced Scene Graph Generation with Multimodal Relation Alignment (Student Abstract)

Ze Fu; Junhao Feng; Changmeng Zheng; Yi Cai

doi:10.1609/aaai.v36i11.21610

ScienceGate Book Chapters

JOURNAL ARTICLE

Knowledge-Enhanced Scene Graph Generation with Multimodal Relation Alignment (Student Abstract)

Ze Fu Junhao Feng Changmeng Zheng Yi Cai

Year: 2022 Journal: Proceedings of the AAAI Conference on Artificial Intelligence Vol: 36 (11)Pages: 12947-12948 Publisher: Association for the Advancement of Artificial Intelligence

DOI: 10.1609/aaai.v36i11.21610

Get Full-Text PDF Get Analytical Report

Abstract

Existing scene graph generation methods suffer the limitations when the image lacks of sufficient visual contexts. To address this limitation, we propose a knowledge-enhanced scene graph generation model with multimodal relation alignment, which supplements the missing visual contexts by well-aligned textual knowledge. First, we represent the textual information into contextualized knowledge which is guided by the visual objects to enhance the contexts. Furthermore, we align the multimodal relation triplets by co-attention module for better semantics fusion. The experimental results show the effectiveness of our method.

Keywords:

Computer science Relation (database) Knowledge graph Scene graph Graph Semantics (computer science) Artificial intelligence Information retrieval Natural language processing Theoretical computer science Data mining

Metrics

Cited By

0.14

FWCI (Field Weighted Citation Impact)

Refs

0.39

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Knowledge-Enhanced Scene Graph Generation with Multimodal Relation Alignment (Student Abstract)

Abstract

Metrics

Citation History

Topics

Related Documents

DRAKE: Deep Pair-Wise Relation Alignment for Knowledge-Enhanced Multimodal Scene Graph Generation in Social Media Posts

Self-Supervised Relation Alignment for Scene Graph Generation

Relation Extraction with Knowledge-Enhanced Prompt-Tuning on Multimodal Knowledge Graph

Multimodal Relation Extraction with Efficient Graph Alignment

A relation enhanced model for temporal knowledge graph alignment