A Novel Framework for Scene Graph Generation via Prior Knowledge

Zhenghao Wang; Jing Lian; Linhui Li; Jian Zhao

doi:10.1109/tcsvt.2023.3319633

JOURNAL ARTICLE

Year: 2023
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Vol: 34 (5)
Pages: 3768-3781
Publisher: Institute of Electrical and Electronics Engineers

Abstract

Scene graph generation aims to recognize the objects in an image and infer the relationships between them, providing a structured, comprehensive description of the visual scene. However, the long-tailed distribution of relations remains a challenge for scene graph generation. This paper proposes a novel framework that joins knowledge-driven and data-driven learning to address the long-tail problem. The proposed framework consists of two modules: a relation inference module and a prior knowledge learning module. The relation inference module learns the relational features of entity pairs in images and the structural features of scene graphs. The prior knowledge learning module learns triplet representations from a knowledge graph and uses them as prior knowledge to provide logical guidance and constraints for relation inference. This gives relation inference a prior bias that shifts predictions away from head categories and towards reasonable categories, thereby mitigating the long-tail problem. Experimental results indicate that the proposed framework outperforms existing methods on the Visual Genome dataset and that the relations in the generated scene graphs are logically reasonable.
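
The abstract does not state how the prior is combined with the relation scores. As an illustrative sketch only, one common way to apply a prior bias is to add a weighted log-prior to the visual relation logits before normalizing. Everything in the block below is an assumption made for illustration and is not taken from the paper: the function names `softmax` and `apply_prior`, the `weight` parameter, the predicate list, and all numbers.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def apply_prior(visual_logits, prior_probs, weight=1.0, eps=1e-8):
    """Add a weighted log-prior to each relation logit, then renormalize.

    Hypothetical fusion rule, not the authors' method.
    """
    biased = [
        logit + weight * math.log(p + eps)
        for logit, p in zip(visual_logits, prior_probs)
    ]
    return softmax(biased)


# Hypothetical predicate vocabulary for the entity pair (person, horse).
PREDICATES = ["on", "riding", "wearing"]

# Made-up visual logits that favor the head predicate "on".
visual_logits = [2.0, 1.5, 0.2]

# Made-up prior over predicates for (person, horse), standing in for
# scores derived from knowledge-graph triplets.
prior = [0.15, 0.80, 0.05]

before = softmax(visual_logits)
after = apply_prior(visual_logits, prior)

best_before = PREDICATES[before.index(max(before))]
best_after = PREDICATES[after.index(max(after))]
print(best_before, "->", best_after)
```

In this toy case the visual scores alone pick the frequent predicate, while the prior moves the prediction to the more specific one. Setting `weight` to 0 recovers the purely data-driven prediction.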

Keywords:

Inference; Computer science; Scene graph; Relation (database); Artificial intelligence; Graph; Knowledge graph; Machine learning; Statistical relational learning; Knowledge representation and reasoning; Theoretical computer science; Relational database; Data mining; Rendering (computer graphics)

Metrics

FWCI (Field Weighted Citation Impact): 0.55

Citation Normalized Percentile: 0.62

Topics

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Advanced Graph Neural Networks

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Image and Video Retrieval Techniques

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

Enriching Scene-Graph Generation with Prior Knowledge from Work Instruction

Prior Knowledge-driven Dynamic Scene Graph Generation with Causal Inference

3D Scene Graph Generation Using Prior Knowledge from Large Language Model (LLM)

Dynamic Scene Graph Generation via Temporal Prior Inference

Zero-Shot Scene Graph Generation with Knowledge Graph Completion