JOURNAL ARTICLE

Complex Relation Embedding for Scene Graph Generation

Zheng WangXing XuYin Zhang⋆Yang YangHeng Tao Shen

Year: 2022 Journal:   IEEE Transactions on Neural Networks and Learning Systems Vol: 35 (6)Pages: 8321-8335   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Given an input image, scene graph generation (SGG) aims to generate comprehensive visual relationships between objects in the form of graphs. Recently, more attention to the design of complex networks and complicated strategies has been paid to the long tail issue caused by the imbalanced class distribution. However, most existing methods adopt the concatenated features of two objects in real space as the final relation representation for a given triplet. We mainly argue that such a simple concatenation may neglect the importance of complex interactions between objects, which results in the diversity of visual relations. In addition, the representation learning in real space is also inadequate to express this property. To alleviate these issues, we seamlessly incorporate Hermitian inner product into existing models to facilitate the generation of scene graphs by learning Relation Embedding in Complex space (CoRE). More specifically, we first introduce the concept of complex-valued representations for entities and then formulate the relation triplets with Hermitian inner product in complex space. Finally, we investigate the effect of utilizing only real component or both of Hermitian inner product on inferring more reasonable interaction between objects for scene graphs. Comprehensive experiments on two widely used benchmark datasets, Visual Genome (VG) and Open Image, demonstrate our effectiveness, superiority, and generalization on various metrics for biased or unbiased inference.

Keywords:
Embedding Computer science Theoretical computer science Scene graph Inference Relation (database) Graph product Generalization Hermitian matrix Concatenation (mathematics) Graph Representation (politics) Artificial intelligence Mathematics Data mining Pure mathematics Line graph Combinatorics Pathwidth

Metrics

10
Cited By
1.24
FWCI (Field Weighted Citation Impact)
69
Refs
0.78
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Quaternion Relation Embedding for Scene Graph Generation

Zheng WangXing XuGuoqing WangYang YangHeng Tao Shen

Journal:   IEEE Transactions on Multimedia Year: 2023 Vol: 25 Pages: 8646-8656
JOURNAL ARTICLE

Relation Regularized Scene Graph Generation

Yuyu GuoLianli GaoJingkuan SongPeng WangNicu SebeHeng Tao ShenXuelong Li

Journal:   IEEE Transactions on Cybernetics Year: 2021 Vol: 52 (7)Pages: 5961-5972
JOURNAL ARTICLE

Multimodal Context Embedding for Scene Graph Generation

Gayoung JungIncheol Kim

Journal:   Journal of Information Processing Systems Year: 2020 Vol: 16 (6)Pages: 1250-1260
JOURNAL ARTICLE

RelTR: Relation Transformer for Scene Graph Generation

Yuren CongMichael Ying YangBodo Rosenhahn

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2023 Vol: 45 (9)Pages: 11169-11183
© 2026 ScienceGate Book Chapters — All rights reserved.