JOURNAL ARTICLE

Quaternion Relation Embedding for Scene Graph Generation

Zheng WangXing XuGuoqing WangYang YangHeng Tao Shen

Year: 2023 Journal:   IEEE Transactions on Multimedia Vol: 25 Pages: 8646-8656   Publisher: Institute of Electrical and Electronics Engineers

Abstract

As an important visual understanding task, scene graph generation has been drawing widespread attention and could boost a broad range of downstream vision applications. Traditional scene graph generation methods based on different context refinements are trained with probabilistic chain rule, which treats objects and relationships as independent entities. Despite their surprisingly great progress, such a plain formulation unconsciously ignores the latent geometric structure of entities and relationships. To address this issue, we move beyond the traditional real-valued representations and use Quat ernion R elation E mbedding (QuatRE) to generate scene graphs with more expressive hypercomplex representations. More specifically, we introduce the concept of quaternion representations, hyper-complex valued with three imaginary components for objects entities, then formulate the relation triplets with Hamilton product. Benefiting from explicitly modeling the latent inter-dependencies among all imaginary components and strong expressive capacity, our proposed QuatRE method could better capture the interactions between entities. More importantly, our novel QuatRE method can be treated as a plug-in and well generalized into other methods for performance improvement as it involves no additional layers. Finally, extensive comparisons of our proposed method against the state-of-the-art methods on two large-scale and widely-used datasets, i.e. Visual Genome and Open Images, demonstrated our superiority and generalization capability on various metrics for biased or unbiased inference.

Keywords:
Hypercomplex number Quaternion Computer science Scene graph Embedding Artificial intelligence Graph Relation (database) Theoretical computer science Probabilistic logic Natural language processing Algorithm Mathematics Data mining

Metrics

24
Cited By
4.37
FWCI (Field Weighted Citation Impact)
61
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Complex Relation Embedding for Scene Graph Generation

Zheng WangXing XuYin Zhang⋆Yang YangHeng Tao Shen

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2022 Vol: 35 (6)Pages: 8321-8335
JOURNAL ARTICLE

Relation Regularized Scene Graph Generation

Yuyu GuoLianli GaoJingkuan SongPeng WangNicu SebeHeng Tao ShenXuelong Li

Journal:   IEEE Transactions on Cybernetics Year: 2021 Vol: 52 (7)Pages: 5961-5972
JOURNAL ARTICLE

Multimodal Context Embedding for Scene Graph Generation

Gayoung JungIncheol Kim

Journal:   Journal of Information Processing Systems Year: 2020 Vol: 16 (6)Pages: 1250-1260
JOURNAL ARTICLE

RelTR: Relation Transformer for Scene Graph Generation

Yuren CongMichael Ying YangBodo Rosenhahn

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2023 Vol: 45 (9)Pages: 11169-11183
© 2026 ScienceGate Book Chapters — All rights reserved.