JOURNAL ARTICLE

Self-Supervised Relation Alignment for Scene Graph Generation

Abstract

The goal of scene graph generation is to predict a graph from an input image, where nodes correspond to identified and localized objects and edges to their corresponding interaction predicates. Existing methods are trained in a fully supervised manner and focus on message passing mechanisms, loss functions, and/or bias mitigation. In this work we introduce a simple-yet-effective self-supervised relational alignment regularization designed to improve the scene graph generation performance. The proposed alignment is general and can be combined with any existing scene graph generation framework, where it is trained alongside the original model's objective. The alignment is achieved through distillation, where an auxiliary relation prediction branch, that mirrors and shares parameters with the supervised counterpart, is designed. In the auxiliary branch, relational input features are partially masked prior to message passing and predicate prediction. The predictions for masked relations are then aligned with the supervised counterparts after the message passing. We illustrate the effectiveness of this self-supervised relational alignment in conjunction with two scene graph generation architectures, SGTR [25] and Neural Motifs [53], and show that in both cases we achieve significantly improved performance.

Keywords:
Computer science Relation (database) Artificial intelligence Graph Computer vision Theoretical computer science Data mining

Metrics

1
Cited By
0.64
FWCI (Field Weighted Citation Impact)
68
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Knowledge-Enhanced Scene Graph Generation with Multimodal Relation Alignment (Student Abstract)

Ze FuJunhao FengChangmeng ZhengYi Cai

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2022 Vol: 36 (11)Pages: 12947-12948
JOURNAL ARTICLE

Relation Regularized Scene Graph Generation

Yuyu GuoLianli GaoJingkuan SongPeng WangNicu SebeHeng Tao ShenXuelong Li

Journal:   IEEE Transactions on Cybernetics Year: 2021 Vol: 52 (7)Pages: 5961-5972
JOURNAL ARTICLE

Improving Predicate Representation in Scene Graph Generation by Self-Supervised Learning

So HasegawaMasayuki HiromotoAkira NakagawaYuhei Umeda

Journal:   2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Year: 2023 Vol: 33 Pages: 2739-2748
JOURNAL ARTICLE

Quaternion Relation Embedding for Scene Graph Generation

Zheng WangXing XuGuoqing WangYang YangHeng Tao Shen

Journal:   IEEE Transactions on Multimedia Year: 2023 Vol: 25 Pages: 8646-8656
© 2026 ScienceGate Book Chapters — All rights reserved.