JOURNAL ARTICLE

Reasoning in Different Directions: Triplet Learning for Scene Graph Generation

Xuecheng Sun, Zhe-Ming Lu, Zewei He, Ziqian Lu, Hao Luo

Year: 2023 Journal: IEEE Access Vol: 11 Pages: 103069-103078 Publisher: Institute of Electrical and Electronics Engineers

Abstract

Scene graph generation aims to detect objects and their relations in images, providing structured representations for scene understanding. Mainstream approaches currently detect the objects first and then solve a classification task to determine the relation between each object pair, ignoring the other combinations of the subject-predicate-object triplet. In this work we propose a triplet learning paradigm for scene graph generation in which, given any two elements of the triplet, the model learns to predict the third. A multi-task learning scheme is adopted to equip a scene graph generation model with the triplet learning task: the prediction heads for the subject, object and predicate share the same backbone and are jointly trained. The proposed method requires no additional annotation and is easy to embed in existing networks. It helps scene graph generation models generalize better and can therefore be applied to both biased and unbiased methods. Moreover, we introduce a new Graph Structure-Aware Transformer (GSAT) model that incorporates the structural information of the scene graph via a modified self-attention mechanism. Extensive experiments show that the proposed triplet learning consistently improves the performance of several state-of-the-art models on the Visual Genome dataset.
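The "given any two, predict the third" idea can be illustrated with a small data-preparation sketch. It is not the authors' implementation: the names `Triplet`, `MaskedExample`, `expand_triplet` and `joint_loss` are hypothetical, and the equal loss weights are an assumption, since the abstract does not say how the three task losses are combined. The sketch only shows why no extra annotation is needed: each annotated triplet already yields three prediction targets.

```python
from typing import Dict, List, NamedTuple, Optional, Tuple

# Slot names of a scene-graph triplet (hypothetical naming for this sketch).
SLOTS = ("subject", "predicate", "obj")


class Triplet(NamedTuple):
    subject: str
    predicate: str
    obj: str


class MaskedExample(NamedTuple):
    target_slot: str                      # which slot the model must predict
    context: Tuple[Tuple[str, str], ...]  # the two known (slot, label) pairs
    target: str                           # ground-truth label of the hidden slot


def expand_triplet(t: Triplet) -> List[MaskedExample]:
    """Turn one annotated triplet into three tasks, hiding one slot at a time."""
    examples = []
    for slot in SLOTS:
        context = tuple((s, getattr(t, s)) for s in SLOTS if s != slot)
        examples.append(MaskedExample(slot, context, getattr(t, slot)))
    return examples


def joint_loss(losses: Dict[str, float],
               weights: Optional[Dict[str, float]] = None) -> float:
    """Weighted sum of the per-head losses (equal weights are an assumption)."""
    if weights is None:
        weights = {s: 1.0 for s in SLOTS}
    return sum(weights[s] * losses[s] for s in SLOTS)


if __name__ == "__main__":
    for ex in expand_triplet(Triplet("man", "riding", "horse")):
        print(ex.target_slot, ex.context, "->", ex.target)
```

In a real model the three targets would be predicted by separate heads on top of shared backbone features, and the per-head losses would be actual classification losses rather than the plain floats used here.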

Keywords:
Computer science; Scene graph; Artificial intelligence; Generalizability theory; Graph; Theoretical computer science; Predicate (mathematical logic); Annotation; Machine learning

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 52
Citation Normalized Percentile: 0.10

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
