JOURNAL ARTICLE

Divide-and-Conquer Predictor for Unbiased Scene Graph Generation

Xianjing HanXingning DongXuemeng SongTian GanYibing ZhanYan YanLiqiang Nie

Year: 2022 Journal:   IEEE Transactions on Circuits and Systems for Video Technology Vol: 32 (12)Pages: 8611-8622   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Scene Graph Generation (SGG) aims to detect the objects and their pairwise predicates in an image. Existing SGG methods mainly fulfil the challenging predicate prediction task that involves severe long-tailed data distribution with a single classifier. However, we argue that this may be enough to differentiate predicates that present obvious differences (e.g., $on$ and $near$ ), but not sufficient to distinguish similar predicates that only have subtle differences (e.g., $on$ and $standing~on$ ). Towards this end, we divide the predicate prediction into a few sub-tasks with a Divide-and-Conquer Predictor (DC-Predictor). Specifically, we first develop an offline pattern-predicate correlation mining algorithm to discover the similar predicates that share the same object interaction pattern. Based on that, we devise a general pattern classifier and a set of specific predicate classifiers for DC-Predictor. The former works on recognizing the pattern of a given object pair and routing it to the corresponding specific predicate classifier, while the latter aims to differentiate similar predicates in each specific pattern. In addition, we introduce the Bayesian Personalized Ranking loss in each specific predicate classifier to enhance the pairwise differentiation between head predicates and their similar ones. Experiments on VG150 and GQA datasets show the superiority of our model over state-of-the-art methods.

Keywords:
Notation Predicate (mathematical logic) Classifier (UML) Artificial intelligence Divide and conquer algorithms Pairwise comparison Computer science Graph Mathematics Algorithm Natural language processing Theoretical computer science Discrete mathematics Programming language Arithmetic

Metrics

28
Cited By
3.47
FWCI (Field Weighted Citation Impact)
63
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.