JOURNAL ARTICLE

Decomposed Prototype Learning for Few-Shot Scene Graph Generation

X. Y. LiJun XiaoGuikun ChenYinfu FengYi YangAn-An LiuLong Chen

Year: 2024 Journal:   ACM Transactions on Multimedia Computing Communications and Applications Vol: 21 (1)Pages: 1-24   Publisher: Association for Computing Machinery

Abstract

Today's scene graph generation (SGG) models typically require abundant manual annotations to learn new predicate types. Therefore, it is difficult to apply them to real-world applications with massive uncommon predicate categories whose annotations are hard to collect. In this article, we focus on Few-Shot SGG (FSSGG) , which encourages SGG models to be able to quickly transfer previous knowledge and recognize unseen predicates well with only a few examples. However, current methods for FSSGG are hindered by the high intra-class variance of predicate categories in SGG: On one hand, each predicate category commonly has multiple semantic meanings under different contexts. On the other hand, the visual appearance of relation triplets with the same predicate differs greatly under different subject–object compositions. Such great variance of inputs makes it hard to learn generalizable representation for each predicate category with current few-shot learning (FSL) methods. However, we found that this intra-class variance of predicates is highly related to the composed subjects and objects. To model the intra-class variance of predicates with subject–object context, we propose a novel Decomposed Prototype Learning (DPL) model for FSSGG. Specifically, we first construct a decomposable prototype space to capture diverse semantics and visual patterns of subjects and objects for predicates by decomposing them into multiple prototypes. Afterwards, we integrate these prototypes with different weights to generate query-adaptive predicate representation with more reliable semantics for each query sample. We conduct extensive experiments and compare with various baseline methods to show the effectiveness of our method.

Keywords:
Computer science Shot (pellet) Graph Artificial intelligence Computer vision Computer graphics (images) Human–computer interaction Theoretical computer science

Metrics

2
Cited By
1.06
FWCI (Field Weighted Citation Impact)
54
Refs
0.69
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Subspace prototype learning for few-Shot remote sensing scene classification

Wuli WangLei XingPeng RenYumeng JiangGe WangBaodi Liu

Journal:   Signal Processing Year: 2023 Vol: 208 Pages: 108976-108976
JOURNAL ARTICLE

Panoptic Scene Graph Generation with Semantics-Prototype Learning

Li LiWei JiYiming WuMengze LiYou QinLina WeiRoger Zimmermann

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2024 Vol: 38 (4)Pages: 3145-3153
JOURNAL ARTICLE

Weighted Prototype Network for Few-Shot Learning Via Graph Structure

Yifan ZhouLei Yu

Journal:   SSRN Electronic Journal Year: 2023
JOURNAL ARTICLE

Geometer: Graph Few-Shot Class-Incremental Learning via Prototype Representation

Bin LüXiaoying GanLina YangWeinan ZhangLuoyi FuXinbing Wang

Journal:   Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining Year: 2022 Pages: 1152-1161
© 2026 ScienceGate Book Chapters — All rights reserved.