JOURNAL ARTICLE

Local context attention learning for fine-grained scene graph generation

Xuhan ZhuRuiping WangXiangyuan LanYaowei Wang

Year: 2024 Journal:   Pattern Recognition Vol: 156 Pages: 110708-110708   Publisher: Elsevier BV

Abstract

Fine-grained scene graph generation aims to parse the objects and their fine-grained relationships within scenes. Despite the significant progress in recent years, their performance is still limited by two major issues: (1) ambiguous perception under a global view; (2) the lack of reliable, fine-grained annotations. We argue that understanding the local context is important in addressing the two issues. However, previous works often overlook it, which limits their effectiveness in fine-grained scene graph generation. To tackle this challenge, we introduce a Local-context Attention Learning method that concentrates on local context and can generate high-reliability, fine-grained annotations. It comprises two components: (1) The Fine-grained Location Attention Network (FLAN), a multi-branch network that encompasses global and local branches, can attend to local informative context and perceive granularity levels in different regions, thereby adaptively enhancing the learning of fine-grained locations. (2) The Fine-grained Location Label Transfer (FLLT) method identifies coarse-grained labels inconsistent with the local context and determines which labels should be transferred through the global confidence thresholding strategy, finally transferring them to reliable local context-consistent fine-grained ones. Experiments conducted on the Visual Genome, OpenImage, and GQA-200 datasets show that the proposed methods achieve significant improvements on the fine-grained scene graph generation task. By addressing the challenge mentioned above, our method also achieves state-of-the-art performances on the three datasets.

Keywords:
Computer science Artificial intelligence Context (archaeology) Granularity Graph Scene graph Machine learning Rendering (computer graphics) Theoretical computer science

Metrics

6
Cited By
3.18
FWCI (Field Weighted Citation Impact)
57
Refs
0.86
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Neural Network Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Fine-Grained Predicates Learning for Scene Graph Generation

Xinyu LyuLianli GaoYuyu GuoZhou ZhaoHao HuangHeng Tao ShenJingkuan Song

Journal:   2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Year: 2022
JOURNAL ARTICLE

Adaptive Fine-Grained Predicates Learning for Scene Graph Generation

Xinyu LyuLianli GaoPengpeng ZengHeng Tao ShenJingkuan Song

Journal:   IEEE Transactions on Pattern Analysis and Machine Intelligence Year: 2023 Vol: 45 (11)Pages: 13921-13940
© 2026 ScienceGate Book Chapters — All rights reserved.