JOURNAL ARTICLE

Topic scene graphs for image captioning

Min Zhang, Jingxiang Chen, Pengfei Li, Ming Jiang, Zhe Zhou

Year: 2022   Journal: IET Computer Vision   Vol: 16 (4)   Pages: 364-375   Publisher: Institution of Engineering and Technology

Abstract

When describing an image, people can rapidly extract its topic and identify the main object, generating sentences that match the main idea of the image. However, most scene graph generation methods do not emphasise the importance of the image's topic. Consequently, captions generated by scene graph‐based image captioning models often fail to reflect the topic of the image and therefore fail to express its central idea. In this paper, we propose a method for image captioning based on topic scene graphs (TSG). First, we propose the structure of topic scene graphs, which express an image's topic and the relationships between its objects. Then, we utilise salient object detection to generate a topic scene graph that highlights the salient objects of the image. Note that our framework is agnostic to the choice of scene graph‐based image captioning model and can therefore be widely applied wherever the community seeks salient object predictions. We compare our topic scene graph with state‐of‐the‐art scene graph generation models and mainstream image captioning models on the MSCOCO and Visual Genome datasets, achieving better performance in both cases.
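The abstract does not give implementation details, so the following is only a minimal illustrative sketch of the general idea of highlighting salient objects in a scene graph, not the authors' method. It assumes that each detected object already has a saliency score in [0, 1] and that relations are (subject, predicate, object) triples. The names `Relation`, `topic_subgraph`, and the `threshold` parameter are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    """One scene graph edge: subject --predicate--> obj (hypothetical type)."""
    subject: str
    predicate: str
    obj: str


def topic_subgraph(saliency, relations, threshold=0.5):
    """Keep only relations that touch at least one salient ("topic") object.

    saliency  : dict mapping object name -> assumed saliency score in [0, 1]
    relations : list of Relation triples from any scene graph generator
    threshold : assumed cut-off above which an object counts as a topic
    """
    # Objects whose saliency reaches the threshold are treated as topics.
    topics = {name for name, score in saliency.items() if score >= threshold}

    # Drop relations in which neither endpoint is a topic object.
    kept = [r for r in relations if r.subject in topics or r.obj in topics]

    # Put relations with the most salient endpoint first.
    kept.sort(
        key=lambda r: max(saliency.get(r.subject, 0.0), saliency.get(r.obj, 0.0)),
        reverse=True,
    )
    return topics, kept


# Toy example with made-up scores and relations.
saliency = {"man": 0.9, "horse": 0.7, "fence": 0.2, "tree": 0.1}
relations = [
    Relation("man", "riding", "horse"),
    Relation("horse", "near", "fence"),
    Relation("tree", "behind", "fence"),
]
topics, kept = topic_subgraph(saliency, relations)
for r in kept:
    print(r.subject, r.predicate, r.obj)
```

In this toy example the relation between the two low-saliency objects (tree, fence) is dropped, so a downstream captioning model would only see relations involving the salient objects. A hard threshold is just one possible design choice; a soft weighting of nodes and edges would be another.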

Keywords:
Closed captioning, Computer science, Image (mathematics), Artificial intelligence, Computer vision, Information retrieval

Metrics

Cited By: 7
FWCI (Field Weighted Citation Impact): 0.87
Refs: 43
Citation Normalized Percentile: 0.69

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Video Analysis and Summarization
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Image captioning based on scene graphs: A survey

Junhua Jia, Xiangqian Ding, Shunpeng Pang, Xiaoyan Gao, Xiaowei Xin, Ruotong Hu, Jie Nie

Journal: Expert Systems with Applications   Year: 2023   Vol: 231   Pages: 120698-120698
BOOK-CHAPTER

Topic Guided Image Captioning with Scene and Spatial Features

Usman Zia, Muhammad Mohsin Riaz, Abdul Ghafoor

Lecture notes in networks and systems   Year: 2022   Pages: 180-191
JOURNAL ARTICLE

Auto-encoding and Distilling Scene Graphs for Image Captioning

Xu Yang, Hanwang Zhang, Jianfei Cai

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence   Year: 2020   Vol: 44 (5)   Pages: 1-1