JOURNAL ARTICLE

Multi-Modal Graph Aggregation Transformer for image captioning

Lizhi ChenKesen Li

Year: 2024 Journal:   Neural Networks Vol: 181 Pages: 106813-106813   Publisher: Elsevier BV
Keywords:
Closed captioning Transformer Computer science Modal Graph Artificial intelligence Image (mathematics) Voltage Theoretical computer science Electrical engineering Engineering

Metrics

11
Cited By
5.83
FWCI (Field Weighted Citation Impact)
72
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Human Pose and Action Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Adaptive Multi-granularity Aggregation Transformer for Image Captioning

D. LiYe WangQun Liu

Lecture notes in computer science Year: 2023 Pages: 339-353
JOURNAL ARTICLE

Boosting Entity-Aware Image Captioning With Multi-Modal Knowledge Graph

Wentian ZhaoXinxiao Wu

Journal:   IEEE Transactions on Multimedia Year: 2023 Vol: 26 Pages: 2659-2670
JOURNAL ARTICLE

Relational Graph Reasoning Transformer for Image Captioning

Xinyu XiaoZixun SunTingtian LiYipeng Yu

Journal:   2022 IEEE International Conference on Multimedia and Expo (ICME) Year: 2022
JOURNAL ARTICLE

Image captioning with transformer and knowledge graph

Yu ZhangXinyu ShiSiya MiXu Yang

Journal:   Pattern Recognition Letters Year: 2021 Vol: 143 Pages: 43-49
JOURNAL ARTICLE

Self-supervised modal optimization transformer for image captioning

Ye WangD. LiQun LiuLi LiuGuoyin Wang

Journal:   Neural Computing and Applications Year: 2024 Vol: 36 (31)Pages: 19863-19878
© 2026 ScienceGate Book Chapters — All rights reserved.