JOURNAL ARTICLE

Task-Adaptive Attention for Image Captioning

Chenggang Yan, Yiming Hao, Liang Li, Jian Yin, An-An Liu, Zhendong Mao, Zhenyu Chen, Xingyu Gao

Year: 2021   Journal: IEEE Transactions on Circuits and Systems for Video Technology   Vol: 32 (1)   Pages: 43-51   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Attention mechanisms are now widely used in image captioning models. However, most attention models focus only on visual features. When generating syntax-related words, little visual information is needed, and in this case these attention models can mislead word generation. In this paper, we propose a Task-Adaptive Attention module for image captioning, which can alleviate this problem and learn implicit non-visual clues that help generate non-visual words. We further introduce a diversity regularization to enhance the expressive ability of the Task-Adaptive Attention module. Extensive experiments on the MSCOCO captioning dataset demonstrate that plugging our Task-Adaptive Attention module into a vanilla Transformer-based image captioning model improves performance.

Keywords:
Closed captioning; Computer science; Artificial intelligence; Syntax; Task (project management); Transformer; Regularization (linguistics); Natural language processing; Word (group theory); Task analysis; Image (mathematics); Speech recognition; Linguistics

Metrics

Cited By: 329
FWCI (Field Weighted Citation Impact): 24.33
Refs: 34
Citation Normalized Percentile: 1.00
Is in top 1%
Is in top 10%

Topics

Multimodal Machine Learning Applications
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Advanced Image and Video Retrieval Techniques
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition
Domain Adaptation and Few-Shot Learning
Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Adaptive Syncretic Attention for Constrained Image Captioning

Liang Yang, Haifeng Hu

Journal: Neural Processing Letters   Year: 2019   Vol: 50 (1)   Pages: 549-564

JOURNAL ARTICLE

Adaptively Aligned Image Captioning via Adaptive Attention Time

Lun Huang, Wenmin Wang, Yaxian Xia, Jie Chen

Journal: arXiv (Cornell University)   Year: 2019   Vol: 32   Pages: 8940-8949

JOURNAL ARTICLE

Image captioning with adaptive incremental global context attention

Changzhi Wang, Xiaodong Gu

Journal: Applied Intelligence   Year: 2021   Vol: 52 (6)   Pages: 6575-6597

JOURNAL ARTICLE

A Novel Adaptive Attention Model for Image Captioning

Donglin Liang, Jinzhao Wu, Anping He, Ming Ding

Journal: Journal of Physics Conference Series   Year: 2020   Vol: 1549 (3)   Pages: 032131-032131