JOURNAL ARTICLE

Abstractive Text-Image Summarization Using Multi-Modal Attentional Hierarchical RNN

Abstract

The rapid growth of multi-modal documents on the Internet makes multi-modal summarization research necessary. Most previous research summarizes texts or images separately. Recent neural summarization research shows the strength of the Encoder-Decoder model in text summarization. This paper proposes an abstractive text-image summarization model that uses an attentional hierarchical Encoder-Decoder model to summarize a text document and its accompanying images simultaneously, and then to align the sentences and images in the summaries. A multi-modal attentional mechanism is proposed to attend to the original sentences, images, and captions when decoding. The DailyMail dataset is extended by collecting images and captions from the Web. Experiments show that our model outperforms neural abstractive and extractive text summarization methods that do not consider images. In addition, our model can generate informative summaries of images.
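
The abstract does not give the equations of the multi-modal attentional mechanism, so the following is only a minimal sketch of the general idea: a decoder state attends to sentences, images, and captions separately, and the three resulting context vectors are then mixed. Everything here is an assumption made for illustration, not the paper's formulation: the dot-product scoring, the second-level mixing step, the function names (`attend`, `multimodal_attention`), and the premise that all three modalities have already been projected into one shared vector space.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def attend(query, keys):
    """Dot-product attention: return (weights, context vector) over `keys`."""
    weights = softmax([dot(query, k) for k in keys])
    dim = len(keys[0])
    context = [sum(w * k[i] for w, k in zip(weights, keys)) for i in range(dim)]
    return weights, context


def multimodal_attention(decoder_state, sentences, images, captions):
    """Attend to each modality separately, then mix the three contexts.

    Assumption (illustrative only): the mixing weights are attention
    scores of the decoder state against each modality's context vector.
    """
    per_modality = {
        "sentences": attend(decoder_state, sentences),
        "images": attend(decoder_state, images),
        "captions": attend(decoder_state, captions),
    }
    names = list(per_modality)
    contexts = [per_modality[n][1] for n in names]
    mix, fused = attend(decoder_state, contexts)
    item_weights = {n: per_modality[n][0] for n in names}
    return dict(zip(names, mix)), fused, item_weights


# Toy 2-dimensional vectors standing in for encoded inputs (hypothetical values).
state = [1.0, 0.0]
sentences = [[1.0, 0.0], [0.0, 1.0]]
images = [[0.5, 0.5]]
captions = [[0.0, 1.0], [1.0, 1.0]]

modality_mix, fused_context, item_weights = multimodal_attention(
    state, sentences, images, captions
)
print(modality_mix)   # how much each modality contributes at this step
print(item_weights)   # which sentence / image / caption is attended within each modality
```

In a sketch like this, the per-item weights over images are what one could inspect to pair a generated summary sentence with an image, which is the kind of sentence-image alignment the abstract mentions.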

Keywords:
Automatic summarization; Computer science; Modal; Artificial intelligence; Encoder; Natural language processing; Decoding methods; Image (mathematics); The Internet; Visualization; Pattern recognition (psychology); Information retrieval; Algorithm

Metrics

Cited By: 92
FWCI (Field Weighted Citation Impact): 5.76
Refs: 45
Citation Normalized Percentile: 0.96
Is in top 1%
Is in top 10%

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Multimodal Machine Learning Applications
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

Multi-Modal Abstractive Summarization based Transformer using Video Transcripts

Min Ye Lee, Sung Won Han

Journal: Journal of Korean Institute of Industrial Engineers; Year: 2021; Vol: 47 (5); Pages: 433-443

JOURNAL ARTICLE

Abstractive Text Summarization Using GAN

Tanushree Bharti, Satyam Kumar Sinha, Harshit Singhal, Rohit Saini, Dipesh Parihar

Journal: International Journal of Innovative Science and Research Technology (IJISRT); Year: 2024; Pages: 1117-1122

JOURNAL ARTICLE

Abstractive Text Summarization Using BART

Attada Venkataramana, K. Srividya, R. Cristin

Journal: 2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon); Year: 2022; Pages: 1-6