JOURNAL ARTICLE

Iterative Autoregressive Generation for Abstractive Summarization

Abstract

Abstractive summarization suffers from exposure bias caused by the teacher-forced maximum likelihood estimation (MLE) learning, that an autoregressive language model predicts the next token distribution conditioned on the exact pre-context during training while on its own predictions at inference. Preceding resolutions for this problem straightforwardly augment the pure token-level MLE with summary-level objectives. Although this method effectively exposes a model to its prediction errors during summary-level learning, such errors accumulate in the unidirectional autoregressive generation and further limit the learning efficiency. To address this problem, we imitate the human behavior of revising a manuscript multiple times after writing it and introduce a novel iterative autoregressive summarization (IARSum) paradigm, which iteratively rewrites a generated summary to approximate an errorless version. Concretely, IARSum performs iterative revisions after summarization, where the output of the previous revision is taken as the input for the next, and a minimum-risk training strategy is used to ensure that the original summary is effectively polished in every revision round. We conduct extensive experiments on two widely used datasets and show the new or matched state-of-the-art performance of IARSum.

Keywords:
Automatic summarization Autoregressive model Computer science Artificial intelligence Natural language processing Econometrics Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
24
Refs
0.03
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Introducing bidirectional attention for autoregressive models in abstractive summarization

Jianfei ZhaoXin SunChong Feng

Journal:   Information Sciences Year: 2024 Vol: 689 Pages: 121497-121497
BOOK-CHAPTER

News Headline Generation Using Abstractive Text Summarization

Jeetendra KumarShashi ShekharRashmi Gupta

Communications in computer and information science Year: 2022 Pages: 112-125
JOURNAL ARTICLE

Abstractive Summarization for Urdu Video Description Generation

Ali FaheemFaizad UllahMuhammad Sohaib AyubAsim Karim

Journal:   ACM Transactions on Asian and Low-Resource Language Information Processing Year: 2025 Vol: 24 (10)Pages: 1-21
© 2026 ScienceGate Book Chapters — All rights reserved.