Neural Abstractive Text Summarization with Sequence-to-Sequence Models

Tian Shi; Yaser Keneshloo; Naren Ramakrishnan; Chandan K. Reddy

doi:10.1145/3419106

JOURNAL ARTICLE

Year: 2021; Journal: ACM/IMS Transactions on Data Science; Vol: 2 (1); Pages: 1-37; Publisher: Association for Computing Machinery

Abstract

In the past few years, neural abstractive text summarization with sequence-to-sequence (seq2seq) models has gained considerable popularity. Many interesting techniques have been proposed to improve seq2seq models, enabling them to handle challenges such as saliency, fluency, and human readability, and to generate high-quality summaries. Generally speaking, most of these techniques differ in one of three categories: network structure, parameter inference, and decoding/generation. There are also other concerns, such as efficiency and parallelism for training a model. In this article, we provide a comprehensive literature survey on different seq2seq models for abstractive text summarization from the viewpoint of network structures, training strategies, and summary generation algorithms. Several models were first proposed for language modeling and generation tasks, such as machine translation, and later applied to abstractive text summarization. Hence, we also provide a brief review of these models. As part of this survey, we also develop an open source library, namely, the Neural Abstractive Text Summarizer (NATS) toolkit, for abstractive text summarization. An extensive set of experiments has been conducted on the widely used CNN/Daily Mail dataset to examine the effectiveness of several different neural network components. Finally, we benchmark two models implemented in NATS on two recently released datasets, namely, Newsroom and Bytecup.

Keywords:

Automatic summarization; Computer science; Artificial intelligence; Natural language processing; Artificial neural network; Fluency; Sequence (biology); Machine translation; Benchmark (surveying); Set (abstract data type); Multi-document summarization; Linguistics; Programming language

Metrics

Cited By: 191

FWCI (Field Weighted Citation Impact): 21.73

Refs

Citation Normalized Percentile: 1.00

Is in top 1%

Is in top 10%

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Towards neural abstractive clinical trial text summarization with sequence to sequence models

Turkish abstractive text summarization using pretrained sequence-to-sequence models

Abstractive Text Summarization Methods with Sequence to Sequence RNN

Abstractive method of text summarization with sequence to sequence RNNs

Bengali abstractive text summarization using sequence to sequence RNNs