Turkish abstractive text summarization using pretrained sequence-to-sequence models

Batuhan Baykara; Tunga Güngör

doi:10.1017/s1351324922000195

ScienceGate Book Chapters

JOURNAL ARTICLE

Turkish abstractive text summarization using pretrained sequence-to-sequence models

Batuhan Baykara Tunga Güngör

Year: 2022 Journal: Natural Language Engineering Vol: 29 (5)Pages: 1275-1304 Publisher: Cambridge University Press

DOI: 10.1017/s1351324922000195

Get Full-Text PDF Get Analytical Report

Abstract

Abstract The tremendous amount of increase in the number of documents available on the Web has turned finding the relevant piece of information into a challenging, tedious, and time-consuming activity. Accordingly, automatic text summarization has become an important field of study by gaining significant attention from the researchers. Lately, with the advances in deep learning, neural abstractive text summarization with sequence-to-sequence (Seq2Seq) models has gained popularity. There have been many improvements in these models such as the use of pretrained language models (e.g., GPT, BERT, and XLM) and pretrained Seq2Seq models (e.g., BART and T5). These improvements have addressed certain shortcomings in neural summarization and have improved upon challenges such as saliency, fluency, and semantics which enable generating higher quality summaries. Unfortunately, these research attempts were mostly limited to the English language. Monolingual BERT models and multilingual pretrained Seq2Seq models have been released recently providing the opportunity to utilize such state-of-the-art models in low-resource languages such as Turkish. In this study, we make use of pretrained Seq2Seq models and obtain state-of-the-art results on the two large-scale Turkish datasets, TR-News and MLSum, for the text summarization task. Then, we utilize the title information in the datasets and establish hard baselines for the title generation task on both datasets. We show that the input to the models has a substantial amount of importance for the success of such tasks. Additionally, we provide extensive analysis of the models including cross-dataset evaluations, various text generation options, and the effect of preprocessing in ROUGE evaluations for Turkish. It is shown that the monolingual BERT models outperform the multilingual BERT models on all tasks across all the datasets. Lastly, qualitative evaluations of the generated summaries and titles of the models are provided.

Keywords:

Automatic summarization Computer science Artificial intelligence Natural language processing Task (project management) Fluency Language model Turkish Semantics (computer science) Sequence (biology) Information retrieval Linguistics

Metrics

Cited By

4.31

FWCI (Field Weighted Citation Impact)

Refs

0.92

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Multimodal Machine Learning Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Turkish abstractive text summarization using pretrained sequence-to-sequence models

Abstract

Metrics

Citation History

Topics

Related Documents

Neural Abstractive Text Summarization with Sequence-to-Sequence Models

Bengali abstractive text summarization using sequence to sequence RNNs

IMPROVING SEQUENCE-TO-SEQUENCE MODELS FOR ABSTRACTIVE TEXT SUMMARIZATION USING META HEURISTIC APPROACHES

AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization

Towards neural abstractive clinical trial text summarization with sequence to sequence models