Text summarization is essential in natural language processing because of the rapid growth of textual data, which users need to condense into meaningful summaries quickly. There are two standard approaches to text summarization: extractive and abstractive. Many efforts have addressed summarizing texts in Latin-script languages. However, summarizing Arabic texts is challenging for many reasons, including the language's complexity, structure, and morphology. In addition, Arabic summarization lacks benchmark datasets and gold-standard evaluation metrics. Thus, the contribution of this research is multi-fold. First, it introduces a new Arabic benchmark dataset, called HASD, which includes 43k articles with their extractive and abstractive summaries. Second, it presents another new Arabic benchmark dataset, called AASD, which includes 150k articles with their abstractive summaries. Third, this work extends the well-known extractive EASC benchmark by adding an abstractive summary to each text. Fourth, this paper proposes a new measure, called Arabic-Rouge, for evaluating abstractive summaries based on structure and similarity between words. Finally, it investigates the impact of abstractive Arabic text summarization on different transformer models across different datasets. The models are tested on the proposed HASD, AASD, and modified EASC benchmarks and evaluated using Rouge, Bleu, and Arabic-Rouge. The experimental results are satisfactory compared to state-of-the-art methods.
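The Arabic-Rouge measure itself is defined in the paper; as background for the evaluation setup, a minimal sketch of the standard ROUGE-1 F1 score (unigram overlap between a reference and a candidate summary) might look like the following. The function name and the whitespace tokenization are illustrative assumptions, not the paper's implementation.

```python
from collections import Counter

def rouge1_f1(reference: str, candidate: str) -> float:
    """Illustrative ROUGE-1 F1: unigram precision/recall over whitespace tokens.

    Note: real Arabic evaluation would normally apply normalization and
    morphology-aware tokenization, which is part of what motivates the
    paper's Arabic-Rouge measure.
    """
    ref_counts = Counter(reference.split())
    cand_counts = Counter(candidate.split())
    # Clipped unigram overlap between reference and candidate.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)

# Hypothetical example: a 3-token candidate against a 4-token reference.
ref = "النص العربي ملخص جيد"
cand = "النص العربي ملخص"
score = rouge1_f1(ref, cand)  # precision = 1.0, recall = 0.75
```

Surface-overlap metrics like this penalize valid paraphrases, which is especially limiting for morphologically rich Arabic; the proposed Arabic-Rouge instead accounts for structure and word similarity.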
Heidi Ahmed Holiel, Nancy Mohamed, Arwa Ahmed, Walaa Medhat