JOURNAL ARTICLE

Arabic Extractive Summarization Using Pre-Trained Models

Yasmin EiniehAmal AlmansourAmani Jamal

Year: 2023 Journal:   Journal of King Abdulaziz University-Computing and Information Technology Sciences Vol: 12 (1)

Abstract

Automatic Text Summarization (ATS) is a crucial area of study in Natural Language Processing (NLP) due to the vast amount of online information available. Extractive summarization, which involves selecting important sentences from the original document without altering their wording, is one approach to generating summaries. While many methods for Arabic text summarization exist, deep learning applications are still in their early stages, and there is a shortage of available datasets. Unlike English, there have been fewer experiments conducted on Arabic language summarization due to its unique characteristics. This study aims to fill this gap by experimenting with several models for summarizing Arabic text, including QARiB, AraELECTRA, and AraBERT-base models, all trained using the KALIMA dataset. The AraBERT model performed exceptionally well, achieving high scores of 0.44, 0.26, and 0.44 on the ROUGE-1, ROUGE-2, and ROUGE-L measures, respectively.

Keywords:
Automatic summarization Arabic Natural language processing Artificial intelligence Computer science Linguistics Philosophy

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
39
Refs
0.21
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Low Resource Summarization using Pre-trained Language Models

Mubashir MunafHammad AfzalKhawir MahmoodNaima Iltaf

Journal:   ACM Transactions on Asian and Low-Resource Language Information Processing Year: 2024 Vol: 23 (10)Pages: 1-19
JOURNAL ARTICLE

Biomedical-domain pre-trained language model for extractive summarization

Yongping DuQingxiao LiLulin WangYanqing He

Journal:   Knowledge-Based Systems Year: 2020 Vol: 199 Pages: 105964-105964
© 2026 ScienceGate Book Chapters — All rights reserved.