JOURNAL ARTICLE

Automatic Arabic text summarization using clustering and keyphrase extraction

Abstract

As the number of electronic documents increases rapidly, the need for faster techniques to assess the relevance of these documents emerges. A summary is a concise representation of underlying text. A full understanding of the document is essential to form an ideal summary. However, achieving full understanding is either difficult or impossible for computers. Therefore, selecting important sentences from the original text and presenting these sentences as a summary present the most common techniques in automated text summarization. This paper propose a hybrid clustering method(partitioning and hierarchical) to group many Arabic documents into several clusters .Then keyphrase extraction module is applied to extract important Keyphrases from each cluster, which helps identify the most important sentences and find similar sentences based on several similarity algorithms. It applied to extract one sentence from a group of similar sentences while ignoring the other similar sentences (i.e., sentences that have a greater similarity than the predefined threshold). This model is designed for both single-and multi-document Arabic text summarization. The Recall-Oriented Understudy for Gisting Evaluation (ROGUE) matrix used for the evaluation. For the summarization dataset, Essex Arabic Summaries Corpus was used. It has many topic based articles with multiple human summaries. This model achieved an accuracy of 80 % for single-document and 62% for multi-document summarization.

Keywords:
Automatic summarization Computer science Arabic Cluster analysis Artificial intelligence Natural language processing Multi-document summarization Information retrieval Linguistics

Metrics

28
Cited By
2.41
FWCI (Field Weighted Citation Impact)
19
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Information Retrieval and Search Behavior
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Automatic Multi-Document Arabic Text Summarization Using Clustering and Keyphrase Extraction

Hamzah Noori FejerNazlia Omar

Journal:   Journal of Artificial Intelligence Year: 2014 Vol: 8 (1)Pages: 1-9
JOURNAL ARTICLE

Keyphrase based Evaluation of Automatic Text Summarization

Fatma ElghannamTarek El-Shishtawy

Journal:   International Journal of Computer Applications Year: 2015 Vol: 117 (7)Pages: 5-8
JOURNAL ARTICLE

Web Document Clustering by Using Automatic Keyphrase Extraction

Juhyun HanTae‐Hwan KimJoongmin Choi

Journal:   2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops Year: 2007
JOURNAL ARTICLE

Web Document Clustering by Using Automatic Keyphrase Extraction

Juhyun HanTae-Hwan KimJoongmin Choi

Journal:   2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops Year: 2007 Pages: 56-59
© 2026 ScienceGate Book Chapters — All rights reserved.