JOURNAL ARTICLE

Automatic Multi-Document Arabic Text Summarization Using Clustering and Keyphrase Extraction

Hamzah Noori FejerNazlia Omar

Year: 2014 Journal:   Journal of Artificial Intelligence Vol: 8 (1)Pages: 1-9

Abstract

Automatic text summarization has become important due to the rapid growth of information texts since it is very difficult for human beings to manually summarize large documents of texts.A full understanding of the document is essential to form an ideal summary.However, achieving full understanding is either difficult or impossible for computers.Therefore, selecting important sentences from the original text and presenting these sentences as a summary present the most common techniques in automated text summarization.Arabic natural language processing lacks tools and resources which are essential to advance research in Arabic text summarization.In addition to the limited resources, there has been little attention and research done in this field.Arabic text summarization still suffer from low accuracy as they use simple summarization techniques.The aim of this research is to improve Arabic text summarization by using clustering and keyphrase extraction.This study proposes a combined clustering method to group Arabic documents into several clusters.Keyphrase extraction module is applied to extract important keyphrases from each cluster, which helps to identify the most important sentences and find similar sentences based on several similarity algorithms.These algorithms are applied to extract one sentence from a group of similar sentences while ignoring the other similar sentences.The Recall-Oriented Understudy for Gisting Evaluation (ROGUE) metrics were used for the evaluation.For the summarization dataset the corpus DUC2002 was used.This model achieved an accuracy of 43.4%.The experiments have proved that the proposed model has given better performance in comparison to other work.

Keywords:
Automatic summarization Computer science Arabic Cluster analysis Natural language processing Multi-document summarization Information retrieval Document clustering Artificial intelligence Linguistics

Metrics

22
Cited By
0.97
FWCI (Field Weighted Citation Impact)
4
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Web Document Clustering by Using Automatic Keyphrase Extraction

Juhyun HanTae‐Hwan KimJoongmin Choi

Journal:   2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops Year: 2007
JOURNAL ARTICLE

Web Document Clustering by Using Automatic Keyphrase Extraction

Juhyun HanTae-Hwan KimJoongmin Choi

Journal:   2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops Year: 2007 Pages: 56-59
BOOK-CHAPTER

Multi-document Text Summarization Using Sentence Extraction

Ravinder AhujaWillson Anand

Advances in intelligent systems and computing Year: 2017 Pages: 235-242
© 2026 ScienceGate Book Chapters — All rights reserved.