JOURNAL ARTICLE

Automatic text summarization of Wikipedia articles

Abstract

The main objective of a text summarization system is to identify the most important information in a given text and present it to end users. In this paper, Wikipedia articles are given as input to the system, and extractive text summarization is performed by identifying text features and scoring the sentences accordingly. The text is first pre-processed by tokenizing the sentences and stemming the words. The sentences are then scored using several text features. Two novel features are introduced: the citations present in the text and the identification of synonyms. These features, along with traditional ones, are used to score the sentences. A neural network then uses the scores to classify each sentence as belonging to the summary or not. The user can specify what percentage of the original text should appear in the summary. Scoring the sentences based on citations is found to give the best results.
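The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the feature formulas, so the sketch assumes a normalized term-frequency feature and a count of Wikipedia-style citation markers such as [3], and it replaces the neural network classifier with a plain weighted sum. The synonym feature is omitted. The function names, the weights w_tf and w_cite, and the crude suffix-stripping stemmer (a stand-in for a real stemmer such as Porter's) are all hypothetical.

```python
import math
import re
from collections import Counter

# Assumption: citations appear as Wikipedia-style numeric markers, e.g. [3].
CITATION = re.compile(r"\[\d+\]")


def split_sentences(text):
    # Naive splitter: a sentence ends at . ! or ? plus any trailing citation markers.
    # It will mis-split abbreviations and decimal numbers.
    return [s.strip() for s in re.findall(r"[^.!?]+[.!?](?:\[\d+\])*", text)]


def stem(word):
    # Crude suffix stripping, used here only as a stand-in for a real stemmer.
    for suffix in ("ing", "es", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


def tokens(sentence):
    # Drop citation markers, lowercase, keep alphabetic words, then stem.
    clean = CITATION.sub(" ", sentence.lower())
    return [stem(w) for w in re.findall(r"[a-z]+", clean)]


def summarize(text, percent=30, w_tf=1.0, w_cite=1.0):
    """Return the top `percent` of sentences, kept in their original order."""
    sentences = split_sentences(text)
    if not sentences:
        return []
    freq = Counter(t for s in sentences for t in tokens(s))
    max_freq = max(freq.values()) if freq else 1
    scored = []
    for i, s in enumerate(sentences):
        toks = tokens(s)
        # Term-frequency feature: mean document frequency of the stems, scaled to [0, 1].
        tf = sum(freq[t] for t in toks) / (len(toks) * max_freq) if toks else 0.0
        # Citation feature: number of citation markers attached to the sentence.
        cite = len(CITATION.findall(s))
        # Assumption: a weighted sum stands in for the paper's neural network.
        scored.append((w_tf * tf + w_cite * cite, i))
    keep = max(1, math.ceil(len(sentences) * percent / 100))
    top = sorted(scored, key=lambda p: (-p[0], p[1]))[:keep]
    return [sentences[i] for _, i in sorted(top, key=lambda p: p[1])]
```

Calling summarize(article_text, percent=50) on a plain-text article would return roughly half of its sentences, favouring those that carry citation markers. The percent argument mirrors the user-specified summary length mentioned in the abstract.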

Keywords:
Automatic summarization; Computer science; Text graph; Information retrieval; Sentence; Natural language processing; Multi-document summarization; Artificial intelligence; Text mining

Metrics

Cited By: 37
FWCI (Field Weighted Citation Impact): 3.77
Refs: 12
Citation Normalized Percentile: 0.96

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Wikis in Education and Collaboration
Social Sciences →  Social Sciences →  Communication

Related Documents

JOURNAL ARTICLE

Text summarization using Wikipedia

Yogesh Sankarasubramaniam, Krishnan Ramanathan, Subhankar Ghosh

Journal: Information Processing & Management Year: 2014 Vol: 50 (3) Pages: 443-461

JOURNAL ARTICLE

A new graph based text segmentation using Wikipedia for automatic text summarization

Mohsen Pourvali, Ph.D. Mohammad

Journal: International Journal of Advanced Computer Science and Applications Year: 2012 Vol: 3 (1)