Priority Queue-based Extractive Text Summarizer

V. Sherlin Solomi; Ch. Keertana Sarvani; N. Supriya

doi:10.1109/icspc57692.2023.10125761

ScienceGate Book Chapters

JOURNAL ARTICLE

Priority Queue-based Extractive Text Summarizer

V. Sherlin Solomi Ch. Keertana Sarvani N. Supriya

Year: 2023 Vol: 48 Pages: 292-296

DOI: 10.1109/icspc57692.2023.10125761

Get Full-Text PDF Get Analytical Report

Abstract

Text summarization simply means creating a summary from a text document while retaining its main ideas and points of contention. Text summarization is completely aimed to generate a coherent and precise synopsis of the text verbal document. Generating a coherent summary for a large text is a time-consuming task, this paper proposes a novel generic text summarizer for the English language that can accept a maximum input of 160-170 words and generate a summary of 60-80 words, which retains the original context of the input text. This model utilizes a heap queue algorithm for text summarization. The heap queue helps in preserving the phrases from an input text, by skimming the top-scoring sentences, making it easier to be extracted in terms of importance. The input text is tokenized aptly, with all the stop words removed. Further word frequency is calculated, which is used to calculate sentence score, the words are joined together to form a coherent sentence, a summary that uses the summary’s highest-scoring sentences. The model is tested using various scoring methods available and has obtained an accuracy of 86 percent. It is also observed that the cosine similarity for the model-generated output and manual reference summary is 0.86. The proposed model is a generic text summarizer that can be used for any type of data summarization, irrespective of its domain.

Keywords:

Automatic summarization Computer science Sentence Natural language processing Information retrieval Cosine similarity Artificial intelligence Word (group theory) Context (archaeology) Task (project management) Linguistics Pattern recognition (psychology)

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.05

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Priority Queue-based Extractive Text Summarizer

Abstract

Metrics

Topics

Related Documents

EUTS: Extractive Urdu Text Summarizer

An Extractive Text Summarizer Based on Significant Words

Marathi Extractive Text Summarizer Using Graph Based Model

A hybrid PSO model in Extractive Text Summarizer

HINDI TEXT SUMMARIZER USING ABSTRACTIVE AND EXTRACTIVE TECHNIQUE