Automated Kannada Text Summarization using Sentence Features

Arpitha Swamy; S Srinath

doi:10.35940/ijrte.b1531.078219

ScienceGate Book Chapters

JOURNAL ARTICLE

Automated Kannada Text Summarization using Sentence Features

Arpitha Swamy S Srinath

Year: 2019 Journal: International Journal of Recent Technology and Engineering (IJRTE) Vol: 8 (2)Pages: 470-474

DOI: 10.35940/ijrte.b1531.078219

Get Full-Text PDF Get Analytical Report

Abstract

There is a growing requirement for the text summarization due to the difficulty of managing exponential increase of information accessible on the World Wide Web. Text summarization is a process to extract the contents in the original text to the shorter form which provides important information to the user. The summarizer presented in this paper produces the extractive summaries of Kannada text documents. The proposed summarizer system considers five features to determine the important sentences in the document. The features used are Term Frequency, Term Frequency-Inverse Sentence Frequency, Keywords feature, Sentence length and Sentence position. The value of each feature is computed and score for each sentence in the document is the average of all the feature score values. The sentences with the top scores are selected to be included in the extractive summary. The results of the proposed model are evaluated using ROUGE toolkit to measure the performance based on F-score of generated summary. Experimental studies on custom-built dataset with 50 Kannada text documents shows significantly better performance in producing extractive summaries as compared to human summaries.

Keywords:

Automatic summarization Computer science Sentence Natural language processing Feature (linguistics) Kannada Artificial intelligence Information retrieval Text graph Term (time) Multi-document summarization tf–idf Linguistics

Metrics

Cited By

0.31

FWCI (Field Weighted Citation Impact)

Refs

0.69

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Text Analysis Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Natural Language Processing Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Automated Kannada Text Summarization using Sentence Features

Abstract

Metrics

Citation History

Topics

Related Documents

Automated Text Summarization: Sentence Refinement Approach

Kannada Text Summarization

Kannada Text Summarization using Extractive Technique

Sentence Features Fusion for Text Summarization Using Fuzzy Logic

Categorized Text Document Summarization in the Kannada Language by sentence ranking