JOURNAL ARTICLE

An extractive text summarization technique for Bengali document(s) using K-means clustering algorithm

Abstract

Text summarization, a field of data mining, is very important for developing various real-life applications. Many techniques have been developed for summarizing English text(s). But, a few attempts have been made for Bengali text because of its some multifaceted structure. This paper presents a method for text summarization which extracts important sentences from a single or multiple Bengali documents. The input document(s) should be pre-processed by tokenization, stemming operation etc. Then, word score is calculated by Term-Frequency/Inverse Document Frequency (TF/IDF) and sentence score is determined by summing up its constituent words' scores with its position. Cue and skeleton words have also been considered to calculate the sentence score. For single or multiple documents, K-means clustering algorithm has been applied to produce the final summary. The experimental result shows satisfactory outputs in comparison to the existing approaches possessing linear run time complexity.

Keywords:
Bengali Automatic summarization Computer science tf–idf Lexical analysis Cluster analysis Natural language processing Sentence Word (group theory) Artificial intelligence Multi-document summarization Information retrieval Term (time) Field (mathematics) Mathematics

Metrics

55
Cited By
4.13
FWCI (Field Weighted Citation Impact)
13
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Extractive Text Summarization Technique Using Fuzzy C-Means Clustering Algorithm

Shafiul GaniJia UddinIftekharul Mobin

Journal:   2019 International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2) Year: 2019 Pages: 1-6
JOURNAL ARTICLE

Automatic Extractive Text Summarization Using K-Means Clustering

M R PrathimaH R Divakar

Journal:   International Journal of Computer Sciences and Engineering Year: 2018 Vol: 6 (6)Pages: 782-787
JOURNAL ARTICLE

Automatic extractive text summarization using K-means clustering

Krithi ShettyJagadish S. Kallimani

Journal:   2017 International Conference on Electrical, Electronics, Communication, Computer, and Optimization Techniques (ICEECCOT) Year: 2017 Pages: 1-9
JOURNAL ARTICLE

An Improved Extractive Summarization Technique for Bengali Text(s)

P. B. TumpaShamima YeasminAdiba Mahjabin NituMd Palash UddinMasud Ibn AfjalMd Abdulla Al Mamun

Journal:   2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2) Year: 2018 Pages: 1-4
BOOK-CHAPTER

Extractive Text Summarization on Large-scale Dataset Using K-Means Clustering

Ti-Hon NguyenThanh‐Nghi Do

Lecture notes in computer science Year: 2022 Pages: 737-746
© 2026 ScienceGate Book Chapters — All rights reserved.