JOURNAL ARTICLE

Improved Meta-Heuristic Model for Text Document Clustering by Adaptive Weighted Similarity

Gugulothu Venkanna, K. F. Bharati

Year: 2023 Journal: International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems Vol: 31 (05) Pages: 749-771 Publisher: World Scientific

Abstract

This paper develops a novel framework for text document clustering based on an improved meta-heuristic algorithm. First, features are selected from the text documents by computing the Term Frequency-Inverse Document Frequency (TF-IDF) of each word. Next, the centroids, which play a vital role in cluster formation, are selected using an improved Lion Algorithm (LA) termed the Crossover Probability-based LA model (CP-LA). As a novelty, the paper introduces a new inter-cluster and intra-cluster similarity model. The centroids are selected so that the proposed adaptive weighted similarity is minimal, and the weights are adapted automatically to the similarity measure based on the characteristics of the documents. The proposed adaptive weighted similarity function involves the inter-cluster and intra-cluster similarity of both ordered and unordered documents. Finally, the proposed model is shown to outperform other models under different performance measures.
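The TF-IDF feature step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not state which TF-IDF variant is used, so the code assumes tf = term count / document length and idf = ln(N / df), with whitespace tokenization. The function names `tf_idf` and `cosine_similarity` are hypothetical, and cosine similarity is included only as one common way to compare the resulting vectors.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Assumption: tf = count / document length, idf = ln(N / df).
    Other weighting variants exist; the paper may use a different one.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # An all-zero vector has no direction to compare.
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    docs = ["lion hunts prey", "lion sleeps", "cluster text documents"]
    vectors = tf_idf(docs)
    print(cosine_similarity(vectors[0], vectors[1]))
    print(cosine_similarity(vectors[0], vectors[2]))
```

Under these assumptions, a term that occurs in every document receives weight zero, and documents with no shared terms have similarity zero. A clustering objective such as the one described in the abstract would be built on top of pairwise similarities of this kind.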

Keywords:
Similarity (geometry), Centroid, tf–idf, Cluster analysis, Computer science, Document clustering, Heuristic, Similarity measure, Artificial intelligence, Selection (genetic algorithm), Data mining, Term (time), Pattern recognition (psychology)

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.26
Refs: 36
Citation Normalized Percentile: 0.58

Topics

Text and Document Classification Technologies
Physical Sciences → Computer Science → Artificial Intelligence
Advanced Clustering Algorithms Research
Physical Sciences → Computer Science → Artificial Intelligence
Face and Expression Recognition
Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

A Text Document Clustering Method Based on Weighted BERT Model

Yutong Li, Juanjuan Cai, Jingling Wang

Journal: 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC) Year: 2020

JOURNAL ARTICLE

Design And Analysis Of Text Document Clustering Using Meta-Heuristic Distance Based Methods

R. Kumaresan

Journal: Journal of Advanced Research in Dynamic and Control Systems Year: 2020 Vol: 12 (6) Pages: 2262-2269

JOURNAL ARTICLE

Efficient text document clustering with new similarity measures

R. Lakshmi, S. Baskar

Journal: International Journal of Business Intelligence and Data Mining Year: 2020 Vol: 18 (1) Pages: 49-49