JOURNAL ARTICLE

Research on text clustering algorithm based on improved K-means

Abstract

Text clustering is an active and difficult research area in Internet search engine research. This paper presents a new text clustering algorithm that retains the advantages of K-means clustering while overcoming its disadvantages. First, the texts are preprocessed to prepare them for the subsequent steps. The paper then analyzes the standard K-means clustering algorithm and improves it by revising its cluster seed selection method, addressing the low stability of K-means, which is very sensitive to the initial cluster centers and to isolated (outlier) texts. The experimental results indicate that the improved algorithm achieves higher accuracy and better stability than the original algorithm.
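
The abstract does not spell out how the seed selection is revised, so the following is only a minimal sketch of one common way to reduce sensitivity to initial centers and isolated texts, not the paper's method. It assumes raw term counts with cosine similarity, drops candidate seeds whose average similarity to the other texts is low, and then picks seeds farthest-first. The names `pick_seeds`, `min_avg_sim`, and `kmeans_texts` are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def pick_seeds(vecs, k, min_avg_sim=0.05):
    """Choose k seed indices (assumed scheme, not taken from the paper)."""
    n = len(vecs)
    # Average similarity to every other text; low values mark isolated texts.
    avg = [
        sum(cosine(vecs[i], vecs[j]) for j in range(n) if j != i) / max(n - 1, 1)
        for i in range(n)
    ]
    candidates = [i for i in range(n) if avg[i] >= min_avg_sim] or list(range(n))
    if k > len(candidates):
        raise ValueError("k exceeds the number of non-isolated texts")
    # Start from the text that is most similar to the rest on average.
    seeds = [max(candidates, key=lambda i: avg[i])]
    while len(seeds) < k:
        # Farthest-first: take the candidate least similar to its nearest seed.
        nxt = min(
            (i for i in candidates if i not in seeds),
            key=lambda i: max(cosine(vecs[i], vecs[s]) for s in seeds),
        )
        seeds.append(nxt)
    return seeds


def mean_vector(members):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = Counter()
    for v in members:
        total.update(v)  # Counter.update adds weights term by term
    return {t: w / len(members) for t, w in total.items()}


def kmeans_texts(texts, k, iters=20):
    """Plain K-means on term-count vectors, started from pick_seeds."""
    vecs = [Counter(t.lower().split()) for t in texts]
    centers = [dict(vecs[i]) for i in pick_seeds(vecs, k)]
    labels = None
    for _ in range(iters):
        new_labels = [
            max(range(k), key=lambda c: cosine(v, centers[c])) for v in vecs
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [v for v, lab in zip(vecs, labels) if lab == c]
            if members:
                centers[c] = mean_vector(members)
    return labels
```

In this sketch an isolated text is only excluded from seeding; it is still assigned to a cluster afterwards. Whether the paper removes, reassigns, or down-weights such texts is not stated in the abstract.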

Keywords:
Cluster analysis; Computer science; Stability (learning theory); CURE data clustering algorithm; Canopy clustering algorithm; Affinity propagation; Algorithm; Cluster (spacecraft); Data mining; Process (computing); Correlation clustering; Point (geometry); Selection (genetic algorithm); Data stream clustering; The Internet; Artificial intelligence; Machine learning; Mathematics

Metrics

Cited By: 4
FWCI (Field Weighted Citation Impact): 0.00
Refs: 9
Citation Normalized Percentile: 0.21

Topics

Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Research and Application of Improved K-means Algorithm in Text Clustering

Shen-yi QIAN, Huihui Liu, Dai-yi LI

Journal: DEStech Transactions on Computer Science and Engineering, Year: 2018

JOURNAL ARTICLE

Improved K-Means Algorithm in Text Semantic Clustering

Ma Junhong

Journal: The Open Cybernetics & Systemics Journal, Year: 2014, Vol: 8 (1), Pages: 530-534

JOURNAL ARTICLE

Research on Improved K-Means Clustering Algorithm

Yin Sheng Zhang, Hui Lin Shan, Jia Qiang Li, Jie Zhou

Journal: Advanced Materials Research, Year: 2011, Vol: 403-408, Pages: 1977-1980