Research on text clustering algorithm based on improved K-means

Xinwu Li

doi:10.1109/iccda.2010.5540727

JOURNAL ARTICLE

Year: 2010 Vol: 22 Pages: V4-573

Abstract

Text clustering is a difficult and active research topic in internet search engine research. This paper presents a new text clustering algorithm that keeps the advantages of K-means clustering while overcoming its disadvantages. First, the texts are preprocessed to prepare them for the subsequent steps. The paper then analyzes the standard K-means clustering algorithm, improves its underlying principle, and corrects its cluster seed selection method to overcome the low stability of K-means, which is very sensitive to the initial cluster centers and to isolated (outlier) texts. Experimental results indicate that the improved algorithm achieves higher accuracy and better stability than the original algorithm.
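The abstract does not specify how the seed selection is corrected, so the following is only a minimal sketch of the general idea, under stated assumptions: texts are preprocessed into TF-IDF vectors, the least dense documents are treated as isolated points and excluded from seeding, and the remaining seeds are chosen deterministically by a farthest-first rule. The names `tfidf_vectors`, `select_seeds`, `kmeans`, and the `outlier_fraction` parameter are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Preprocess texts: whitespace tokens -> L2-normalized TF-IDF dicts."""
    tokenized = [doc.lower().split() for doc in docs]
    n = len(tokenized)
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        # Smoothed IDF keeps weights positive even for very common terms.
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({t: w / norm for t, w in vec.items()})
    return vectors


def cosine_distance(a, b):
    """Cosine distance between two unit-length sparse vectors."""
    if len(a) > len(b):
        a, b = b, a
    return 1.0 - sum(w * b.get(t, 0.0) for t, w in a.items())


def select_seeds(vectors, k, outlier_fraction=0.1):
    """Assumed seeding rule: skip isolated points, then farthest-first."""
    n = len(vectors)
    # Density proxy: mean cosine similarity to every other document.
    density = [
        sum(1.0 - cosine_distance(vectors[i], vectors[j]) for j in range(n) if j != i)
        / max(n - 1, 1)
        for i in range(n)
    ]
    ranked = sorted(range(n), key=lambda i: density[i], reverse=True)
    # The least dense documents are treated as isolated and never used as seeds.
    keep = max(k, n - int(outlier_fraction * n))
    candidates = ranked[:keep]
    seeds = [candidates[0]]  # start from the densest document
    while len(seeds) < k:
        nxt = max(
            (i for i in candidates if i not in seeds),
            key=lambda i: min(cosine_distance(vectors[i], vectors[s]) for s in seeds),
        )
        seeds.append(nxt)
    return seeds


def kmeans(vectors, k, max_iter=50):
    """K-means on unit vectors with cosine distance and deterministic seeds."""
    centers = [dict(vectors[i]) for i in select_seeds(vectors, k)]
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda c: cosine_distance(v, centers[c])) for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if not members:
                continue  # keep the previous center for an empty cluster
            total = Counter()
            for v in members:
                for t, w in v.items():
                    total[t] += w
            norm = math.sqrt(sum(w * w for w in total.values())) or 1.0
            centers[c] = {t: w / norm for t, w in total.items()}
    return labels


docs = [
    "cat dog pet", "dog cat animal pet", "cat pet animal",
    "stock market trading", "market stock price", "trading price stock",
]
labels = kmeans(tfidf_vectors(docs), k=2)
```

Because the seeds are chosen deterministically rather than at random, repeated runs on the same corpus give the same clustering, which is one plausible way to address the instability the abstract attributes to standard K-means.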

Keywords:

Cluster analysis; Computer science; Stability (learning theory); CURE data clustering algorithm; Canopy clustering algorithm; Affinity propagation; Algorithm; Cluster (spacecraft); Data mining; Process (computing); Correlation clustering; Point (geometry); Selection (genetic algorithm); Data stream clustering; The Internet; Artificial intelligence; Machine learning; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.21

Topics

Data Management and Algorithms

Physical Sciences → Computer Science → Signal Processing

Data Mining Algorithms and Applications

Physical Sciences → Computer Science → Information Systems

Advanced Clustering Algorithms Research

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Research on k-means Clustering Algorithm: An Improved k-means Clustering Algorithm

Research and Application of Improved K-means Algorithm in Text Clustering

Improved K-Means Algorithm in Text Semantic Clustering

Research on Improved K-Means Clustering Algorithm

Research on K-means Text Clustering Algorithm Based on Semantic