Research on K-means Text Clustering Algorithm Based on Semantic

Yufang Liu; Shibin Xiao; Xueqiang Lv; Shuicai Shi

doi:10.1109/ccie.2010.39

JOURNAL ARTICLE

Year: 2010 Vol: 2005 Pages: 124-127

Abstract

Building on research into the K-means algorithm for text clustering and the semantic-based vector space model, this paper proposes a semantic-based K-means text clustering model to address the high dimensionality and sparsity of text data sets. The model reduces the semantic loss of the text data and improves the quality of text clustering. Experiments show that semantic-based text clustering improves the final F1 value by more than 6 percent over non-semantic-based clustering.
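The abstract does not describe the semantic model itself, so the following is only a minimal sketch of the non-semantic baseline it is compared against: K-means over TF-IDF vectors in a plain vector space model. The function names (`tfidf_vectors`, `kmeans`), the smoothed IDF formula, the toy documents, and the fixed initial centroids are all assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Turn tokenized documents into L2-normalized dense TF-IDF vectors."""
    vocab = sorted({term for doc in docs for term in doc})
    n = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumed weighting, not the paper's).
    idf = {t: math.log((1 + n) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vec = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(w * w for w in vec)) or 1.0
        vectors.append([w / norm for w in vec])
    return vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, init_indices, max_iter=100):
    """Standard K-means; initial centroids are the vectors at init_indices."""
    centroids = [list(vectors[i]) for i in init_indices]
    k = len(centroids)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each document joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Toy corpus (hypothetical): two finance and two sports documents.
docs = [
    "stock market shares trading".split(),
    "market trading stock prices".split(),
    "football match goal team".split(),
    "team goal football league".split(),
]
labels = kmeans(tfidf_vectors(docs), init_indices=(0, 2))
print(labels)  # [0, 0, 1, 1]
```

Even on four short documents the vocabulary has ten dimensions and most entries are zero, which illustrates the high-dimensional, sparse representation the abstract refers to. Documents that express the same topic with different words share no terms in this representation, which is the kind of semantic loss a semantic-based vector space model aims to reduce.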

Keywords:

Cluster analysis; Computer science; Semantic data model; Semantic similarity; Semantic computing; Set (abstract data type); Correlation clustering; Vector space model; Artificial intelligence; Document clustering; Data mining; Natural language processing; Information retrieval; Semantic Web

Topics

Advanced Clustering Algorithms Research

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Computational Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Data Mining Algorithms and Applications

Physical Sciences → Computer Science → Information Systems

Related Documents

Improved K-Means Algorithm in Text Semantic Clustering

Research on text clustering algorithm based on improved K-means

Weighted k-Means Algorithm Based Text Clustering

Chinese Text Clustering Algorithm Based k-means

Research on k-means Clustering Algorithm: An Improved k-means Clustering Algorithm