JOURNAL ARTICLE

Research on K-means Text Clustering Algorithm Based on Semantic

Abstract

Through research on K-means algorithm of text clustering and semantic-based vector space model, a semantic-based K-means text clustering model is proposed to solve the problem on high-dimensional and sparse characteristics of text data set. The model reduces the semantic loss of the text data and improves the quality of text clustering. Experiments prove that semantic-based text clustering increases by more 6 percent than non-semantic-based one in the final evaluation of the F1 index value.

Keywords:
Cluster analysis Computer science Semantic data model Semantic similarity Semantic computing Set (abstract data type) Correlation clustering Vector space model Artificial intelligence Document clustering Data mining Natural language processing Information retrieval Semantic Web

Metrics

6
Cited By
0.80
FWCI (Field Weighted Citation Impact)
6
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Computational Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Improved K-Means Algorithm in Text Semantic Clustering

Ma Junhong

Journal:   The Open Cybernetics & Systemics Journal Year: 2014 Vol: 8 (1)Pages: 530-534
JOURNAL ARTICLE

Research on text clustering algorithm based on improved K-means

Xinwu Li

Year: 2010 Vol: 22 Pages: V4-573
JOURNAL ARTICLE

Chinese Text Clustering Algorithm Based k-means

Mingyu YaoDechang PiXiangxiang Cong

Journal:   Physics Procedia Year: 2012 Vol: 33 Pages: 301-307
© 2026 ScienceGate Book Chapters — All rights reserved.