JOURNAL ARTICLE

An improved k-means clustering algorithm based on dissimilarity

Abstract

K-means clustering algorithm is one of the most widely used clustering algorithms and has been applied in many fields of science and technology. A major problem of the original k-means clustering algorithm is that the cluster results depend on the initial centroids which choose at random. At the same time, the similarity measure on the algorithm based on distance is not suitable for big high- dimensional dataset. They all lead to severe degradation in performance. In this paper, an improved k-means clustering algorithm based on dissimilarity is proposed. It selects the initial centriods using the Huffman tree which uses dissimilarity matrix to construct. Many experiments confirm that the proposed algorithm is an efficient algorithm with better clustering accuracy on the same algorithm time complexity.

Keywords:
Cluster analysis CURE data clustering algorithm Canopy clustering algorithm Computer science Single-linkage clustering Correlation clustering Fuzzy clustering Algorithm Data stream clustering Huffman coding Data mining k-medoids Similarity (geometry) k-medians clustering Pattern recognition (psychology) Artificial intelligence Data compression

Metrics

17
Cited By
1.89
FWCI (Field Weighted Citation Impact)
16
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics

Related Documents

JOURNAL ARTICLE

Improved K-means Clustering Algorithm based on Dynamic Clustering

Jennefer Mononteliza

Journal:   Asia-pacific Journal of Convergent Research Interchange Year: 2020 Vol: 6 (4)Pages: 1-12
JOURNAL ARTICLE

Improved K-Means Clustering Algorithm Based on Dynamic Clustering

Liguo Zheng

Journal:   International Journal of Advanced Research in Big Data Management System Year: 2020 Vol: 4 (1)Pages: 17-26
BOOK-CHAPTER

Fuzzy Dissimilarity Measure Based K-Means Clustering

Guanli YueYanpeng QuAnsheng Deng

Lecture notes on data engineering and communications technologies Year: 2021 Pages: 1792-1799
© 2026 ScienceGate Book Chapters — All rights reserved.