JOURNAL ARTICLE

Fast K-Means Algorithm Clustering

Raied SalmanVojislav KecmanQi LiRobert StrackErik Test

Year: 2011 Journal:   International journal of Computer Networks & Communications Vol: 3 (4)Pages: 17-31

Abstract

k-means has recently been recognized as one of the best algorithms for\nclustering unsupervised data. Since k-means depends mainly on distance\ncalculation between all data points and the centers, the time cost will be high\nwhen the size of the dataset is large (for example more than 500millions of\npoints). We propose a two stage algorithm to reduce the time cost of distance\ncalculation for huge datasets. The first stage is a fast distance calculation\nusing only a small portion of the data to produce the best possible location of\nthe centers. The second stage is a slow distance calculation in which the\ninitial centers used are taken from the first stage. The fast and slow stages\nrepresent the speed of the movement of the centers. In the slow stage, the\nwhole dataset can be used to get the exact location of the centers. The time\ncost of the distance calculation for the fast stage is very low due to the\nsmall size of the training data chosen. The time cost of the distance\ncalculation for the slow stage is also minimized due to small number of\niterations. Different initial locations of the clusters have been used during\nthe test of the proposed algorithms. For large datasets, experiments show that\nthe 2-stage clustering method achieves better speed-up (1-9 times).\n

Keywords:
Cluster analysis Computer science Stage (stratigraphy) Algorithm Data point Data mining Artificial intelligence

Metrics

16
Cited By
0.39
FWCI (Field Weighted Citation Impact)
12
Refs
0.71
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

A FAST K-MEANS TYPE CLUSTERING ALGORITHM

Xiaolin WuIan H. Witten

Journal:   PRISM (University of Calgary) Year: 1985
JOURNAL ARTICLE

Bilateral k-Means Algorithm for Fast Co-Clustering

Junwei HanKun SongFeiping NieXuelong Li

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2017 Vol: 31 (1)
JOURNAL ARTICLE

Fast K-Means Clustering Algorithm using Prediction Data

Tae-Chang JeeHyun-Jin LeeYill-Byung Lee

Journal:   The Journal of the Korea Contents Association Year: 2009 Vol: 9 (1)Pages: 106-114
JOURNAL ARTICLE

A Highly Efficient Fast Global K-Means Clustering Algorithm

Liang XianFuheng QuYong YangHua Cai

Journal:   Advances in engineering research/Advances in Engineering Research Year: 2015
© 2026 ScienceGate Book Chapters — All rights reserved.