JOURNAL ARTICLE

DETERMINISTIC INITIALIZATION OF THE K-MEANS ALGORITHM USING HIERARCHICAL CLUSTERING

M. EMRE CELEBIHASSAN A. KINGRAVI

Year: 2012 Journal:   International Journal of Pattern Recognition and Artificial Intelligence Vol: 26 (07)Pages: 1250018-1250018   Publisher: World Scientific

Abstract

K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of the cluster centers. Numerous initialization methods have been proposed to address this problem. Many of these methods, however, have superlinear complexity in the number of data points, making them impractical for large data sets. On the other hand, linear methods are often random and/or order-sensitive, which renders their results unrepeatable. Recently, Su and Dy proposed two highly successful hierarchical initialization methods named Var-Part and PCA-Part that are not only linear, but also deterministic (nonrandom) and order-invariant. In this paper, we propose a discriminant analysis based approach that addresses a common deficiency of these two methods. Experiments on a large and diverse collection of data sets from the UCI machine learning repository demonstrate that Var-Part and PCA-Part are highly competitive with one of the best random initialization methods to date, i.e. k-means++, and that the proposed approach significantly improves the performance of both hierarchical methods.

Keywords:

Metrics

51
Cited By
4.55
FWCI (Field Weighted Citation Impact)
39
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Hierarchical initialization approach for K-Means clustering

Jiewei LuJiaowei TangZhenmin TangJingyu Yang

Journal:   Pattern Recognition Letters Year: 2008 Vol: 29 (6)Pages: 787-795
JOURNAL ARTICLE

Deterministic Initialization of $k$-means Clustering by Data Distribution Guide

Chaloemphon SirikayonArit Thammano

Journal:   2022 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT & NCON) Year: 2022 Pages: 279-284
JOURNAL ARTICLE

Cluster center initialization algorithm for K-means clustering

Shehroz S. KhanAmir Ahmad

Journal:   Pattern Recognition Letters Year: 2004 Vol: 25 (11)Pages: 1293-1302
© 2026 ScienceGate Book Chapters — All rights reserved.