BOOK-CHAPTER

Stability of K-Means Clustering

Abstract

We phrase K-means clustering as an empirical risk minimization procedure over a class ℋ_K and explicitly calculate the covering number for this class. Next, we show that stability of K-means clustering is characterized by the geometry of ℋ_K with respect to the underlying distribution. We prove that in the case of a unique global minimizer, the clustering solution is stable with respect to complete changes of the data, while for the case of multiple minimizers, the change of Ω(n^{1/2}) samples defines the transition between stability and instability. While for a finite number of minimizers this result follows from multinomial distribution estimates, the case of infinite minimizers requires more refined tools. We conclude by proving that stability of the functions in ℋ_K implies stability of the actual centers of the clusters. Since stability is often used for selecting the number of clusters in practice, we hope that our analysis serves as a starting point for finding theoretically grounded recipes for the choice of K.
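As a rough illustration of the empirical-risk view described in the abstract, the following Python sketch computes the K-means empirical risk (mean squared distance to the nearest center) and compares the centers fitted on two independent samples. It is not the chapter's construction: the one-dimensional two-Gaussian data, the use of Lloyd iterations as a stand-in for exact risk minimization, and the names `kmeans_risk`, `lloyd`, `center_distance`, and `sample` are all assumptions made for this example.

```python
from itertools import permutations

import numpy as np


def kmeans_risk(X, centers):
    """Empirical K-means risk: mean squared distance to the nearest center."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.min(axis=1).mean()


def lloyd(X, init_centers, n_iter=50):
    """Lloyd iterations from fixed initial centers (a heuristic, not exact ERM)."""
    centers = np.array(init_centers, dtype=float)
    for _ in range(n_iter):
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for k in range(len(centers)):
            members = X[labels == k]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[k] = members.mean(axis=0)
    return centers


def center_distance(A, B):
    """Largest center displacement, minimized over matchings of the centers."""
    return min(
        max(np.linalg.norm(A[i] - B[p[i]]) for i in range(len(A)))
        for p in permutations(range(len(B)))
    )


# Assumed toy distribution: two well-separated Gaussians on the real line,
# for which the K=2 risk minimizer is expected to be unique.
rng = np.random.default_rng(0)


def sample(n):
    half = n // 2
    return np.vstack([
        rng.normal(-3.0, 1.0, size=(half, 1)),
        rng.normal(3.0, 1.0, size=(n - half, 1)),
    ])


n = 2000
X1 = sample(n)
X2 = sample(n)  # a completely fresh sample ("complete change of the data")

init = np.array([[-1.0], [1.0]])
C1 = lloyd(X1, init)
C2 = lloyd(X2, init)

risk_init = kmeans_risk(X1, init)
risk_fit = kmeans_risk(X1, C1)
shift = center_distance(C1, C2)

print("risk at initial centers:", risk_init)
print("risk at fitted centers: ", risk_fit)
print("center shift between the two samples:", shift)
```

In this toy setting the fitted centers should move only slightly between the two samples, which is the kind of behavior the unique-minimizer case describes. A distribution with several risk minimizers (for example, one symmetric under rotation) would be the natural place to look for instability.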

Keywords:
Cluster analysis; Stability (learning theory); Computer science; Artificial intelligence; Machine learning

Metrics

Cited By: 95
FWCI (Field Weighted Citation Impact): 4.10
Refs: 11
Citation Normalized Percentile: 0.95

Topics

Bayesian Methods and Mixture Models
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability

Related Documents

BOOK-CHAPTER

Stability of k-Means Clustering

Shai Ben-David, Dávid Pál, Hans Ulrich Simon

Lecture notes in computer science Year: 2007 Pages: 20-34
JOURNAL ARTICLE

Clustering stability-based Evolutionary K-Means

Zhenfeng He, Chunyan Yu

Journal: Soft Computing Year: 2018 Vol: 23 (1) Pages: 305-321
JOURNAL ARTICLE

Stability and model selection in k-means clustering

Ohad Shamir, Naftali Tishby

Journal: Machine Learning Year: 2010 Vol: 80 (2-3) Pages: 213-243
JOURNAL ARTICLE

Stability analysis in K-means clustering

Douglas Steinley

Journal: British Journal of Mathematical and Statistical Psychology Year: 2007 Vol: 61 (2) Pages: 255-273
JOURNAL ARTICLE

A notion of stability for k-means clustering

Thibaut Le Gouic, Quentin Paris

Journal: arXiv (Cornell University) Year: 2018