JOURNAL ARTICLE

Incomplete clustering analysis via multiple imputation

Jung Wun LeeOfer Harel

Year: 2022 Journal:   Journal of Applied Statistics Vol: 50 (9)Pages: 1962-1979   Publisher: Taylor & Francis

Abstract

Clustering analysis is a prevalent statistical method which divides populations into several subgroups of similar units. However, most existing clustering methods require complete data. One general method that addresses incomplete data is multiple imputation (MI) which avoids many limitations found in other single imputation-based methods and complete case analyses. Nevertheless, adopting MI framework to clustering analysis can be challenging since each imputed data might consist of a different number of clusters and there is not a unique parameter for clustering analysis. In response to this problem, we have developed MICA: Multiply Imputed Cluster Analysis. MICA is a framework for clustering incomplete data consisting of two clustering stages. We assess the properties of MICA and its superiority over other existing incomplete clustering strategies based on a simulation study under various data structures. In addition, we demonstrate the usage of MICA by applying it to the Youth Risk Behavior Surveillance System (YRBSS) 2019 data.

Keywords:
Cluster analysis Imputation (statistics) Data mining Computer science Consensus clustering Missing data Fuzzy clustering Statistics CURE data clustering algorithm Mathematics Machine learning

Metrics

5
Cited By
0.98
FWCI (Field Weighted Citation Impact)
23
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Bayesian Methods and Mixture Models
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Clustering Algorithms Research
Physical Sciences →  Computer Science →  Artificial Intelligence
Data-Driven Disease Surveillance
Health Sciences →  Medicine →  Epidemiology

Related Documents

JOURNAL ARTICLE

Prototype Imputation Guided Incomplete Multi-View Clustering

Rui HongXiaoping ChenYan ZhouHui LiuTiancai Wan

Journal:   IEEE Signal Processing Letters Year: 2025 Vol: 32 Pages: 1565-1569
JOURNAL ARTICLE

Analysis of incomplete longitudinal binary data using multiple imputation

Xiaoming LiDevan V. MehrotraJohn Barnard

Journal:   Statistics in Medicine Year: 2005 Vol: 25 (12)Pages: 2107-2124
JOURNAL ARTICLE

Multiple imputation for the analysis of incomplete compound variables

Jiwei ZhaoRichard J. CookChangbao Wu

Journal:   Canadian Journal of Statistics Year: 2015 Vol: 43 (2)Pages: 240-264
© 2026 ScienceGate Book Chapters — All rights reserved.