JOURNAL ARTICLE

Missing categorical data imputation approach based on similarity

Abstract

Imputation for missing data is an important task of data mining, which may influence the data mining result. In this paper, Missing Categorical Data Imputation Based on Similarity (MIBOS) is proposed to solve this problem. The algorithm defines a similarity model between objects with incomplete data, constructing the similarity matrix of objects and further gets the nearest undifferentiated object sets of each object to impute the missing data iteratively. In the imputing process, the imputed value will be directly applied to the same iteration and the following iterations. Experiments with three UCI benchmark data sets show the improvement of the proposed algorithm from perspectives of complete rate, accuracy and time efficiency.

Keywords:
Imputation (statistics) Categorical variable Missing data Data mining Computer science Similarity (geometry) Data modeling Benchmark (surveying) Artificial intelligence Pattern recognition (psychology) Machine learning

Metrics

17
Cited By
1.00
FWCI (Field Weighted Citation Impact)
15
Refs
0.77
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Data Management and Algorithms
Physical Sciences →  Computer Science →  Signal Processing
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Latent class based multiple imputation approach for missing categorical data

Mulugeta GebregziabherStacia M. DeSantis

Journal:   Journal of Statistical Planning and Inference Year: 2010 Vol: 140 (11)Pages: 3252-3262
JOURNAL ARTICLE

Categorical missing data imputation approach via sparse representation

Xiaochen ShaoSen WuXiaodong FengRui Song

Journal:   International Journal of Services Technology and Management Year: 2016 Vol: 22 (3/4/5)Pages: 256-256
JOURNAL ARTICLE

A nonparametric multiple imputation approach for missing categorical data

Muhan ZhouYulei HeMandi YuChiu-Hsieh Hsu

Journal:   BMC Medical Research Methodology Year: 2017 Vol: 17 (1)Pages: 87-87
JOURNAL ARTICLE

Missing Data Imputation for Categorical Variables

Jaroslav HorníčekHana Řezanková

Journal:   Statistika Statistics and Economy Journal Year: 2022 Vol: 102 (3)Pages: 249-260
© 2026 ScienceGate Book Chapters — All rights reserved.