JOURNAL ARTICLE

Advanced Data Imputation Techniques for Predicting Type 2 Diabetes using Machine Learning

Sofia GoelSudhansh Sharma

Year: 2019 Journal:   International Journal of Innovative Technology and Exploring Engineering Vol: 9 (2)Pages: 4142-4149   Publisher: Blue Eyes Intelligence Engineering and Sciences Publication

Abstract

Type 2 Diabetes mellitus is a serious metabolic disorder that is prevailing worldwide at an alarming rate. Medical dataset often suffers from the problem of missing data and outliers. However, handling of missing data with traditional mean based imputing may lead towards a bias model and return unpredictable outcome. Making complex models by combining multiple classifiers as well as some other methods could increase the accuracy which again is a time-consuming approach and requires heavy computation capability which significantly increases the deployment cost. The proposed research is to design a model to classify the data using class wise imputation technique and outlier handling. Performance of the proposed model is evaluated on nine machine learning classifiers and compared with traditional approaches like simple mean, median, and linear regression. Experimental results show the superiority of the proposed model in terms of classification accuracy and model complexity. The accuracy achieved by the proposed approach is 88.01%, which is highest as compared to the previous studies. The proposed research work is presented to improve accuracy, scalability and overall performance of the classification in the medical dataset, which ultimately proves to be a lifesaver if the diagnosis is achieved efficiently at an early stage.

Keywords:
Imputation (statistics) Computer science Missing data Machine learning Artificial intelligence Outlier Scalability Data mining

Metrics

4
Cited By
1.47
FWCI (Field Weighted Citation Impact)
0
Refs
0.88
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Diabetes, Cardiovascular Risks, and Lipoproteins
Health Sciences →  Medicine →  Endocrinology, Diabetes and Metabolism
© 2026 ScienceGate Book Chapters — All rights reserved.