JOURNAL ARTICLE

Imputation Techniques and Recursive Feature Elimination in Machine Learning Applied to Type II Diabetes Classification

Abstract

Type II diabetes is a chronic metabolic disease secondary to elevated blood glucose levels. Complications of this disease include heart attack, stroke, blindness, renal failure, lower limb amputation and mortality. Due to its rising prevalence and consequent mortality, it is important to identify at an early stage those patients at high risk of developing diabetes. We applied 8 machine learning techniques namely: support vector machine, logistic regression, k-nearest neighbor, naïve Bayes, decision tree, random forest, AdaBoost and XGBoost in predicting diabetes using a publicly available diabetes dataset. In our study, Naïve Bayes with median imputation and recursive feature elimination obtained the highest performance with an accuracy rate of 81.0%. Although the results are very promising, one major limitation in this study is the small number of samples in the dataset. Early accurate detection can help patients to proactively monitor their lifestyle habits mitigating the risks of complications of uncontrolled diabetes.

Keywords:
Naive Bayes classifier Random forest Decision tree Logistic regression Diabetes mellitus Artificial intelligence Computer science Support vector machine Machine learning Blindness Medicine Optometry

Metrics

10
Cited By
2.21
FWCI (Field Weighted Citation Impact)
24
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management
Traditional Chinese Medicine Studies
Health Sciences →  Medicine →  Complementary and alternative medicine
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Cardiovascular disease prediction with imputation techniques and recursive feature elimination

Vincent Peter C. MagbooMa. Sheila A. Magboo

Journal:   AIP conference proceedings Year: 2023 Vol: 2683 Pages: 030016-030016
JOURNAL ARTICLE

Advanced Data Imputation Techniques for Predicting Type 2 Diabetes using Machine Learning

Sofia GoelSudhansh Sharma

Journal:   International Journal of Innovative Technology and Exploring Engineering Year: 2019 Vol: 9 (2)Pages: 4142-4149
JOURNAL ARTICLE

Classification of Type 2 Diabetes Using Machine Learning Techniques

Ziynet PamukCeren Kaya

Journal:   European Journal of Science and Technology Year: 2021
© 2026 ScienceGate Book Chapters — All rights reserved.