JOURNAL ARTICLE

Diabetes Early Prediction Using Machine Learning and Ensemble Methods

Hyung-Ho HaH. Jin KimYoung Hyun YuHyun Sub Sim

Year: 2025 Journal:   International Journal on Advanced Science Engineering and Information Technology Vol: 15 (2)Pages: 363-375   Publisher: Insight Society

Abstract

This study aims to develop and validate an enhanced early prediction model for diabetes utilizing machine learning and ensemble techniques, aimed at addressing the rapid increase in diabetes prevalence and the associated healthcare burden. Leveraging diverse datasets, including the Pima Indian Diabetes Dataset, electronic health records from local hospitals, and wearable device data, this research employs a variety of innovative methods. Generative Adversarial Networks (GAN) are used for data augmentation to address class imbalances, while SHAP (Shapley Additive exPlanations) provides interpretability for machine learning predictions, enhancing trust and understanding in clinical applications. The methodology integrates several machine learning algorithms—Support Vector Machine (SVM), Random Forest, XGBoost, Artificial Neural Networks (ANN), Convolutional Neural Networks (CNN), and Long Short-Term Memory (LSTM) networks—comparing their efficacy in diabetes prediction. Ensemble methods further refine the predictive accuracy, reliability, and applicability of the models. The study evaluates these models based on standard performance metrics such as accuracy, precision, recall, and F1-score across different configurations and combined approaches. Results indicate that ensemble methods significantly enhance predictive performance, achieving higher accuracy and precision compared to individual models. Particularly, the integration of deep learning techniques with traditional machine learning models provides substantial improvements in detecting early signs of Type 1 and Type 2 diabetes, utilizing insights from insulin and C-peptide data. The application of XAI techniques like SHAP not only clarifies model decisions but also assists in tailoring interventions and management strategies in clinical setting.

Keywords:
Ensemble learning Computer science Machine learning Artificial intelligence

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.16
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management

Related Documents

JOURNAL ARTICLE

Early Stage Diabetes Prediction Using Machine Learning Methods

Özge Nur ERGÜNHamza Osman İlhan

Journal:   European Journal of Science and Technology Year: 2021
JOURNAL ARTICLE

Diabetes Prediction Using Machine Learning Ensemble Model

Ong Yee HangWiwied VirgiyantiRosly Rosaida

Journal:   Journal of Advanced Research in Applied Sciences and Engineering Technology Year: 2024 Vol: 37 (1)Pages: 82-98
JOURNAL ARTICLE

Early Prediction of Diabetes Using an Ensemble of Machine Learning Models

Aishwariya DuttaMd. Kamrul HasanMohiuddin AhmadMd. Abdul AwalMd. Akhtarul IslamMehedi MasudHossam Meshref

Journal:   International Journal of Environmental Research and Public Health Year: 2022 Vol: 19 (19)Pages: 12378-12378
JOURNAL ARTICLE

Diabetes Prediction Using Machine Learning Analytics: Ensemble Learning Techniques

Deeksha TripathiSaroj Kr. BiswasS. ReshmiArpita Nath BoruahBiswajit Purkayastha

Journal:   2022 2nd Asian Conference on Innovation in Technology (ASIANCON) Year: 2022 Pages: 1-7
© 2026 ScienceGate Book Chapters — All rights reserved.