JOURNAL ARTICLE

Chronic kidney disease prediction using machine learning techniques

Pasi, Ashok KumarMeesala, Sai AryanDoddi, VineshAnde, NandanaChinta, ThilakSoma, Nithin

Year: 2025 Journal:   Zenodo (CERN European Organization for Nuclear Research)   Publisher: European Organization for Nuclear Research

Abstract

In today’s fast-paced world, maintaining health often takes a backseat until visible symptoms arise. Unfortunately, certain diseases, like Chronic Kidney Disease (CKD), develop silently, presenting no noticeable symptoms in the early stages. This delay in detection often leads to severe complications, including kidney failure, cardiovascular disease, or even death. CKD’s silent progression highlights the critical need for proactive and predictive healthcare tools that can identify risks early. Machine Learning (ML) offers a promising solution, capable of analyzing vast amounts of data and predicting potential health risks with high accuracy. In this study, we explored the potential of nine ML techniques for predicting CKD: K-nearest Neighbors (KNN), support vector machines (SVM), logistic regression (LR), Naïve Bayes, Extra Tree Classifiers, AdaBoost, XG Boost, and Light GBM. Using a dataset obtained from Kaggle.com with 14 attributes and 400 records related to CKD, we aimed to identify the most effective model for this task. The attributes included clinical parameters such as blood pressure, specific gravity, albumin, sugar, and more, providing a comprehensive foundation for prediction. Each ML model was meticulously trained and tested, with hyperparameters fine-tuned to achieve optimal performance. Feature scaling and data preprocessing were conducted to ensure the models handled the dataset effectively. Evaluation metrics, including accuracy, precision, recall, F1-score, and ROC-AUC, were used to assess performance. Among the models, LightGBM emerged as the top performer, achieving an impressive accuracy of 99.00%. This model reformed its counterparts due to its ability to handle imbalanced datasets, fast training speed, and exceptional performance in capturing complex patterns.

Keywords:
Logistic regression Kidney disease Random forest Support vector machine Preprocessor Decision tree Regression Feature (linguistics)

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.37
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Geochemistry and Geologic Mapping
Physical Sciences →  Computer Science →  Artificial Intelligence
Geological Modeling and Analysis
Physical Sciences →  Earth and Planetary Sciences →  Geochemistry and Petrology
Electrical and Electromagnetic Research
Physical Sciences →  Physics and Astronomy →  Atomic and Molecular Physics, and Optics

Related Documents

JOURNAL ARTICLE

Chronic kidney disease prediction using machine learning techniques

Ashok Kumar PasiSai Aryan MeesalaVinesh DoddiNandana AndeThilak ChintaNobukazu Soma

Journal:   World Journal of Advanced Research and Reviews Year: 2025 Vol: 25 (2)Pages: 981-989
JOURNAL ARTICLE

Chronic kidney disease prediction using machine learning techniques

Dibaba Adeba DebalTilahun Melak Sitote

Journal:   Journal Of Big Data Year: 2022 Vol: 9 (1)
JOURNAL ARTICLE

Chronic Kidney Disease Prediction Using Machine Learning Techniques

Saurabh Pal

Journal:   Biomedical Materials & Devices Year: 2022 Vol: 1 (1)Pages: 534-540
© 2026 ScienceGate Book Chapters — All rights reserved.