Stroke is an acute cerebrovascular disease with high morbidity, mortality, and disability rate. The survivors could suffer from long-term motor impairment. Due to the persistent lack of effective treatments, prevention is currently considered the best measure. In this study, two objectives are investigated. The former attempts to analyze which characteristics would cause people prone to have a stroke. The latter explores machine learning algorithms with satisfactory performances for stroke risk prediction. To achieve this goal, five machine learning algorithms are validated, including K-Nearest Neighbors (KNN), Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), and XGBoost (XGB). Afterward, an ensemble learning algorithm, namely weighed voting algorithm is implemented to further improve performances. Various metrics are leveraged to comprehensively evaluate the results, including accuracy, precision, recall, F1-score, and the Area Under Curve (AUC). Finally, an acceptable result is achieved with an AUC of 0.711 and a recall of 57.9%. This work demonstrates that the ensemble learning method could improve performances and be further exploited as a reliable classifier for stroke prediction.
Rishabh GurjarH K SahanaC NeelambikaSparsha B SathishS Ramys
Nikolaos ZafeiropoulosArgyro MavrogiorgouSpyridon KleftakisKonstantinos MavrogiorgosAthanasios KiourtisDimosthenis Kyriazis
Kandra HarshithaP HarshithaGunjan GuptaP. VaishakK. Prajna
Nayab KanwalSabeen JavaidDhita Diana Dewi
Asif RahmanFaisal RahmanAnharul IslamIfrat JahanK. M. A. Salam