FEATURE SELECTION USING EXTRA TREES CLASSIFIER FOR PARKINSON’S DISEASE CLASSIFICATION

Gauri Sabherwal

doi:10.26782/jmcms.spl.11/2024.05.00010

ScienceGate Book Chapters

JOURNAL ARTICLE

FEATURE SELECTION USING EXTRA TREES CLASSIFIER FOR PARKINSON’S DISEASE CLASSIFICATION

Gauri Sabherwal

Year: 2024 Journal: JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES Vol: spl11 (1) Publisher: Institute of Mechanics of Continua and Mathematical Sciences

DOI: 10.26782/jmcms.spl.11/2024.05.00010

Get Full-Text PDF Get Analytical Report

Abstract

Parkinson's disease (PD) is chronic, permanent, and life-threatening. Neurologically protective treatments for PD rely on early detection. Recent studies have demonstrated that clinical data, cerebrospinal Fluid (CSF) based proteomes, and gene mutations are important biomarkers for accurate and early detection of PD. This study aims to investigate the heterogeneous data comprised of CSF-based clinical data, CSF-based proteomic analysis data as well as the mutation information of the genes, Glucose Beta Acid (GBA), leucine-rich kinase (LRRK2) to classify controls into PD-affected and Healthy Control (HC). The dataset contains 1103 controls (569 PD affected and 534 HC). Automated Machine Learning (AutoML) framework using PyCaret is utilized. The study has proposed an Extra Tree Classifier (ETC) as a feature selection mechanism to select features that significantly affect the PD classification. Selected features are further used to train Random Forest (RF), Logistic Regression (LR), and Decision Tree (DT) classifiers. Accuracy, sensitivity, specificity, area under the receiver operating characteristic curve (AUC-ROC), and the confusion matrix are used to evaluate the performance of classifiers. RF has depicted the best performance in terms of accuracy value of 96.12%, sensitivity of 95.59%, and specificity of 95.34% while LR has shown the highest AUC value of 98.33. RF has made the highest number of correct predictions 316 out of 331.

Keywords:

Random forest Feature selection Artificial intelligence Receiver operating characteristic Decision tree Classifier (UML) Logistic regression LRRK2 Pattern recognition (psychology) Machine learning Confusion matrix Computer science Disease Parkinson's disease Medicine Internal medicine

Metrics

Cited By

0.77

FWCI (Field Weighted Citation Impact)

Refs

0.62

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Parkinson's Disease Mechanisms and Treatments

Health Sciences → Medicine → Neurology

Neurological disorders and treatments

Health Sciences → Medicine → Neurology

FEATURE SELECTION USING EXTRA TREES CLASSIFIER FOR PARKINSON’S DISEASE CLASSIFICATION

Abstract

Metrics

Citation History

Topics

Related Documents

Leukocyte classification based on feature selection using extra trees classifier: a transfer learning approach

Feature Selection Using Extra Trees Classifier for Research Productivity Framework in Indonesia

Parkinson’s Disease Classification Using Random Forest Kerb Feature Selection

Parkinson’s disease classification using nature inspired feature selection and recursive feature elimination

Implementation of Extra Trees Classifier and Chi-Square Feature Selection for Early Detection of Liver Disease