RSKD Ensemble Classifier with Stable Ensemble Feature Selection for High Dimensional Low Sample Size Cancer Datasets

Archana Suhas Vaidya; Dipak V. Patil

doi:10.5815/ijitcs.2025.02.05

ScienceGate Book Chapters

JOURNAL ARTICLE

RSKD Ensemble Classifier with Stable Ensemble Feature Selection for High Dimensional Low Sample Size Cancer Datasets

Archana Suhas Vaidya Dipak V. Patil

Year: 2025 Journal: International Journal of Information Technology and Computer Science Vol: 17 (2)Pages: 49-59

DOI: 10.5815/ijitcs.2025.02.05

Get Full-Text PDF Get Analytical Report

Abstract

This study presents the RSKD ensemble classifier, developed with ensemble feature selection techniques, to address high-dimensional, low-sample-size cancer datasets. Ensemble classifiers are advantageous in such scenarios, offering better classification accuracy than traditional methods by combining multiple models. This combination enhances predictive performance on high-dimensional datasets. However, stability—a key factor for consistent performance on unseen data—often involves a tradeoff with accuracy. Ensemble methods, due to their generalization capabilities, exhibit higher stability, with feature selection stability measured using a consistency index, averaging 65–70%. The RSKD classifier integrates ensemble feature selection methods SU-R and ChS-R, which enhance feature selection stability and classification accuracy. Its performance was evaluated on seven high-dimensional, low-sample-size datasets and compared against state-of-the-art classifiers, including Adaboost, GradientBoost, REPTree, asBagging_FSS, SRKNN, MF-GE, and eAdaBoost with DSC. The RSKD ensemble classifier achieved an accuracy improvement of 7.69% to 12.35% over these methods. Among the feature selection approaches, SU-R combined with RSKD outperformed ChS-R, demonstrating superior results in cancer prediction tasks. The findings of this study underscore the potential of RSKD for achieving generalized, robust performance on challenging datasets. By leveraging ensemble classifiers and ensemble feature selection techniques, researchers can address the inherent difficulties of high-dimensional, low-sample-size datasets, enhancing both accuracy and stability. This work provides a valuable foundation for developing diverse, heterogeneous ensemble approaches for cancer prediction and similar applications.

Keywords:

Computer science Classifier (UML) Feature selection Artificial intelligence Ensemble learning Pattern recognition (psychology) Ensemble forecasting Machine learning

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.11

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Artificial Intelligence in Healthcare

Health Sciences → Health Professions → Health Information Management

AI in cancer detection

Physical Sciences → Computer Science → Artificial Intelligence

Gene expression and cancer classification

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

RSKD Ensemble Classifier with Stable Ensemble Feature Selection for High Dimensional Low Sample Size Cancer Datasets

Abstract

Metrics

Topics

Related Documents

Ensemble feature selection in high dimension, low sample size datasets: Parallel and serial combination approaches

Intelligence ensemble feature selection and ensemble classifier for cervical cancer diagnosis

Ensemble feature selection and deep learning ensemble classifier for cervical cancer diagnosis

Intelligence Ensemble Feature Selection (IEFS) and Ensemble Classifier for Cervical Cancer Diagnosis

A GA-based feature selection and ensemble learning for high-dimensional datasets