JOURNAL ARTICLE

An Efficient SVM-Based Feature Selection Model for Cancer Classification Using High-Dimensional Microarray Data

Passent El KafrawyHanaa FathiMohammed QaraadAyda K. KelanyXumin Chen

Year: 2021 Journal:   IEEE Access Vol: 9 Pages: 155353-155369   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Feature selection is critical in analyzing microarray data, which has many features (genes) or dimensions. However, with only a few samples the large search space and time consumed during their selection make selecting relevant and informative genes that improve classification performance a complex task. This paper proposed a hybrid model for gene selection known as (SVM-mRMRe), the proposed model provides a framework for combining filter-based, ensemble, and embedded methods to select the most relevant and informative genes from high-dimensional microarray data by fusing embedded SVM coefficients (features ranking) with ensemble mRMRe. Eight of the most commonly used microarray datasets for various types of cancer were used to evaluate the model. The selected subset feature is evaluated by four different types of classifiers: random forest (RF), multilayer perceptron (MLP), k-nearest neighbors (k-NN), and Support Vector Machine (SVM). The experimental results show that the proposed model reduces time consumption and dimensionality and improves the differentiation of cancer tissues from benign tissues. Furthermore, the selected genes for the brain cancer dataset are biologically interpreted, and it agrees with the findings of relevant biomedical studies and plays an important role in patient prognosis.

Keywords:
Feature selection Support vector machine Computer science Artificial intelligence Pattern recognition (psychology) Selection (genetic algorithm) Data mining Feature (linguistics) Microarray analysis techniques Feature extraction Data modeling Machine learning Database Gene expression Biology

Metrics

53
Cited By
3.12
FWCI (Field Weighted Citation Impact)
41
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
AI in cancer detection
Physical Sciences →  Computer Science →  Artificial Intelligence
Bioinformatics and Genomic Networks
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
© 2026 ScienceGate Book Chapters — All rights reserved.