JOURNAL ARTICLE

Genetic Clustering Algorithm-Based Feature Selection and Divergent Random Forest for Multiclass Cancer Classification Using Gene Expression Data

L. Senbagamalar, S. Logeswari

Year: 2024 Journal: International Journal of Computational Intelligence Systems Vol: 17 (1) Publisher: Springer Nature

Abstract

Computational identification and classification of clinical disorders have gained importance as machine learning methods have improved. Cancer identification and classification are essential clinical problems, and accurate classification across multiple cancer types is still an open area of work. In this article, we propose a multiclass cancer classification model that categorizes five types of cancer using gene expression data. To analyze the available clinical data efficiently, we propose both a feature selection method and a classification method. We propose a genetic clustering algorithm (GCA) for optimal feature selection from RNA gene expression data consisting of 801 samples belonging to five major classes of cancer. The proposed feature selection method reduces the 1621 gene expressions to a cluster of 21 features. This optimum feature set is the input to the proposed divergent random forest. Based on these features, the proposed classifier assigns each sample to one of five cancer classes: breast, colon, kidney, lung, and prostate cancer. The proposed divergent random forest achieved an accuracy of 95.21%, a specificity of 93%, and a sensitivity of 94.29%, outperforming the other existing multiclass classification algorithms.
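The abstract reports accuracy, specificity, and sensitivity for a five-class problem but does not say how the per-class values were combined. The sketch below shows one common convention, assumed here rather than taken from the article: each class is scored one-vs-rest from the confusion matrix and the per-class values are macro-averaged. The function name `one_vs_rest_metrics` and the toy counts are illustrative only and are not the paper's data or results.

```python
def one_vs_rest_metrics(confusion):
    """Return (accuracy, macro_sensitivity, macro_specificity).

    confusion[i][j] is the number of samples whose true class is i
    and whose predicted class is j. Macro-averaging is an assumption;
    the article may aggregate its metrics differently.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(n))

    sensitivities, specificities = [], []
    for k in range(n):
        tp = confusion[k][k]                              # class k predicted as k
        fn = sum(confusion[k]) - tp                       # class k predicted as something else
        fp = sum(confusion[i][k] for i in range(n)) - tp  # other classes predicted as k
        tn = total - tp - fn - fp                         # everything not involving k
        sensitivities.append(tp / (tp + fn) if tp + fn else 0.0)
        specificities.append(tn / (tn + fp) if tn + fp else 0.0)

    return correct / total, sum(sensitivities) / n, sum(specificities) / n


# Toy 3-class confusion matrix with made-up counts (not from the article).
toy = [
    [8, 1, 1],
    [2, 7, 1],
    [0, 2, 8],
]
accuracy, sensitivity, specificity = one_vs_rest_metrics(toy)
print(f"accuracy={accuracy:.4f} sensitivity={sensitivity:.4f} specificity={specificity:.4f}")
```

The same function applies unchanged to a 5x5 confusion matrix such as the breast, colon, kidney, lung, and prostate setting described above. With one-vs-rest scoring, specificity is usually higher than sensitivity in multiclass problems because the true negatives for each class pool all the other classes.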

Keywords:
Random forest, Feature selection, Cluster analysis, Artificial intelligence, Gene selection, Pattern recognition (psychology), Computer science, Feature (linguistics), Selection (genetic algorithm), Data mining, Machine learning, Gene, Gene expression, Biology, Genetics, Microarray analysis techniques

Metrics

Cited By: 12
FWCI (Field Weighted Citation Impact): 5.76
Refs: 28
Citation Normalized Percentile: 0.92

Topics

Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Machine Learning and ELM
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Gene expression data classification using genetic algorithm-based feature selection

Öznur Sinem SÖNMEZ, Mustafa Dağtekin, Tolga Ensarı

Journal: TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES Year: 2021 Vol: 29 (7) Pages: 3165-3179

JOURNAL ARTICLE

An Integrated Feature Selection Algorithm for Cancer Classification using Gene Expression Data

Saeed Ahmed, Md. Mohsin Kabir, Zakir Ali, Muhammad Arif, Farman Ali, Dong‐Jun Yu

Journal: Combinatorial Chemistry & High Throughput Screening Year: 2018 Vol: 21 (9) Pages: 631-645

JOURNAL ARTICLE

Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Malihe Ram, Ali Najafi, Mohammad Taghi Shakeri

Journal: Iranian journal of pathology Year: 2017 Vol: 12 (4) Pages: 339-347

JOURNAL ARTICLE

Multiclass Classification for Large Medical Data using Adaptive Random Forest and Improved Feature Selection Methods

Matsa Ram, G. V. Suresh, Narasimha Swamy Biyappu

Journal: 2022 12th International Conference on Cloud Computing, Data Science & Engineering (Confluence) Year: 2022 Pages: 98-105