JOURNAL ARTICLE

Hybrid feature selection using micro genetic algorithm on microarray gene expression data

C. PragadeeshRohana JeyarajK. SiranjeeviR. AbishekG. Jeyakumar

Year: 2019 Journal:   Journal of Intelligent & Fuzzy Systems Vol: 36 (3)Pages: 2241-2246   Publisher: IOS Press

Abstract

Research has proved that DNA Microarray data containing gene expression profiles are potentially excellent diagnostic tools in the medical industry. A persistent problem with regard to accessible microarray datasets is that the number of samples are much lesser than the number of features that are present. Thus, in order to extract accurate information from the dataset, one must use a robust technique. Feature selection (FS) has proved to be an effective way by which irrelevant and noisy data can be discarded. In FS, relevant features are picked, and result in commendable classification accuracy. This paper proposes a model that employs a compounded hybrid feature selection technique (Filter + Wrapper) to classify microarray cancer data. Initially, a filter method called Information Gain (IG) to eliminate redundant features that will not contribute significantly to the final classification is used. Following to that, an evolutionary computing technique (micro Genetic Algorithm (mGA)) to find the best minimal subset of required features is employed. Then the features are classified using a traditional Support Vector Classifier and also cross validated to obtain high classification accuracy, using a minimal number of features. The complexity of the model is reduced significantly by adding mGA, as opposed to already existing models that use various other feature selection algorithms.

Keywords:
Feature selection Computer science Classifier (UML) Data mining Pattern recognition (psychology) Gene selection Filter (signal processing) Support vector machine Microarray analysis techniques Artificial intelligence Selection (genetic algorithm) Feature (linguistics) Algorithm Gene Gene expression Biology

Metrics

35
Cited By
2.16
FWCI (Field Weighted Citation Impact)
13
Refs
0.85
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Evolutionary Algorithms and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.