BOOK-CHAPTER

Feature Selection of Microarray Data Using Genetic Algorithms and Artificial Neural Networks

Abstract

Microarrays, which allow for the measurement of thousands of gene expression levels in parallel, have created a wealth of data not previously available to biologists along with new computational challenges. Microarray studies are characterized by a low sample number and a large feature space with many features irrelevant to the problem being studied. This makes feature selection a necessary pre-processing step for many analyses, particularly classification. A Genetic Algorithm -Artificial Neural Network (ANN) wrapper approach is implemented to find the highest scoring set of features for an ANN classifier. Each generation relies on the performance of a set of features trained on an ANN for fitness evaluation. A publically-available leukemia microarray data set (Golub et al., 1999), consisting of 25 AML and 47 ALL Leukemia samples, each with 7129 features, is used to evaluate this approach. Results show an increased performance over Golub's initial findings.

Keywords:
Feature selection Computer science Artificial neural network Classifier (UML) Artificial intelligence Data mining Machine learning Feature (linguistics) Pattern recognition (psychology) Set (abstract data type) Genetic algorithm Selection (genetic algorithm)

Metrics

4
Cited By
0.94
FWCI (Field Weighted Citation Impact)
29
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Evolutionary Algorithms and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.