JOURNAL ARTICLE

Improving Breast Cancer Diagnosis Using Grammatical Evolution-Based Feature Selection

Abstract

Abstract Machine learning has significantly advanced breast cancer diagnosis, yet challenges such as high-dimensional data, severe class imbalance, and limited interpretability persist. To address these issues, we proposed a Grammatical Evolution (GE)-based Feature Selection (FS) approach, integrated with a class-balancing technique called STEM, which combines Synthetic Minority Oversampling Technique, Edited Nearest Neighbour and Mixup, effectively handling both inter-class and intra-class imbalance. Our study evaluates the performance of the GE-based FS method against other FS models, including Logistic Regression (LR) and Extreme Gradient Boosting (XGBoost), in identifying critical features for breast cancer diagnosis. The results demonstrate that the GE-based FS method effectively identifies critical features and achieves superior Area Under the Curve (AUC) scores, particularly with smaller subsets of features, unlike LR and XGBoost, which perform optimally with the full feature set. The analysis was conducted on the Digital Database for Screening Mammography and Wisconsin Breast Cancer datasets, which originally contained 52 and 30 features, respectively. The GE-based FS produces the highest AUC with subsets of 10 and 15 features, while LR and XGBoost achieve their best results using the entire feature set, underscoring the superiority of the GE-based FS method.

Keywords:
Selection (genetic algorithm) Feature (linguistics) Feature selection Cancer Artificial intelligence Computer science Grammatical evolution Breast cancer Natural language processing Linguistics Medicine Internal medicine Philosophy

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
43
Refs
0.03
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

AI in cancer detection
Physical Sciences →  Computer Science →  Artificial Intelligence
Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology

Related Documents

DISSERTATION

Interpretable breast cancer diagnosis using grammatical evolution

Hasan, Yumnah

University:   University of Limerick Institutional Repository (University of Limerick) Year: 2025
JOURNAL ARTICLE

Breast cancer diagnosis using feature selection techniques

Sabrine TounsiImen KallelMohamed Kallel

Journal:   2022 2nd International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET) Year: 2022 Pages: 1-5
JOURNAL ARTICLE

Breast cancer diagnosis improvement using feature selection

Wahyuni, Elvira SukmaSetiawan, Noor AkhmadNugroho, Hanung Adi

Journal:   Zenodo (CERN European Organization for Nuclear Research) Year: 2014
© 2026 ScienceGate Book Chapters — All rights reserved.