JOURNAL ARTICLE

Importance of Feature Selection and Data Visualization Towards Prediction of Breast Cancer

Abstract

Background: Breast cancer is one of the most common forms of cancers among women and the leading cause of death among them. Countries like United States, England and Canada have reported a high number of breast cancer patients every year and this number is continuously increasing due to detection at later stages. Hence, it is very important to create awareness among women and develop such algorithms which help to detect malignant cancer. Several research studies have been conducted to analyze the breast cancer data. Objective: This paper presents an effective method in predicting breast cancer and its stage and will also analyze the performance of different supervised learning algorithms such as Random Classifier, Chi2 Square test used in order to predict. The paper focuses on the three important aspects such as the feature selection, the corresponding data visualisation and finally making a prediction call on different machine learning models. Methods: The dataset used for this work is breast cancer Wisconsin data taken from UCI library. The dataset has been used to show the different 32 features which are all important and how it can be achieved using data visualisation. Secondly, after the feature selection, different machine learning models have been applied. Conclusion: The machine learning models involved are namely Support Vector Machine (SVM), KNearest Neighbour (KNN), Random Forest, Principal Component Analysis (PCA), Neural Network using Perceptron (NNP). This has been done to check which type of model is better under what conditions. At different stages several charts have been plotted and eliminated based on relative comparison. Results have shown that Random Tree classifier along with Chi2 Square proves to be an efficient one.

Keywords:
Computer science Machine learning Random forest Feature selection Artificial intelligence Support vector machine Breast cancer Multilayer perceptron Artificial neural network Classifier (UML) Visualization Perceptron Principal component analysis Data mining Pattern recognition (psychology) Cancer Medicine

Metrics

5
Cited By
0.61
FWCI (Field Weighted Citation Impact)
24
Refs
0.74
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

AI in cancer detection
Physical Sciences →  Computer Science →  Artificial Intelligence
Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management

Related Documents

BOOK-CHAPTER

Breast Cancer Prediction: Importance of Feature Selection

Prateek Prateek

Advances in intelligent systems and computing Year: 2019 Pages: 733-742
JOURNAL ARTICLE

Feature Selection based Breast Cancer Prediction

Rakibul HasanAamir Shafi

Journal:   International Journal of Image Graphics and Signal Processing Year: 2023 Vol: 15 (2)Pages: 13-23
JOURNAL ARTICLE

Breast Cancer Prediction System using Feature Selection and Data Mining Methods

Gayathri Devi.S

Journal:   International Journal of Advanced Research in Computer Science Year: 2011 Vol: 2 (1)Pages: 81-87
JOURNAL ARTICLE

Breast Cancer Prediction Feature Selection Using ML Algorithms

Lin Jiang

Journal:   Journal of Physics Conference Series Year: 2023 Vol: 2547 (1)Pages: 012021-012021
© 2026 ScienceGate Book Chapters — All rights reserved.