All around the world, the most common type of cancer diagnosed in women is breast cancer. This type of cancer start occurring in glandular tissue which is called lobules or else other parts of breast tissues. It is very important to detect cancer as early as possible. Tumours are of two types cancerous and non-cancerous commonly known as malignant and benign. In this paper, the Wisconsin Breast cancer data set has been used. It is a tabular form of data set. The prime goal is to visualize the data that we have and then select the best features. After getting all the best features will apply all the machine learning algorithms like KNN, Random Forest, Decision Tree, Naive Bayes, SVM, Logistic regression, and XG boost. Classifiers can help us to build a system that will help to detect breast cancer soon in women. XG Boost algorithm outperforms the other algorithms on our selected feature. It gives an accuracy of 98.75%.
P. DhivyaA. BazilabanuPonniah Thirumalaikolundusubramanian
Shaik Shabana BegumSoma BeraDebasis ChakrabortyRam Sarkar
Rajalakshmi KrishnamurthiNiyati AggrawalLokendra Kumar SharmaDiva SrivastavaShivangi Sharma
Praveen Kumar MisraG. Meena DeviY. CruzJuan Carlos Cotrina-AliagaPratik ChatterjeeDurgaprasad Gangodkar