JOURNAL ARTICLE

Feature Selection for Machine Learning-based Phishing Websites Detection

Abstract

Phishing is a social engineering technique that is commonly used to deceive users in an attempt to obtain sensitive information such as username, passwords or credit card details. While there was extensive research on machine learning-based phishing detection, some prior works proposed a large number of features and not all of them are feasible to extract for real-time detection. This work combined two datasets with 30 and 48 features respectively, to identify 18 common features. Moreover, feature selection was conducted to identify 13 optimal features for a more robust model. A comparison with prior research works on the same datasets showed that the best models built on all features using the random forest algorithm scored lower on the 30 feature dataset, and achieved better performance on the 48 features dataset. The best model on the 13 features achieved an accuracy of 0.937.

Keywords:
Phishing Random forest Feature selection Computer science Credit card Password Machine learning Artificial intelligence Feature (linguistics) Feature engineering Selection (genetic algorithm) Data mining Feature extraction Pattern recognition (psychology) Deep learning The Internet World Wide Web Computer security

Metrics

9
Cited By
2.04
FWCI (Field Weighted Citation Impact)
24
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Feature Classification and Extreme Learning Machine Based Detection of Phishing Websites

Pallavi M. BhagatSurendra WaghmareManisha WajeRupali B. PatilKavita JoshiMeeta Bakuli

Journal:   International Journal on Recent and Innovation Trends in Computing and Communication Year: 2023 Vol: 11 (8s)Pages: 132-136
JOURNAL ARTICLE

Feature selections for the machine learning based detection of phishing websites

Ebubekir BuberÖnder DemirÖzgür Koray Şahingöz

Journal:   2017 International Artificial Intelligence and Data Processing Symposium (IDAP) Year: 2017 Pages: 1-5
JOURNAL ARTICLE

Detection of phishing websites using an efficient feature-based machine learning framework

Routhu Srinivasa RaoAlwyn Roshan Pais

Journal:   Neural Computing and Applications Year: 2018 Vol: 31 (8)Pages: 3851-3873
JOURNAL ARTICLE

Detection of Phishing Websites using an Efficient Feature-Based Machine Learning Framework

V. RamalingamParas YadavPrakhar Srivastava

Journal:   International Journal of Engineering and Advanced Technology Year: 2020 Vol: 9 (3)Pages: 2857-2862
© 2026 ScienceGate Book Chapters — All rights reserved.