Feature Selection for Machine Learning-based Phishing Websites Detection

Smriti Dangwal; Arghir-Nicolae Moldovan

doi:10.1109/cybersa52016.2021.9478242

ScienceGate Book Chapters

JOURNAL ARTICLE

Feature Selection for Machine Learning-based Phishing Websites Detection

Smriti Dangwal Arghir-Nicolae Moldovan

Year: 2021 Pages: 1-6

DOI: 10.1109/cybersa52016.2021.9478242

Get Full-Text PDF Get Analytical Report

Abstract

Phishing is a social engineering technique that is commonly used to deceive users in an attempt to obtain sensitive information such as username, passwords or credit card details. While there was extensive research on machine learning-based phishing detection, some prior works proposed a large number of features and not all of them are feasible to extract for real-time detection. This work combined two datasets with 30 and 48 features respectively, to identify 18 common features. Moreover, feature selection was conducted to identify 13 optimal features for a more robust model. A comparison with prior research works on the same datasets showed that the best models built on all features using the random forest algorithm scored lower on the 30 feature dataset, and achieved better performance on the 48 features dataset. The best model on the 13 features achieved an accuracy of 0.937.

Keywords:

Phishing Random forest Feature selection Computer science Credit card Password Machine learning Artificial intelligence Feature (linguistics) Feature engineering Selection (genetic algorithm) Data mining Feature extraction Pattern recognition (psychology) Deep learning The Internet World Wide Web Computer security

Metrics

Cited By

2.04

FWCI (Field Weighted Citation Impact)

Refs

0.89

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Sentiment Analysis and Opinion Mining

Physical Sciences → Computer Science → Artificial Intelligence

Feature Selection for Machine Learning-based Phishing Websites Detection

Abstract

Metrics

Citation History

Topics

Related Documents

Modeling Hybrid Feature-Based Phishing Websites Detection Using Machine Learning Techniques

Feature Classification and Extreme Learning Machine Based Detection of Phishing Websites

Feature selections for the machine learning based detection of phishing websites

Detection of phishing websites using an efficient feature-based machine learning framework

Detection of Phishing Websites using an Efficient Feature-Based Machine Learning Framework